Sat.May 30, 2020 - Fri.Jun 05, 2020

article thumbnail

A proven approach to land a Data Engineering job

Start Data Engineering

I have seen and been asked the following questions by students, backend engineers and analysts who want to get into the data engineering industry. What approach should i take to land a Data Engineering job? I really want to get into DE. What can I do to learn more about it? In this article, I will try to provide a general approach that you as a beginner, student, backend engineer or analyst can use to land your first data engineering job.

article thumbnail

Stream Processing with IoT Data: Challenges, Best Practices, and Techniques

Confluent

The rise of IoT devices means that we have to collect, process, and analyze orders of magnitude more data than ever before. As sensors and devices become ever more ubiquitous, […].

Process 126
article thumbnail

Building A Data Lake For The Database Administrator At Upsolver

Data Engineering Podcast

Summary Data lakes offer a great deal of flexibility and the potential for reduced cost for your analytics, but they also introduce a great deal of complexity. What used to be entirely managed by the database engine is now a composition of multiple systems that need to be properly configured to work in concert. In order to bring the DBA into the new era of data management the team at Upsolver added a SQL interface to their data lake platform.

Data Lake 100
article thumbnail

Using Data to Fight COVID-19 Supply Chain Disruption

Teradata

Excerpted & editorialized interview of Dr. Hani Mahmassani of Northwestern University and Stephen Brobst, CTO of Teradata, and their discussion of how companies are using real-time data for scenario crunching, such as supply chain risk assessment.

Data 93
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Thank You

Start Data Engineering

Thank you for contacting us. We will get back to you shortly.

100
100
article thumbnail

Real-Time Fleet Management Using Confluent Cloud and MongoDB

Confluent

Most organisations maintain fleets, a collection of vehicles put to use for day-to-day operations. Telcos use a variety of vehicles including cars, vans, and trucks for service, delivery, and maintenance. […].

MongoDB 108

More Trending

article thumbnail

Legacy or Modern? Why not Both!

Teradata

Modern architectures as they are presented today are being used as a wedge to force people to abandon their current solutions and brand those as legacy, but is being a legacy a bad thing?

article thumbnail

Akka HTTP to Heroku in 10 Minutes

Rock the JVM

Easily deploy your first Akka HTTP service to Heroku in minutes

52
article thumbnail

Confluent CLI 1.0 is Now Generally Available for Cloud and Platform

Confluent

Over a year ago, Confluent set out on a mission to improve user experience by empowering developers, operators, and architects with intuitive command line interfaces (CLIs) for managing their Confluent […].

Cloud 82
article thumbnail

Top 10 sessions for MongoDB.live 2020

Rockset

MongoDB World is going all virtual with MongoDB.live. Registration is free and there’s tons of content to get excited about. It’s so easy to get overwhelmed on what to pick (heck, you could just watch all of them)! If you’re short on time, fear not- here are our top 10 MongoDB sessions to watch out for: 10 Join the Data Movement: MongoDB and Apache Kafka One of the go-to picks for companies that need a streaming platform is Apache Kafka.

MongoDB 52
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Evaluating the True Cost of Pricing Analytics

Teradata

Tom Casey attempts to simplify the concept of consumption-based pricing for analytics, making the case that companies can benefit from a multi-genre approach.

59
article thumbnail

An Introduction to Monads in Scala

Rock the JVM

A Scala tutorial on Monads that starts with practical needs and builds up from scratch: derive the monad patterns (laws) with no assumptions

Scala 52
article thumbnail

Consistent Metastore Recovery for ksqlDB Using Apache Kafka Transactions

Confluent

This is the second of a series of posts that dive deep into key improvements made to ksqlDB to prepare for production availability in Confluent Cloud. This post assumes familiarity […].

Kafka 75
article thumbnail

Remote Compactions in RocksDB-Cloud

Rockset

Introduction RocksDB is an LSM storage engine whose growth has proliferated tremendously in the last few years. RocksDB-Cloud is open-source and is fully compatible with RocksDB, with the additional feature that all data is made durable by automatically storing it in cloud storage (e.g. Amazon S3). We, at Rockset, use RocksDB-Cloud as one of the building blocks of Rockset’s distributed Converged Index.

Cloud 40
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Pricing Models for Analytics

Teradata

Tom Casey attempts to simplify the concept of consumption-based pricing for analytics, making the case that companies can benefit from a multi-genre approach.

52
article thumbnail

Getting Started - Connect Superset To Google Sheets

Preset

This tutorial shows you how to connect your local deployment of Apache Superset with Google Sheets, so you can query any publicly available Google Sheet.

40