Top Data Engineering Digest MongoDB Amazon Web Services Content for Week of May 09

Sat.May 09, 2020 - Fri.May 15, 2020

Change Data Capture Using Debezium Kafka and Pg

Start Data Engineering

MAY 9, 2020

Change data capture is a software design pattern used to capture changes to data and take corresponding action based on that change. The change to data is usually one of read, update or delete. The corresponding action usually is supposed to occur in another system in response to the change that was made in the source system.

Kafka

Kafka Data Designing Systems

Apache Kafka Needs No Keeper: Removing the Apache ZooKeeper Dependency

Confluent

MAY 15, 2020

Currently, Apache Kafka® uses Apache ZooKeeper™ to store its metadata. Data such as the location of partitions and the configuration of topics are stored outside of Kafka itself, in a […].

Kafka

Kafka Metadata IT Project

Join 37,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

How to Modernize Manufacturing Without Losing Control

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Drafting Your Data Pipelines

Team Data Science

MAY 10, 2020

With careful consideration and learning about your market, the choices you need to make become narrower and more clear. I can now begin drafting my data ingestion/ streaming pipeline without being overwhelmed. For A Quick Recap You can find the first blog post here, where I learned which tech is most in demand in Toronto: [link] And the second blog post is here where I learn which Toronto industries need data engineers the most: [link] The Pipeline Proposal I'll be creating several pipelines in

Data Pipeline

Data Pipeline Data Ingestion AWS Kafka

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

How to Modernize Manufacturing Without Losing Control

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

COVID-19: The Perfect Storm

Teradata

MAY 13, 2020

The COVID-19 pandemic has brought with it a Perfect Storm of disruption that impacts all of us -- from our health to the economy to the supply chain. Read more.

A Guide to Debugging Apache Airflow® DAGs

In Airflow, DAGs (your data pipelines) support nearly every use case. As these workflows grow in complexity and scale, efficiently identifying and resolving issues becomes a critical skill for every data engineer. This is a comprehensive guide with best practices and examples to debugging Airflow DAGs. You’ll learn how to: Create a standardized process for debugging to quickly diagnose errors in your DAGs Identify common issues with DAGs, tasks, and connections Distinguish between Airflow-relate

Data Pipeline

StreamNative Brings Streaming Data To The Cloud Native Landscape With Pulsar

Data Engineering Podcast

MAY 11, 2020

Summary There have been several generations of platforms for managing streaming data, each with their own strengths and weaknesses, and different areas of focus. Pulsar is one of the recent entrants which has quickly gained adoption and an impressive set of capabilities. In this episode Sijie Guo discusses his motivations for spending so much of his time and energy on contributing to the project and growing the community.

Cloud

Cloud Lambda Architecture Kafka Hadoop

Building a Telegram Bot Powered by Apache Kafka and ksqlDB

Confluent

MAY 12, 2020

Imagine you’ve got a stream of data; it’s not “big data,” but it’s certainly a lot. Within the data, you’ve got some bits you’re interested in, and of those bits, […].

Kafka

Kafka Building Big Data MongoDB

Create Your Own Custom String Interpolator

Rock the JVM

MAY 11, 2020

Discover how to create your own custom string interpolator that feels like a native feature of Scala

Scala

More Trending

Create Your Own Custom String Interpolator

Rock the JVM

MAY 11, 2020

Discover how to create your own custom string interpolator that feels like a native feature of Scala

Scala

How China is Using Advanced Analytics During the COVID-19 Pandemic

Teradata

MAY 11, 2020

Learn how advanced analytics are being used in China amidst the COVID-19 pandemic to help combat the spread of coronavirus now and in the future.

Getting Started - Installing Apache Superset

Preset

MAY 10, 2020

Setting up Apache Superset for the first time can be difficult. Here's an easy way to accomplish the task!

From Eager to Smarter in Apache Kafka Consumer Rebalances

Confluent

MAY 11, 2020

Everyone wants their infrastructure to be highly available, and ksqlDB is no different. But crucial properties like high availability don’t come without a thoughtful, rigorous design. We thought hard about […].

Kafka

Kafka Designing Process

How Akka Typed Incentivizes Writing Good Code

Rock the JVM

MAY 10, 2020

Explore how Akka Typed integrates good practices directly into the API

Coding

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

Data

Emulate Your Heroes with Data… and Vantage on AWS

Teradata

MAY 12, 2020

Teradata's top-ranked analytic software capabilities that market-leading companies have been using for years is available right now on Amazon Web Services (AWS). Learn more.

AWS

AWS Amazon Web Services Data

Announcing ksqlDB 0.9.0

Confluent

MAY 13, 2020

We’re pleased to announce the release of ksqlDB 0.9.0! This version includes support for multi-join statements, enhanced LIKE expressions, and a host of usability improvements. We’ll go through a few […].

Process

Sat.May 09, 2020 - Fri.May 15, 2020

Change Data Capture Using Debezium Kafka and Pg

Apache Kafka Needs No Keeper: Removing the Apache ZooKeeper Dependency

Webinars

Trending Sources

Drafting Your Data Pipelines

Webinars

COVID-19: The Perfect Storm

A Guide to Debugging Apache Airflow® DAGs

StreamNative Brings Streaming Data To The Cloud Native Landscape With Pulsar

Building a Telegram Bot Powered by Apache Kafka and ksqlDB

Create Your Own Custom String Interpolator

Sign up to get articles personalized to your interests!

More Trending

Create Your Own Custom String Interpolator

How China is Using Advanced Analytics During the COVID-19 Pandemic

Getting Started - Installing Apache Superset

From Eager to Smarter in Apache Kafka Consumer Rebalances

How Akka Typed Incentivizes Writing Good Code

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

Emulate Your Heroes with Data… and Vantage on AWS

Announcing ksqlDB 0.9.0

Stay Connected