He’s solved interesting engineering challenges along the way, too – like building observability for Amazon’s EC2 offering, and being one of the first engineers on Uber’s observability platform. I wrote code for drivers on Windows, and started to put a basic observability system in place.
Traditional relational database systems are ubiquitous in software systems. The database system guarantees that multiple concurrent transactions will appear to the user to be executed one after the other. Upholding each property in a system based on Kafka is tricky but not impossible, as you are about to find out.
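The excerpt refers to the transaction guarantees (presumably the ACID properties) that a relational database provides out of the box. The source doesn't show code, but as a rough sketch of how atomicity can be approached on Kafka, the transactional producer API groups writes so that they become visible together or not at all. The broker address, topic names, and keys below are illustrative assumptions, not from the article.

```java
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class TransactionalWriter {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");   // illustrative broker address
        props.put("key.serializer",
                  "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer",
                  "org.apache.kafka.common.serialization.StringSerializer");
        props.put("transactional.id", "order-writer-1");     // required to use transactions
        props.put("enable.idempotence", "true");

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            producer.initTransactions();
            producer.beginTransaction();
            try {
                // Both writes become visible to read_committed consumers together, or not at all.
                producer.send(new ProducerRecord<>("orders", "order-42", "created"));
                producer.send(new ProducerRecord<>("order-audit", "order-42", "created"));
                producer.commitTransaction();
            } catch (RuntimeException e) {
                producer.abortTransaction();                  // roll back the partial write
                throw e;
            }
        }
    }
}
```

Consumers only see this atomically if they read with isolation.level=read_committed; the other properties need their own treatment, which is the point the article goes on to make.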
The world we live in today presents larger datasets, more complex data, and diverse needs, all of which call for efficient, scalable data systems. These systems are built on open standards and offer immense analytical and transactional processing flexibility. 2019 - Delta Lake: Databricks released Delta Lake as an open-source project.
Fall 2019. By Tom Richards, Carenina Garcia Motion, and Leslie Posada. Hack Day at Netflix is an opportunity to build and show off a feature, tool, or quirky app. November 2019 was originally published in the Netflix TechBlog on Medium, where people are continuing the conversation by highlighting and responding to this story.
If you had a continuous deployment system up and running around 2010, you were ahead of the pack; today it is considered strange if your team does not have this for things like web applications. He then worked at the casual games company Zynga, building their in-game advertising platform.
Datadog is a leading observability tooling provider which went public in 2019, with a current market cap of $28B. A very popular open-source solution for systems and services monitoring. A fast and open-source column-oriented database management system, which is a popular choice for log management. But why is this?
A first, smaller wave of these stories included Magic.dev raising $100M in funding from Nat Friedman (CEO of GitHub from 2018 to 2021) and Daniel Gross (cofounder of the search engine Cue, which Apple acquired in 2013) to build a “superhuman software engineer.” AI dev tool startups need outlandish claims to grab attention.
This is the most significant milestone yet for this project, which began in earnest after Mark Zuckerberg outlined his vision for it in 2019. Neither WhatsApp nor Secret Conversations operated in this manner, and we didn’t want all users to have to rely on a device-side storage system.
She recounted a number of lessons Confluent has learned in building Confluent Cloud, and announced the availability of several new features in the cloud service. We rightly spend a lot of time trying to figure out how to build things, so it was good to step back and see how our engineering work can drive internal cultural change as well.
4:45pm-5:45pm NFX 202 A day in the life of a Netflix Engineer Dave Hahn , SRE Engineering Manager Abstract : Netflix is a large, ever-changing ecosystem serving millions of customers across the globe through cloud-based systems and a globally distributed CDN. In 2019, Netflix moved thousands of container hosts to bare metal.
This blog post focuses on the scope and the goals of the recommendation system, and explores some of the most recent changes the Rider team has made to better serve Lyft’s riders. Introduction: Scope of the Recommendation System The recommendation system covers user experiences throughout the ride journey.
To build the kinds of systems we are being called upon to build these days, we need infrastructure that gives equal priority to events and state together. And a couple of fantastic keynotes: Jay Kreps (CEO of Confluent and co-creator of Apache Kafka®) kept the unifying vision of the event streaming platform in front of us.
This created an opportunity to build job sites which collect this data, make it easy to browse, and allow job seekers to apply to jobs paying at or above a certain level. For AI, we’ve built a system to efficiently use GPT-4 for this purpose, including auto-crafting prompts and performing pre- and post-processing.
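The post doesn't show its pipeline, so the following is only a loose sketch of the pattern it describes: craft a prompt around the raw text, call OpenAI's chat completions endpoint, and post-process the response. The class, the prompt wording, and the extraction schema are hypothetical; only the endpoint and request shape come from OpenAI's public API.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class SalaryExtractor {
    // Hypothetical prompt "auto-crafting": wrap the raw job posting in extraction instructions.
    static String buildPrompt(String jobPosting) {
        return "Extract the salary range from the job posting below as JSON "
             + "{\"min\": number, \"max\": number, \"currency\": string}.\n\n" + jobPosting;
    }

    public static void main(String[] args) throws Exception {
        String body = """
            {"model": "gpt-4",
             "messages": [{"role": "user", "content": %s}]}
            """.formatted(jsonString(buildPrompt("Senior engineer, $150,000-$180,000, remote")));

        HttpRequest request = HttpRequest.newBuilder()
            .uri(URI.create("https://api.openai.com/v1/chat/completions"))
            .header("Authorization", "Bearer " + System.getenv("OPENAI_API_KEY"))
            .header("Content-Type", "application/json")
            .POST(HttpRequest.BodyPublishers.ofString(body))
            .build();

        HttpResponse<String> response =
            HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString());
        // Post-processing (JSON parsing, validation, normalization) would go here.
        System.out.println(response.body());
    }

    // Minimal JSON string escaping, enough for this example.
    static String jsonString(String s) {
        return "\"" + s.replace("\\", "\\\\").replace("\"", "\\\"").replace("\n", "\\n") + "\"";
    }
}
```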
By Ko-Jen Hsiao, Yesu Feng, and Sudarshan Lamkhede. Motivation: Netflix's personalized recommender system is a complex system, boasting a variety of specialized machine-learned models, each catering to distinct needs including Continue Watching and Today's Top Picks for You. (Refer to our recent overview for more details.)
If you looked at the Kafka Summits I’ve been a part of as a sequence of immutable events (and they are, unless you know something about time I don’t), it would look like this: New York City 2017, San Francisco 2017, London 2018, San Francisco 2018, New York City 2019, London 2019, San Francisco 2019. Yes, you read that right.
Launched in 2019, this strategy aims to position the US as a leader in AI research, development, and deployment. It focuses on five key pillars: investing in research and development; unleashing government AI resources; setting standards and policy; building the AI workforce; and advancing trust and security.
Here we explore initial system designs we considered, an overview of the current architecture, and some important principles Meta takes into account in making data accessible and easy to understand. 2019: Users can view their activity off Meta technologies and clear their history, a feature on Facebook. What are data logs?
As with any system out there, the data often needs processing before it can be used. In traditional data warehousing, we’d call this ETL, and whilst more “modern” systems might not recognise this term, it’s what most of us end up doing whether we call it pipelines or wrangling or engineering. Handling time.
which is difficult when troubleshooting distributed systems. This insight led us to build Edgar: a distributed tracing infrastructure and user experience. Troubleshooting a session in Edgar When we started building Edgar four years ago, there were very few open-source distributed tracing systems that satisfied our needs.
With so many sessions to choose from, perhaps you’re wondering where to start. Based on the votes of Summit attendees from within the Kafka Summit mobile app, here are the top-rated talks: Building Stream Processing Applications with Apache Kafka Using KSQL by Robin Moffatt of Confluent, and Why Stop the World When You Can Change It?
I even stopped by Build-A-Bear at lunchtime with the inaugural class of Confluent Community Catalysts! He is the co-presenter of various O’Reilly training videos on topics ranging from Git to distributed systems, and is the author of Gradle Beyond the Basics. And I saw fresh Kafka swag in the making.
Zhamak Dehghani introduced the concepts behind this architectural pattern in 2019, and since then it has been gaining popularity, with many companies adopting some version of it in their systems. How has your view of the principles of the data mesh changed since our conversation in July of 2019?
In 2020, anticipating the growing needs of the business and to simplify our storage offerings, we decided to consolidate our different key-value systems in the company into a single unified service called KVStore. In order to build a distributed and replicated service using RocksDB, we built a real-time replicator library: Rocksplicator.
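Rocksplicator itself is a C++ library, so nothing below is its actual code. Purely as an illustration of the embedded RocksDB API that a key-value service such as KVStore wraps on each storage node, here is a minimal sketch using the RocksJava bindings; the database path and keys are made up for the example.

```java
import org.rocksdb.Options;
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;

public class LocalKvExample {
    static {
        RocksDB.loadLibrary();   // load the native RocksDB library once per process
    }

    public static void main(String[] args) throws RocksDBException {
        try (Options options = new Options().setCreateIfMissing(true);
             RocksDB db = RocksDB.open(options, "/tmp/kvstore-shard-0")) {   // illustrative path
            db.put("user:42".getBytes(), "{\"name\":\"Ada\"}".getBytes());   // local, durable write
            byte[] value = db.get("user:42".getBytes());
            System.out.println(new String(value));
        }
    }
}
```

Replication and sharding sit on top of many such local stores, which is the gap a replicator library is built to fill.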
While the process of building simple, domain-specific chatbots has gotten way easier, building large scale, multi-agent conversational applications remains a massive challenge.
The app makes heavy use of code generation, spurred by Buck, our custom build system. Without heavy caching from our build system, engineers would have to spend an entire workday waiting for the app to build. If News Feed wanted to have a declarative UI, the team would have to build a new UI framework.
A 2020 retention report by the Work Institute revealed that over 42 million employees in the US left their jobs voluntarily in 2019, and this trend appeared to be increasing. This makes it especially important for organizations to set and meet diversity, equity and inclusion targets to build well-rounded and successful teams.
In 2019, Alibaba bought Ververica. This comparison seems like a clear point that Flink will be the system of choice. Streaming systems are already difficult enough, and adding more complexity to the choice will lead to analysis paralysis. That’s because it was recently founded, but that doesn’t mean it wasn’t formidable.
We’re looking for driven engineers to fortify our European operations and solve some of the hardest problems in building large distributed systems to support rideshare, mapping, and more. Lyft was founded in 2012 and went public in 2019, with the mission to improve people’s lives with the world’s best transportation.
Built with Prometheus and InfluxDB monitoring systems. To build an event streaming pipeline, Spring Cloud Data Flow provides a set of application types: a source represents the first step in the data pipeline, a producer that extracts data from external systems like databases, filesystems, FTP servers, IoT devices, etc.
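Spring Cloud Data Flow composes such pipelines from Spring Cloud Stream applications. As a minimal sketch of a source in the functional style, a Supplier bean is polled on a schedule and each returned value is published to the configured output binding (for example a Kafka topic); the class name, bean name, and payload here are illustrative assumptions, not from the article.

```java
import java.time.Instant;
import java.util.function.Supplier;

import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.context.annotation.Bean;

@SpringBootApplication
public class SensorSourceApplication {

    // Spring Cloud Stream polls this Supplier and publishes each returned value
    // to the output binding configured for the application.
    @Bean
    public Supplier<String> sensorReadings() {
        return () -> "{\"sensorId\":\"s-1\",\"readAt\":\"" + Instant.now() + "\",\"value\":21.5}";
    }

    public static void main(String[] args) {
        SpringApplication.run(SensorSourceApplication.class, args);
    }
}
```

In the same model, a processor is a Function<T, R> bean and a sink is a Consumer<T> bean; Data Flow registers such apps and wires them into a stream definition.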
Software Engineers, on the other hand, specialize in building and developing comprehensive systems, with an emphasis on architectural and engineering concepts. A Software Engineer may focus on specific areas of development, such as system design, algorithms, or a particular programming language.
An authoritarian regime is manipulating an artificial intelligence (AI) system to spy on technology users. “When developing ethical AI systems, the most important part is intent and diligence in evaluating models on an ongoing basis,” said Santiago Giraldo Anduaga, director of product marketing, data engineering and ML at Cloudera.
But with growing demands, there’s a more nuanced need for enterprise-scale machine learning solutions and better data management systems. They created a system to spread data across several servers with GPU-based processing so large datasets could be managed more effectively across the board.
Below are the Power BI requirements for the system. Supported operating systems: Power BI can be installed on a device running one of the following operating systems: Windows Server 2019 Datacenter, Windows Server 2019 Standard, Windows Server 2016 Standard, or Windows Server 2016 Datacenter. GHz with the recommended level of 2.0
Over time, LinkedIn's engineering team expanded the stream processing ecosystem with more proprietary tools like Brooklin , facilitating data streaming across multiple stores and messaging systems, and Venice , serving as a storage system for ingesting batch and stream processing job outputs, among others.
How viewers are able to watch their favorite show on Netflix while the infrastructure self-recovers from a system failure By Manuel Correa , Arthur Gonigberg , and Daniel West Getting stuck in traffic is one of the most frustrating experiences for drivers around the world. Logs and background requests are examples of this type of traffic.
A minion (an agent on a host) sees jobs and results by subscribing to events published on the event bus by the master service. Salt uses ZMQ (ZeroMQ) to achieve high-speed, asynchronous communication between connected systems. Targeted minions execute the job on the host and return the result to the master.
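The Salt minion itself is written in Python and its bus messages are serialized and encrypted, so the following is only a generic sketch of the ZeroMQ publish/subscribe pattern the excerpt describes, written with the JeroMQ bindings. The hostname is a placeholder; 4505 is Salt's default publish port.

```java
import org.zeromq.SocketType;
import org.zeromq.ZContext;
import org.zeromq.ZMQ;

public class BusSubscriber {
    public static void main(String[] args) {
        try (ZContext context = new ZContext()) {
            // Connect to the master's publish socket (4505 is Salt's default publish port).
            ZMQ.Socket subscriber = context.createSocket(SocketType.SUB);
            subscriber.connect("tcp://salt-master:4505");   // illustrative hostname
            subscriber.subscribe(new byte[0]);               // empty prefix = receive every message

            while (!Thread.currentThread().isInterrupted()) {
                byte[] frame = subscriber.recv();            // blocks until the master publishes a job
                System.out.println("received " + frame.length + " bytes");
            }
        }
    }
}
```

Results flow back on a separate request socket rather than this publish channel, which is why the master can fan a job out to thousands of minions cheaply.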
4:45pm-5:45pm NFX 202 A day in the life of a Netflix Engineer Dave Hahn , SRE Engineering Manager Abstract : Netflix is a large, ever-changing ecosystem serving millions of customers across the globe through cloud-based systems and a globally distributed CDN. We explore all the systems necessary to make and stream content from Netflix.
Ingesting Twitter data is very easy with Kafka Connect, a framework for connecting Kafka with external systems: confluent-hub install jcustenborder/kafka-connect-twitter:latest. For more details on how to build a UD(A)F function, please refer to How to Build a UDF and/or UDAF in KSQL 5.0.
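The linked post covers the UDF API in detail; as a rough illustration of what a scalar UDF looks like in the KSQL 5.0 annotation style, here is a sketch in which the function name, class name, and thresholds are invented for the example.

```java
import io.confluent.ksql.function.udf.Udf;
import io.confluent.ksql.function.udf.UdfDescription;

// Packaged into a jar and placed in the KSQL extension directory, this becomes
// callable from KSQL queries as SENTIMENT_LABEL(score).
@UdfDescription(name = "sentiment_label", description = "Buckets a sentiment score into a label")
public class SentimentLabelUdf {

    @Udf(description = "Returns NEGATIVE, NEUTRAL or POSITIVE for a score in [-1, 1]")
    public String label(final double score) {
        if (score < -0.33) {
            return "NEGATIVE";
        }
        if (score > 0.33) {
            return "POSITIVE";
        }
        return "NEUTRAL";
    }
}
```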
Apache Oozie — An open-source workflow scheduler system to manage Apache Hadoop jobs. DBT (Data Build Tool) — A command-line tool that enables data analysts and engineers to transform data in their warehouse more effectively. Reflow — A system for incremental data processing in the cloud. Google Cloud Build.
The Data Lake architecture was proposed in a period of great growth in data volume, especially in non-structured and semi-structured data, when traditional Data Warehouse systems started to become incapable of dealing with this demand. Let’s add the readings from 2019. Legend says that this didn’t go well. The data became useless.