Data Ingestion, Java and MySQL - Data Engineering Digest

Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet

Data Engineering Podcast

NOVEMBER 20, 2022

Announcements: Hello and welcome to the Data Engineering Podcast, the show about modern data management. When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Data Lake

Data Lake Data Ingestion MongoDB MySQL

Discover And De-Clutter Your Unstructured Data With Aparavi

Data Engineering Podcast

JUNE 12, 2022

Unstructured Data

Unstructured Data MongoDB MySQL Scala

Going From Transactional To Analytical And Self-managed To Cloud On One Database With MariaDB

Data Engineering Podcast

OCTOBER 23, 2022

In this episode, field CTO Manjot Singh shares his experiences as an early user of MySQL and MariaDB and explains how the suite of products being built on top of the open source foundation addresses the growing need for advanced storage and analytical capabilities. Can you describe what MariaDB is and the story behind it?

Database

Database MySQL Cloud MongoDB

Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

Data Engineering Podcast

NOVEMBER 6, 2022

MongoDB

MongoDB MySQL Scala Machine Learning

Level Up Your Data Platform With Active Metadata

Data Engineering Podcast

JUNE 19, 2022

Metadata

Metadata MongoDB MySQL Scala

Building Data Pipelines That Run From Source To Analysis And Activation With Hevo Data

Data Engineering Podcast

SEPTEMBER 11, 2022

Data Pipeline

Data Pipeline Building MongoDB MySQL

An Exploration Of The Expectations, Ecosystem, and Realities Of Real-Time Data Applications

Data Engineering Podcast

AUGUST 21, 2022

Lambda Architecture

Lambda Architecture MongoDB MySQL Scala

Optimize Your Machine Learning Development And Serving With The Open Source Vector Database Milvus

Data Engineering Podcast

AUGUST 6, 2022

Machine Learning

Machine Learning Database MySQL PostgreSQL

Joe Reis Flips The Script And Interviews Tobias Macey About The Data Engineering Podcast

Data Engineering Podcast

JULY 17, 2022

Data Engineer

Data Engineer Data Engineering Engineering MongoDB

Collecting And Retaining Contextual Metadata For Powerful And Effective Data Discovery

Data Engineering Podcast

AUGUST 13, 2022

Metadata

Metadata MongoDB MySQL Scala

A Multipurpose Database For Transactions And Analytics To Simplify Your Data Architecture With Singlestore

Data Engineering Podcast

MAY 29, 2022

The Ascend Data Automation Cloud provides a unified platform for data ingestion, transformation, orchestration, and observability. Ascend users love its declarative pipelines, powerful SDK, elegant UI, and extensible plug-in architecture, as well as its support for Python, SQL, Scala, and Java.

Database

Database Architecture Data Architecture PostgreSQL

Simplify Data Security For Sensitive Information With The Skyflow Data Privacy Vault

Data Engineering Podcast

JUNE 5, 2022

Data Security

Data Security Metadata MongoDB MySQL

Be Confident In Your Data Integration By Quickly Validating Matching Records With data-

Data Engineering Podcast

JULY 3, 2022

Data Integration

Data Integration MongoDB MySQL Scala

Maintain Your Data Engineers' Sanity By Embracing Automation

Data Engineering Podcast

JULY 10, 2022

Data Engineer

Data Engineer Data Engineering Engineering MongoDB

Re-Bundling The Data Stack With Data Orchestration And Software Defined Assets Using Dagster

Data Engineering Podcast

JULY 24, 2022

MongoDB

MongoDB MySQL Scala Data Lake

Interactive Exploratory Data Analysis On Petabyte Scale Data Sets With Arkouda

Data Engineering Podcast

JULY 31, 2022

Data Analysis

Data Analysis MongoDB Algorithm MySQL

Alumni Of AirBnB's Early Years Reflect On What They Learned About Building Data Driven Organizations

Data Engineering Podcast

AUGUST 28, 2022

Building

Building MongoDB MySQL Scala

Operational Analytics To Increase Efficiency For Multi-Location Businesses With OpsAnalitica

Data Engineering Podcast

SEPTEMBER 18, 2022

Hospitality

Hospitality Food MongoDB MySQL

Power Your Real-Time Analytics Without The Headache Using Fivetran's Change Data Capture Integrations

Data Engineering Podcast

SEPTEMBER 25, 2022

Food

Food MongoDB MySQL Scala

An Exploration Of The Open Data Lakehouse And Dremio's Contribution To The Ecosystem

Data Engineering Podcast

OCTOBER 16, 2022

Data Lake

Data Lake Food MongoDB MySQL

Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt

Data Engineering Podcast

OCTOBER 30, 2022

Engineering

Engineering MongoDB MySQL Scala

Taking A Look Under The Hood At CreditKarma's Data Platform

Data Engineering Podcast

NOVEMBER 13, 2022

MongoDB

MongoDB MySQL Google Cloud Scala

Updates, Inserts, Deletes: Comparing Elasticsearch and Rockset for Real-Time Data Ingest

Rockset

OCTOBER 11, 2022

In this blog, we’ll compare and contrast how Elasticsearch and Rockset handle data ingestion as well as provide practical techniques for using these systems for real-time analytics. Logstash is an event processing pipeline that ingests and transforms data before sending it to Elasticsearch.

Data Ingestion

Data Ingestion Kafka Relational Database PostgreSQL

Strategies And Tactics For A Successful Master Data Management Implementation

Data Engineering Podcast

JUNE 26, 2022

Data Management

Data Management Management MongoDB MySQL

Bringing Automation To Data Labeling For Machine Learning With Watchful

Data Engineering Podcast

AUGUST 13, 2022

Machine Learning

Machine Learning Pipeline-centric Database-centric MongoDB

Make Data Lineage A Ubiquitous Part Of Your Work By Simplifying Its Implementation With Alvin

Data Engineering Podcast

OCTOBER 2, 2022

IT

IT Food PostgreSQL MongoDB

Investing In Understanding The Customer Journey At American Express

Data Engineering Podcast

OCTOBER 9, 2022

Food

Food MongoDB MySQL Scala

Introduce Climate Analytics Into Your Data Platform Without The Heavy Lifting Using Sust Global

Data Engineering Podcast

SEPTEMBER 4, 2022

MongoDB

MongoDB MySQL Scala Machine Learning

Sqoop vs. Flume Battle of the Hadoop ETL tools

ProjectPro

OCTOBER 28, 2015

Apache Hadoop is synonymous with big data because of its cost-effectiveness and its scalability in processing petabytes of data. Data analysis using Hadoop is only half the battle. Getting data into the Hadoop cluster plays a critical role in any big data deployment. ... then you are on the right page.

ETL Tools

ETL Tools Hadoop Relational Database Unstructured Data

100+ Big Data Interview Questions and Answers 2023

ProjectPro

JANUARY 31, 2023

There are three steps involved in the deployment of a big data model. Data ingestion: this is the first step in deploying a big data model, i.e., extracting data from multiple data sources. Spark provides APIs for the programming languages Java, Scala, and Python.

Big Data

Big Data Hadoop Relational Database AWS

20+ Data Engineering Projects for Beginners with Source Code

ProjectPro

AUGUST 24, 2021

Data Engineering Project for Beginners: If you are a newbie in data engineering and are interested in exploring real-world data engineering projects, check out the list of data engineering project examples below. This big data project discusses IoT architecture with a sample use case.

Data Engineer

Data Engineer Data Engineering Coding Project

Data Lake Explained: A Comprehensive Guide to Its Architecture and Use Cases

AltexSoft

AUGUST 29, 2023

In 2010, a transformative concept took root in the realm of data storage and analytics: the data lake. The term was coined by James Dixon, Back-End Java, Data, and Business Intelligence Engineer, and it started a new era in how organizations could store, manage, and analyze their data. Semi-structured data sources.

Data Lake

Data Lake Architecture IT Amazon Web Services

Apache Spark Use Cases & Applications

Knowledge Hut

MAY 2, 2024

As per Apache, “Apache Spark is a unified analytics engine for large-scale data processing.” Spark is a cluster computing framework, somewhat similar to MapReduce, but with many more capabilities and features and greater speed. It provides APIs for developers in many languages, such as Scala, Python, Java, and R.

Scala

Scala Hospitality Machine Learning Healthcare

Data Engineer Learning Path, Career Track & Roadmap for 2023

ProjectPro

JANUARY 19, 2022

Data Engineering Requirements. Here is a list of skills needed to become a data engineer: Highly skilled at graduation-level mathematics. Good skills in computer programming languages like R, Python, Java, C++, etc. Depending on the type of database a data engineer is working with, they will use specific software.

Data Engineer

Data Engineer Data Engineering Engineering Amazon Web Services
