Data Lake, MySQL and Scala - Data Engineering Digest

Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet

Data Engineering Podcast

NOVEMBER 20, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Data Lake

Data Lake Data Ingestion MongoDB MySQL

Discover And De-Clutter Your Unstructured Data With Aparavi

Data Engineering Podcast

JUNE 12, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Unstructured Data

Unstructured Data MongoDB MySQL Scala

Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

Data Engineering Podcast

NOVEMBER 6, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

MongoDB

MongoDB MySQL Scala Machine Learning

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

Going From Transactional To Analytical And Self-managed To Cloud On One Database With MariaDB

Data Engineering Podcast

OCTOBER 23, 2022

In this episode field CTO Manjot Singh shares his experiences as an early user of MySQL and MariaDB and explains how the suite of products being built on top of the open source foundation address the growing needs for advanced storage and analytical capabilities. Mention the podcast to get a free "In Data We Trust World Tour" t-shirt.

Database

Database MySQL Cloud MongoDB

Level Up Your Data Platform With Active Metadata

Data Engineering Podcast

JUNE 19, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Metadata

Metadata MongoDB MySQL Scala

An Exploration Of The Open Data Lakehouse And Dremio's Contribution To The Ecosystem

Data Engineering Podcast

OCTOBER 16, 2022

Summary The "data lakehouse" architecture balances the scalability and flexibility of data lakes with the ease of use and transaction support of data warehouses. Mention the podcast to get a free "In Data We Trust World Tour" t-shirt.

Data Lake

Data Lake Food MongoDB MySQL

Building Data Pipelines That Run From Source To Analysis And Activation With Hevo Data

Data Engineering Podcast

SEPTEMBER 11, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. Sifflet also offers a 2-week free trial.

Data Pipeline

Data Pipeline Building MongoDB MySQL

Optimize Your Machine Learning Development And Serving With The Open Source Vector Database Milvus

Data Engineering Podcast

AUGUST 6, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. Sifflet also offers a 2-week free trial.

Machine Learning

Machine Learning Database MySQL MongoDB

Joe Reis Flips The Script And Interviews Tobias Macey About The Data Engineering Podcast

Data Engineering Podcast

JULY 17, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Data Engineer

Data Engineer Data Engineering Engineering MongoDB

AWS Glue-Unleashing the Power of Serverless ETL Effortlessly

ProjectPro

FEBRUARY 8, 2023

It offers a simple and efficient solution for data processing in organizations. It offers users a data integration tool that organizes data from many sources, formats it, and stores it in a single repository, such as data lakes, data warehouses, etc., where it can be used to facilitate business decisions.

AWS

AWS Scala Metadata Data Lake

Be Confident In Your Data Integration By Quickly Validating Matching Records With data-

Data Engineering Podcast

JULY 3, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Data Integration

Data Integration MongoDB MySQL Scala

Maintain Your Data Engineers' Sanity By Embracing Automation

Data Engineering Podcast

JULY 10, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Data Engineer

Data Engineer Data Engineering Engineering MongoDB

Re-Bundling The Data Stack With Data Orchestration And Software Defined Assets Using Dagster

Data Engineering Podcast

JULY 24, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

MongoDB

MongoDB MySQL Scala Data Lake

Interactive Exploratory Data Analysis On Petabyte Scale Data Sets With Arkouda

Data Engineering Podcast

JULY 31, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. Sifflet also offers a 2-week free trial.

Data Analysis

Data Analysis MongoDB Algorithm MySQL

Operational Analytics To Increase Efficiency For Multi-Location Businesses With OpsAnalitica

Data Engineering Podcast

SEPTEMBER 18, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Hospitality

Hospitality Food MongoDB MySQL

Power Your Real-Time Analytics Without The Headache Using Fivetran's Change Data Capture Integrations

Data Engineering Podcast

SEPTEMBER 25, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Food

Food MongoDB MySQL Scala

Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt

Data Engineering Podcast

OCTOBER 30, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Engineering

Engineering MongoDB MySQL Scala

Taking A Look Under The Hood At CreditKarma's Data Platform

Data Engineering Podcast

NOVEMBER 13, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

MongoDB

MongoDB MySQL Google Cloud Scala

Strategies And Tactics For A Successful Master Data Management Implementation

Data Engineering Podcast

JUNE 26, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Data Management

Data Management Management MongoDB MySQL

Make Data Lineage A Ubiquitous Part Of Your Work By Simplifying Its Implementation With Alvin

Data Engineering Podcast

OCTOBER 2, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

IT

IT Food MongoDB PostgreSQL

Investing In Understanding The Customer Journey At American Express

Data Engineering Podcast

OCTOBER 9, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Food

Food MongoDB MySQL Scala

Python for Data Engineering

Ascend.io

SEPTEMBER 14, 2023

Read More: Data Automation Engineer: Skills, Workflow, and Business Impact Python for Data Engineering Versus SQL, Java, and Scala When diving into the domain of data engineering, understanding the strengths and weaknesses of your chosen programming language is essential.

Data Engineer

Data Engineer Data Engineering Python Engineering

The Top 25 Data Engineering Influencers and Content Creators on LinkedIn

Databand.ai

DECEMBER 13, 2022

He currently runs a YouTube channel, E-Learning Bridge , focused on video tutorials for aspiring data professionals and regularly shares advice on data engineering, developer life, careers, motivations, and interviewing on LinkedIn.

Data Engineer

Data Engineer Data Engineering Engineering AWS

100+ Big Data Interview Questions and Answers 2023

ProjectPro

JANUARY 31, 2023

Spark Architecture has three major components: API, Data Storage, and Management Framework. Spark provides APIs for the programming languages Java, Scala, and Python. Data Storage: Spark stores data using the HDFS file system. Any Hadoop-compatible data source, such as HDFS, HBase, and Cassandra , etc.,

Big Data

Big Data Hadoop Relational Database AWS

Top Hadoop Projects and Spark Projects for Beginners 2021

ProjectPro

NOVEMBER 14, 2015

It plays a key role in streaming in the form of Spark Streaming libraries, interactive analytics in the form of SparkSQL and also provides libraries for machine learning that can be imported using Python or Scala. Objective and Summary of the project: With so much use of online applications the inflow of data has increased exponentially.

Hadoop

Hadoop Project Big Data Healthcare

100+ Data Engineer Interview Questions and Answers for 2023

ProjectPro

JULY 27, 2021

Non-relational databases are ideal if you need flexibility for storing the data since you cannot create documents without having a fixed schema. E.g. PostgreSQL, MySQL, Oracle, Microsoft SQL Server. E.g. Redis, MongoDB, Cassandra, HBase , Neo4j, CouchDB What is data modeling? A data warehouse can contain unstructured data too.

Data Engineer

Data Engineer Data Engineering Engineering Hadoop

Data News — Week 24.08

Christophe Blefari

FEBRUARY 23, 2024

JVM vs. SQL data engineer — There's a big discussion in the community about what real data engineering is. Is it Java/Scala or Python? Is it lake or warehouse? It's a sterile debate: both are useful and can serve different organisations with different service level for data users and stakeholders.

Data Lake

Data Lake MongoDB PostgreSQL MySQL

Data Engineering Digest

Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet

Discover And De-Clutter Your Unstructured Data With Aparavi

Webinars

Trending Sources

Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

Webinars

Going From Transactional To Analytical And Self-managed To Cloud On One Database With MariaDB

Level Up Your Data Platform With Active Metadata

An Exploration Of The Open Data Lakehouse And Dremio's Contribution To The Ecosystem

Building Data Pipelines That Run From Source To Analysis And Activation With Hevo Data

Optimize Your Machine Learning Development And Serving With The Open Source Vector Database Milvus

Joe Reis Flips The Script And Interviews Tobias Macey About The Data Engineering Podcast

AWS Glue-Unleashing the Power of Serverless ETL Effortlessly

Be Confident In Your Data Integration By Quickly Validating Matching Records With data-

Maintain Your Data Engineers' Sanity By Embracing Automation

Re-Bundling The Data Stack With Data Orchestration And Software Defined Assets Using Dagster

Interactive Exploratory Data Analysis On Petabyte Scale Data Sets With Arkouda

Operational Analytics To Increase Efficiency For Multi-Location Businesses With OpsAnalitica

Power Your Real-Time Analytics Without The Headache Using Fivetran's Change Data Capture Integrations

Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt

Taking A Look Under The Hood At CreditKarma's Data Platform

Strategies And Tactics For A Successful Master Data Management Implementation

Make Data Lineage A Ubiquitous Part Of Your Work By Simplifying Its Implementation With Alvin

Investing In Understanding The Customer Journey At American Express

Python for Data Engineering

The Top 25 Data Engineering Influencers and Content Creators on LinkedIn

100+ Big Data Interview Questions and Answers 2023

Top Hadoop Projects and Spark Projects for Beginners 2021

100+ Data Engineer Interview Questions and Answers for 2023

Data News — Week 24.08

Stay Connected