Building, Data Workflow and Pipeline-centric

Building

Data Workflow

Pipeline-centric

Snowflake’s New Python API Empowers Data Engineers to Build Modern Data Pipelines with Ease

Snowflake

APRIL 17, 2024

In today’s data-driven world, developer productivity is essential for organizations to build effective and reliable products, accelerate time to value, and fuel ongoing innovation. Dive in to experience how the enhanced Python API streamlines your data workflows and unlocks the full potential of Python within Snowflake.

Data Pipeline

Data Pipeline Python Data Engineering Data Engineer

Improve Data Quality Through Engineering Rigor And Business Engagement With Synq

Data Engineering Podcast

JUNE 30, 2024

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management Data lakes are notoriously complex. What does an on-call rotation for a data engineer/data platform engineer look like as compared with an application-focused team?

Pipeline-centric

Pipeline-centric Engineering Data Lake High Quality Data

Join 37,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

How to Modernize Manufacturing Without Losing Control

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Data Engineering Weekly #196

Data Engineering Weekly

NOVEMBER 3, 2024

impactdatasummit.com Thumbtack: What we learned building an ML infrastructure team at Thumbtack Thumbtack shares valuable insights from building its ML infrastructure team. The blog emphasizes the importance of starting with a clear client focus to avoid over-engineering and ensure user-centric development.

Data Engineering

Data Engineering Data Engineer Engineering Pipeline-centric

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

How to Modernize Manufacturing Without Losing Control

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Microsoft Fabric vs. Snowflake: Key Differences You Need to Know

Edureka

APRIL 22, 2025

Since all of Fabric’s tools run natively on OneLake, real-time performance without data duplication is possible in Direct Lake mode. Because of the architecture’s ability to abstract infrastructure complexity, users can focus solely on data workflows. Cloud support Microsoft Fabric: Works only on Microsoft Azure.

BI Pipeline-centric Data Lake Google Cloud

Data Engineering Weekly #214

Data Engineering Weekly

MARCH 30, 2025

A few exciting theses exist around composite data stack, catalogs, and MCP. Eval plays a critical role in the growth and maturity of LLM-centric systems. The paper critically examines the Text2SQL task, highlighting that limitations go beyond model performance to encompass the entire solution pipeline and evaluation process.

Data Engineering

Data Engineering Data Engineer Engineering Pipeline-centric

Toward a Data Mesh (part 2) : Architecture & Technologies

François Nguyen

MARCH 22, 2021

TL;DR After setting up and organizing the teams, we are describing 4 topics to make data mesh a reality. How do we build data products ? The next problem will be the diversity of these mini data platforms (because of the configuration) and you even go deeper in problems with managing different technologies or version.

Technology

Technology Architecture Google Cloud Metadata

How to Become a Data Engineer in 2024?

Knowledge Hut

DECEMBER 26, 2023

Data Engineering is typically a software engineering role that focuses deeply on data – namely, data workflows, data pipelines, and the ETL (Extract, Transform, Load) process. What is the role of a Data Engineer? They are required to have deep knowledge of distributed systems and computer science.

Data Engineering

Data Engineering Data Engineer Engineering Hadoop

What is Azure Data Factory – Here’s Everything You Need to Know

Edureka

JULY 3, 2024

ADF connects to various data sources, including on-premises systems, cloud services, and SaaS applications. It then gathers and relocates information to a centralized hub in the cloud using the Copy Activity within data pipelines. Transform and Enhance the Data: Once centralized, data undergoes transformation and enrichment.

Pipeline-centric

Pipeline-centric Data Lake Database-centric Data Pipeline

Data Pipeline vs. ETL: Which Delivers More Value?

Ascend.io

MAY 31, 2023

In the modern world of data engineering, two concepts often find themselves in a semantic tug-of-war: data pipeline and ETL. Fast forward to the present day, and we now have data pipelines. Data Ingestion Data ingestion is the first step of both ETL and data pipelines.

Data Pipeline

Data Pipeline ETL Tools Pipeline-centric Data Warehouse

Hadoop vs Spark: Main Big Data Tools Explained

AltexSoft

JUNE 7, 2021

These pitfalls along with the need to cover an end-to-end Big Data workflow prompted the emergence of various additional services, compatible with each other. Spark Streaming empowers the core engine with near-real-time processing capabilities and facilitates building streaming analytics products. Apache Hadoop ecosystem.

Big Data Tools

Big Data Tools Hadoop Big Data Database-centric

Data Orchestration Tools (Quick Reference Guide)

Monte Carlo

NOVEMBER 14, 2023

This is the world that data orchestration tools aim to create. Data orchestration tools minimize manual intervention by automating the movement of data within data pipelines. According to one Redditor on r/dataengineering, “Seems like 99/100 data engineering jobs mention Airflow.”

Pipeline-centric

Pipeline-centric Google Cloud Python Data Workflow

The Top Data Strategy Influencers and Content Creators on LinkedIn

Databand.ai

DECEMBER 29, 2022

Chad writes on data management, contracts, and products on his Substack blog and serves as an advisor and investor to several startups. He recently joined Databand’s MAD Data Podcast to talk about how his team is building one of the most advanced experimentation and machine learning platforms in the world from the ground up.

BI Consulting Data Science Data Governance

Data Engineering Digest

Snowflake’s New Python API Empowers Data Engineers to Build Modern Data Pipelines with Ease

Improve Data Quality Through Engineering Rigor And Business Engagement With Synq

Webinars

Trending Sources

Data Engineering Weekly #196

Webinars

Microsoft Fabric vs. Snowflake: Key Differences You Need to Know

Data Engineering Weekly #214

Toward a Data Mesh (part 2) : Architecture & Technologies

How to Become a Data Engineer in 2024?

What is Azure Data Factory – Here’s Everything You Need to Know

Data Pipeline vs. ETL: Which Delivers More Value?

Hadoop vs Spark: Main Big Data Tools Explained

Data Orchestration Tools (Quick Reference Guide)

The Top Data Strategy Influencers and Content Creators on LinkedIn

Stay Connected