Many of our customers — from Marriott to AT&T — start their journey with the Snowflake AI Data Cloud by migrating their data warehousing workloads to the platform. Today we’re focusing on customers who migrated from a cloud data warehouse to Snowflake and some of the benefits they saw.
Migrating from a traditional data warehouse to a cloud data platform is often complex, resource-intensive and costly. As part of this announcement, Snowflake is also announcing private preview support of a new end-to-end data migration experience for Amazon Redshift.
Introduction Data is the new oil of this century. The database is a major element of a data science project. So, we are […] The post How to Normalize Relational Databases With SQL Code? appeared first on Analytics Vidhya.
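The core idea behind normalization can be sketched in a few lines: repeated attributes move into their own table, and the original table references them by key. A minimal illustration using Python's built-in sqlite3, with entirely hypothetical table and column names (not taken from the linked article):

```python
import sqlite3

# Hypothetical denormalized table: customer details repeat on every order row.
con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE orders_raw (
    order_id INTEGER,
    customer_name TEXT,
    customer_email TEXT,
    amount REAL
);
INSERT INTO orders_raw VALUES
    (1, 'Ada', 'ada@example.com', 10.0),
    (2, 'Ada', 'ada@example.com', 25.0),
    (3, 'Grace', 'grace@example.com', 7.5);

-- Normalize: customer attributes get their own table (stored once),
-- and orders reference them by surrogate key.
CREATE TABLE customers (
    customer_id INTEGER PRIMARY KEY,
    name TEXT UNIQUE,
    email TEXT
);
INSERT INTO customers (name, email)
    SELECT DISTINCT customer_name, customer_email FROM orders_raw;

CREATE TABLE orders (
    order_id INTEGER PRIMARY KEY,
    customer_id INTEGER REFERENCES customers(customer_id),
    amount REAL
);
INSERT INTO orders
    SELECT r.order_id, c.customer_id, r.amount
    FROM orders_raw r JOIN customers c ON c.name = r.customer_name;
""")

# Queries still work, now via a join instead of repeated columns.
rows = con.execute("""
    SELECT c.name, COUNT(*)
    FROM orders o JOIN customers c USING (customer_id)
    GROUP BY c.name ORDER BY c.name
""").fetchall()
print(rows)  # [('Ada', 2), ('Grace', 1)]
```

Updating a customer's email now touches one row instead of every order they ever placed, which is the practical payoff of normalization.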
Did you know Cloudera customers, such as SMG and Geisinger, offloaded their legacy DW environment to Cloudera Data Warehouse (CDW) to take advantage of CDW’s modern architecture and best-in-class performance? Today, we are pleased to announce the general availability of HPL/SQL integration in CDW public cloud.
The Data News are here to stay; the format might vary during the year, but here we are for another year. We published videos from the Forward Data Conference: you can watch the keynote by Hannes, DuckDB co-creator, about Changing Large Tables. HNY 2025 (credits) Happy new year ✨ I wish you the best for 2025.
Data lineage is an instrumental part of Meta’s Privacy Aware Infrastructure (PAI) initiative, a suite of technologies that efficiently protect user privacy. It is a critical and powerful tool for scalable discovery of relevant data and data flows, which supports privacy controls across Meta’s systems.
Introduction In today’s world, technology has advanced tremendously, and many people are using the internet. This results in the generation of a huge amount of data daily. This generated data is stored in databases and maintained there. SQL is a structured query language used to read and write these databases.
Summary Stream processing systems have long been built with a code-first design, adding SQL as a layer on top of the existing framework. In this episode Yingjun Wu explains how his system is architected to power analytical workflows on continuous data flows, and the challenges of making it responsive and scalable.
dbt Core is an open-source framework that helps you organise data warehouse SQL transformations. dbt was born out of the observation that more and more companies were switching from on-premise Hadoop data infrastructure to cloud data warehouses. This switch has been led by the modern data stack vision.
With yato you give it a folder of SQL queries and it guesses the DAG and runs the queries in the right order. Saying mainly that "Sora is a tool to extend creativity". Last point: Mira has been mocked and criticised online because, as a CTO, she wasn't able to say on which public / licensed data Sora has been trained on.
Summary Data transformation is a key activity for all of the organizational roles that interact with data. Because of its importance and outsized impact on what is possible for downstream data consumers it is critical that everyone is able to collaborate seamlessly. Can you describe what SQLMesh is and the story behind it?
Think of your data warehouse like a well-organized library. Without a schema? Total chaos. That’s where data warehouse schemas come in. A data warehouse schema is a blueprint for how your data is structured and linked, usually with fact tables (for measurable data) and dimension tables (for descriptive attributes).
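The fact/dimension split can be made concrete with a tiny star schema. A minimal sketch using Python's built-in sqlite3; the table and column names here are illustrative, not from any specific warehouse:

```python
import sqlite3

# A tiny star schema: one fact table (measurable events) joined to
# dimension tables (descriptive attributes) by surrogate keys.
con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, day TEXT, month TEXT);
CREATE TABLE fact_sales (
    sale_id INTEGER PRIMARY KEY,
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id INTEGER REFERENCES dim_date(date_id),
    quantity INTEGER,
    revenue REAL
);
INSERT INTO dim_product VALUES (1, 'Widget', 'Hardware'), (2, 'Gadget', 'Hardware');
INSERT INTO dim_date VALUES (1, '2024-01-01', '2024-01'), (2, '2024-01-02', '2024-01');
INSERT INTO fact_sales VALUES
    (1, 1, 1, 3, 30.0), (2, 2, 1, 1, 15.0), (3, 1, 2, 2, 20.0);
""")

# A typical warehouse query: join the fact table to a dimension,
# then aggregate the measures.
totals = con.execute("""
    SELECT p.name, SUM(f.revenue)
    FROM fact_sales f JOIN dim_product p USING (product_id)
    GROUP BY p.name ORDER BY p.name
""").fetchall()
print(totals)  # [('Gadget', 15.0), ('Widget', 50.0)]
```

The fact table stays narrow and append-only while descriptive detail lives in the dimensions, which is what keeps analytical queries simple and fast.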
Summary A data lakehouse is intended to combine the benefits of data lakes (cost effective, scalable storage and compute) and data warehouses (user friendly SQL interface). Data lakes are notoriously complex. Join in with the event for the global data community, Data Council Austin.
SQL2Fabric Mirroring is a new fully managed service offered by Striim to mirror on-premises SQL databases. It’s a collaborative service between Striim and Microsoft, based on Fabric Open Mirroring, that enables real-time data replication from on-premises SQL Server databases to Azure Fabric OneLake. Striim automates the rest.
Over the years, the technology landscape for data management has given rise to various architecture patterns, each thoughtfully designed to cater to specific use cases and requirements. These patterns include both centralized storage patterns like data warehouse, data lake and data lakehouse, and distributed patterns such as data mesh.
Introduction Setup Code Conditional logic to read from mock input Custom macro to test for equality Setup environment specific test Run ELT using dbt Conclusion Further reading Introduction With the recent advancements in data warehouses and tools like dbt, most transformations (the T of ELT) are being done directly in the data warehouse.
Fitbit activity analysis with DuckDB. Photo by Jake Hills on Unsplash. Wearable fitness trackers have become an integral part of our lives, collecting and tracking data about our daily activities, sleep patterns, location, heart rate, and much more. What insights are buried within my archive of personal fitness activity data?
Three Zero-Cost Solutions That Take Hours, Not Months. A data quality certified pipeline. Source: unsplash.com In my career, data quality initiatives have usually meant big changes. What’s more, fixing the data quality issues this way often leads to new problems. Create a custom dashboard for your specific data quality problem.
Summary Communication and shared context are the hardest part of any data system. In recent years the focus has been on data catalogs as the means for documenting data assets, but those introduce a secondary system of record in order to find the necessary information.
Summary The predominant pattern for data integration in the cloud has become extract, load, and then transform or ELT. With simple pricing, fast networking, object storage, and worldwide data centers, you’ve got everything you need to run a bulletproof data platform.
Summary Streaming data systems have been growing more capable and flexible over the past few years. He also explains why he started Decodable to address that limitation and the work that he and his team have done to let data engineers build streaming pipelines entirely in SQL. Missing data? Stale dashboards?
Editor’s Note: Launching Data & Gen-AI courses in 2025 I can’t believe DEW will soon reach its 200th edition. What I started as a fun hobby has become one of the top-rated newsletters in the data engineering industry. The blog narrates a few examples of Pipe Syntax in comparison with equivalent SQL queries.
Photo by Tiger Lily. Data warehouses and data lakes play a crucial role for many businesses. They give businesses access to the data from all of their various systems, as well as often integrating data so that end-users can answer business-critical questions.
Batch data processing — historically known as ETL — is extremely challenging. It’s time-consuming, brittle, and often unrewarding. Not only that, it’s hard to operate, evolve, and troubleshoot. In this post, we’ll explore how applying the functional programming paradigm to data engineering can bring a lot of clarity to the process.
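The core of the functional approach is that each batch step is a pure function of an immutable input partition, so reruns are deterministic and idempotent. A minimal sketch in Python with illustrative field names (not taken from the linked post):

```python
# A functional-style batch transformation: no shared state, no side effects,
# output depends only on the input partition. Rerunning it on the same
# partition always produces the same result, which makes backfills safe.
def transform(partition: list[dict]) -> list[dict]:
    return [
        {"user": r["user"], "amount_usd": round(r["amount_cents"] / 100, 2)}
        for r in partition
        if r["amount_cents"] > 0  # drop invalid (negative) records
    ]

raw = [
    {"user": "a", "amount_cents": 1250},
    {"user": "b", "amount_cents": -30},   # filtered out
    {"user": "c", "amount_cents": 999},
]
out = transform(raw)
print(out)  # [{'user': 'a', 'amount_usd': 12.5}, {'user': 'c', 'amount_usd': 9.99}]

# Idempotence check: a rerun yields an identical result.
assert transform(raw) == out
```

Because the step never mutates its input or reaches outside itself, a failed run can simply be re-executed, which removes much of the operational brittleness the post describes.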
Many data engineers and analysts don’t realize how valuable the knowledge they have is. They’ve spent hours upon hours learning SQL, Python, how to properly analyze data, build data warehouses, and understand the differences between eight different ETL solutions.
In that time there have been a number of generational shifts in how data engineering is done. Parting Question: From your perspective, what is the biggest gap in the tooling or technology for data management today? Materialize: Looking for the simplest way to get the freshest data possible to your teams?
A substantial amount of the data that is being managed in these systems is related to customers and their interactions with an organization. Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management. Data lakes are notoriously complex.
Summary Cloud data warehouses and the introduction of the ELT paradigm have led to the creation of multiple options for flexible data integration, with a roughly equal distribution of commercial and open source options. It’s the only true SQL streaming database built from the ground up to meet the needs of modern data products.
Cloudera Contributors: Ayush Saxena, Tamas Mate, Simhadri Govindappa Since we announced the general availability of Apache Iceberg in Cloudera Data Platform (CDP), we are excited to see customers testing their analytic workloads on Iceberg. We will publish follow-up blogs for other data services. To compact the tables, use CDE Spark.
This year, the Snowflake Summit was held in San Francisco from June 2 to 5, while the Databricks Data+AI Summit took place 5 days later, from June 10 to 13, also in San Francisco. Using a quick semantic analysis, "The" means both want to be THE platform you need when you're doing data.
Since the value of data quickly drops over time, organizations need a way to analyze data as it is generated. To avoid disruptions to operational databases, companies typically replicate data to data warehouses for analysis. What is Change Data Capture?
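Change data capture records every write to an operational table in a change log that a downstream loader can consume. A minimal trigger-based sketch using Python's built-in sqlite3, with illustrative table names; production CDC tools typically read the database's transaction log instead of using triggers:

```python
import sqlite3

# Operational table plus a change log populated by triggers: every insert
# and update is captured as an ordered change event.
con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance REAL);
CREATE TABLE change_log (
    seq INTEGER PRIMARY KEY AUTOINCREMENT,
    op TEXT,            -- 'I' insert, 'U' update
    account_id INTEGER,
    balance REAL
);
CREATE TRIGGER accounts_ins AFTER INSERT ON accounts
BEGIN
    INSERT INTO change_log (op, account_id, balance)
    VALUES ('I', NEW.id, NEW.balance);
END;
CREATE TRIGGER accounts_upd AFTER UPDATE ON accounts
BEGIN
    INSERT INTO change_log (op, account_id, balance)
    VALUES ('U', NEW.id, NEW.balance);
END;
""")

# Normal operational writes; the change log fills up as a side effect.
con.execute("INSERT INTO accounts VALUES (1, 100.0)")
con.execute("UPDATE accounts SET balance = 80.0 WHERE id = 1")

# A warehouse loader would poll or stream this log instead of re-scanning
# the whole operational table.
changes = con.execute(
    "SELECT op, account_id, balance FROM change_log ORDER BY seq"
).fetchall()
print(changes)  # [('I', 1, 100.0), ('U', 1, 80.0)]
```

The operational workload is untouched apart from the trigger overhead, while the analytical side gets an ordered stream of changes to replay into the warehouse.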
Summary Managing end-to-end data flows becomes complex and unwieldy as the scale of data and its variety of applications in an organization grows. Part of this complexity is due to the transformation and orchestration of data living in disparate systems. Missing data? Start trusting your data with Monte Carlo today!
Facing the News (credits) Hello Data News readers. If you're late to the party and you need fresh views on LLMs, Daniel wrote an introduction demystifying Large Language Models and Jesse wrote about the impact of LLMs from a Data Engineering perspective. How to reduce warehouse costs? A bittersweet feeling.
Summary The modern data stack has made it more economical to use enterprise grade technologies to power analytics at organizations of every scale. At the Modern Data Company they created the DataOS platform as a means of driving your full analytics lifecycle through code, while providing automatic knowledge graphs and data discovery.
QuantumBlack: Solving data quality for gen AI applications. Unstructured data processing is a top priority for enterprises that want to harness the power of GenAI. It brings challenges in data processing and quality, and what data quality means for unstructured data is a top question for every organization.
Oracle is a well-known technology for hosting Enterprise Data Warehouse solutions. However, many customers like Optum and the U.S. Citizenship and Immigration Services.
In today’s fast-moving world, companies need to glean insights from data as soon as it’s generated. There are two main data processing paradigms: batch processing and stream processing. To access real-time data, organizations are turning to stream processing. Real-time data processing has many use cases.
With instant elasticity, high performance, and secure data sharing across multiple clouds, Snowflake has become highly in demand for its cloud-based data warehouse offering. As organizations adopt Snowflake for business-critical workloads, they also need to look for a modern data integration approach.
Summary Data is a team sport, but it's often difficult for everyone on the team to participate. For a long time the mantra of data tools has been "by developers, for developers", which automatically excludes a large portion of the business members who play a crucial role in the success of any data project.
Scratching the surface (credits) Hey you, a new Friday means data news. This week feels a bit like old data news, with a variety of articles on different cool topics, while I navigate through the actual data trends. Next Monday I'll present "How to build a data dream team" at the Y42 meetup.