Data Management, Data Warehouse and Demo

Introducing Agent Bricks: Auto-Optimized Agents Using Your Data

databricks

JUNE 11, 2025

Events Data + AI Summit Data + AI World Tour Data Intelligence Days Event Calendar Blog and Podcasts Databricks Blog Explore news, product announcements, and more Databricks Mosaic Research Blog Discover the latest in our Gen AI research Data Brew Podcast Let’s talk data! REGISTER Ready to get started?

Entertainment

Entertainment Manufacturing Consulting Retail

Scale Your Analytics On The Clickhouse Data Warehouse

Data Engineering Podcast

JULY 8, 2019

Summary The market for data warehouse platforms is large and varied, with options for every use case. It was interesting to learn about some of the custom data types and performance optimizations that are included. What are some of the advanced capabilities, such as SQL extensions, supported data types, etc.

Data Warehouse

Data Warehouse MySQL Data Lake Hadoop

What Is a Lakebase?

databricks

JUNE 11, 2025

Events Data + AI Summit Data + AI World Tour Data Intelligence Days Event Calendar Blog and Podcasts Databricks Blog Explore news, product announcements, and more Databricks Mosaic Research Blog Discover the latest in our Gen AI research Data Brew Podcast Let’s talk data! REGISTER Ready to get started?

Entertainment

Entertainment Data Lake Manufacturing Consulting

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

Secure Data Sharing and Interoperability Powered by Iceberg REST Catalog

Cloudera

DECEMBER 3, 2024

Solution Overview Data sharing is the capability to share data managed in Cloudera , specifically Iceberg tables, with external users (clients) who are outside of the Cloudera environment. In this case I’m using a role named – “UnitedAirlinesRole” that I can use to share data.

Metadata

Metadata SQL Data Warehouse Database

Making The Total Cost Of Ownership For External Data Manageable With Crux

Data Engineering Podcast

JULY 17, 2022

In this episode Crux CTO Mark Etherington discusses the different costs involved in managing external data, how to think about the total return on investment for your data, and how the Crux platform is architected to reduce the toil involved in managing third party data. Tired of deploying bad data?

Data Management

Data Management Management Metadata MongoDB

Leading The Charge For The ELT Data Integration Pattern For Cloud Data Warehouses At Matillion

Data Engineering Podcast

MAY 1, 2022

He describes how the platform is architected, the challenges related to selling cloud technologies into enterprise organizations, and how you can adopt Matillion for your own workflows to reduce the maintenance burden of data integration workflows. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold.

Data Warehouse

Data Warehouse Data Integration Cloud Google Cloud

An Agile Approach To Master Data Management with Mark Marinelli - Episode 46

Data Engineering Podcast

SEPTEMBER 3, 2018

Summary With the proliferation of data sources to give a more comprehensive view of the information critical to your business it is even more important to have a canonical view of the entities that you care about. Request a demo at dataengineeringpodcast.com/metis-machine to learn more about how Metis Machine is operationalizing data science.

Data Management

Data Management Management Relational Database Business Intelligence

AWS at Databricks Data + AI Summit 2025

databricks

JUNE 4, 2025

Events Data + AI Summit Data + AI World Tour Data Intelligence Days Event Calendar Blog and Podcasts Databricks Blog Explore news, product announcements, and more Databricks Mosaic Research Blog Discover the latest in our Gen AI research Data Brew Podcast Let’s talk data! REGISTER Ready to get started?

AWS

AWS Entertainment Manufacturing Media

Mosaic AI Announcements at Data + AI Summit 2025

databricks

JUNE 11, 2025

Events Data + AI Summit Data + AI World Tour Data Intelligence Days Event Calendar Blog and Podcasts Databricks Blog Explore news, product announcements, and more Databricks Mosaic Research Blog Discover the latest in our Gen AI research Data Brew Podcast Let’s talk data! REGISTER Ready to get started?

Entertainment

Entertainment Manufacturing Consulting Retail

Using Product Driven Development To Improve The Productivity And Effectiveness Of Your Data Teams

Data Engineering Podcast

DECEMBER 28, 2022

In this episode he shares his thoughts on the strategic and tactical elements of moving your work as a data professional from being task-oriented to being product-oriented and the long term improvements in your productivity that it provides. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold.

Data Lake

Data Lake Data Warehouse Data Pipeline MongoDB

Simple And Scalable Encryption Of Data In Use For Analytics And Machine Learning With Opaque Systems

Data Engineering Podcast

DECEMBER 25, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you're ready to build your next pipeline, or want to test out the projects you hear about on the show, you'll need somewhere to deploy it, so check out our friends at Linode. or any other destination you choose.

Machine Learning

Machine Learning Systems Data Lake Data Warehouse

Introducing Databricks One

databricks

JUNE 12, 2025

Get a Demo DATA + AI SUMMIT Data + AI Summit Happening Now Watch the free livestream of the keynotes! Join now Ready to get started? 160 Spear Street, 15th Floor San Francisco, CA 94105 1-866-330-0121 See Careers at Databricks © Databricks 2025.

BI

BI Entertainment Manufacturing Consulting

Data Lake vs. Data Warehouse vs. Data Lakehouse

Sync Computing

NOVEMBER 7, 2024

Data volume and velocity, governance, structure, and regulatory requirements have all evolved and continue to. Despite these limitations, data warehouses, introduced in the late 1980s based on ideas developed even earlier, remain in widespread use today for certain business intelligence and data analysis applications.

Data Lake

Data Lake Data Warehouse Business Intelligence Unstructured Data

Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

Data Engineering Podcast

NOVEMBER 6, 2022

In this episode she shares the story behind the project, the details of how it is implemented, and how you can use it for your own data projects. Datafold integrates with all major data warehouses as well as frameworks such as Airflow & dbt and seamlessly plugs into CI workflows. Who is the target audience for Zingg?

MongoDB

MongoDB MySQL Scala Data Lake

A Primer On Enterprise Data Curation with Todd Walter - Episode 49

Data Engineering Podcast

SEPTEMBER 23, 2018

This includes modeling the lifecycle of your information as a pipeline from the raw, messy, loosely structured records in your data lake, through a series of transformations and ultimately to your data warehouse. How do you define data curation?

Data Lake

Data Lake Data Warehouse Data Architecture Architecture

Combining The Simplicity Of Spreadsheets With The Power Of Modern Data Infrastructure At Canvas

Data Engineering Podcast

JUNE 19, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Metadata

Metadata Unstructured Data MongoDB MySQL

Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet

Data Engineering Podcast

NOVEMBER 20, 2022

In this episode CEO and founder Salma Bakouk shares her views on the causes and impacts of "data entropy" and how you can tame it before it leads to failures. Datafold integrates with all major data warehouses as well as frameworks such as Airflow & dbt and seamlessly plugs into CI workflows.

Data Lake

Data Lake MongoDB Data Ingestion MySQL

Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt

Data Engineering Podcast

OCTOBER 30, 2022

In this episode he shares his experiences working with organizations to adopt analytics engineering patterns and the ways that Optimus and dbt were combined to let data analysts deliver insights without the roadblocks of complex pipeline management. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold.

Engineering

Engineering MongoDB MySQL Scala

Modernizing Data Platforms for AI/ML and Generative AI: The Case for Migrating from Hadoop to Teradata Vantage

Teradata

APRIL 22, 2025

With nearly 20 years of experience in business analytics and intelligence, he's driven impactful insights for global sales, marketing, and product management teams. His expertise spans from operational platforms to emerging data management paradigms.

Hadoop

Hadoop Database-centric Media Big Data

Low Friction Data Governance With Immuta

Data Engineering Podcast

DECEMBER 21, 2020

If you are starting down the path of implementing a data governance strategy then this episode will provide a great overview of what is involved. If you hand a book to a new data engineer, what wisdom would you add to it? What is data governance? If you hand a book to a new data engineer, what wisdom would you add to it?

Data Governance

Data Governance Government Data Lake Banking

Bring Geospatial Analytics Across Disparate Datasets Into Your Toolkit With The Unfolded Platform

Data Engineering Podcast

JUNE 26, 2022

In this episode Isaac Brodsky explains how the Unfolded platform is architected, their experience joining the team at Foursquare, and how you can start using it for analyzing your spatial data today. Datafold integrates with all major data warehouses as well as frameworks such as Airflow & dbt and seamlessly plugs into CI workflows.

Datasets

Datasets Unstructured Data Metadata MongoDB

Combining Transactional And Analytical Workloads On MemSQL with Nikita Shamgunov - Episode 51

Data Engineering Podcast

OCTOBER 9, 2018

Summary One of the most complex aspects of managing data for analytical workloads is moving it from a transactional database into the data warehouse. This was a deep dive on how to build a successful company around a powerful platform, and how that platform simplifies operations for enterprise grade data management.

PostgreSQL

PostgreSQL BI Data Warehouse Machine Learning

Modern Data Management Essentials: Exploring Data Fabric

Precisely

JULY 18, 2024

Key Takeaways Data Fabric is a modern data architecture that facilitates seamless data access, sharing, and management across an organization. Data management recommendations and data products emerge dynamically from the fabric through automation, activation, and AI/ML analysis of metadata.

Data Management

Data Management Management Metadata Database-centric

How to Choose the Right Data Management Solution

The Modern Data Company

MAY 10, 2023

In our previous post, The Pros and Cons of Leading Data Management and Storage Solutions , we untangled the differences among data lakes, data warehouses, data lakehouses, data hubs, and data operating systems. What factors are most important when building a data management ecosystem?

Data Management

Data Management Management Data Lake Data Warehouse

Building A New Foundation For CouchDB

Data Engineering Podcast

MARCH 16, 2020

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Building

Building Data Warehouse NoSQL Data Lake

Digital Transformation is a Data Journey From Edge to Insight

Cloudera

JANUARY 20, 2021

Most of what is written though has to do with the enabling technology platforms (cloud or edge or point solutions like data warehouses) or use cases that are driving these benefits (predictive analytics applied to preventive maintenance, financial institution’s fraud detection, or predictive health monitoring as examples) not the underlying data.

Manufacturing

Manufacturing Data Warehouse Kafka Retail

How to Choose the Right Data Management Solution

The Modern Data Company

MAY 10, 2023

In our previous post, The Pros and Cons of Leading Data Management and Storage Solutions , we untangled the differences among data lakes, data warehouses, data lakehouses, data hubs, and data operating systems. What factors are most important when building a data management ecosystem?

Data Management

Data Management Management Data Lake Data Warehouse

How to Choose the Right Data Management Solution

The Modern Data Company

MAY 10, 2023

In our previous post, The Pros and Cons of Leading Data Management and Storage Solutions , we untangled the differences among data lakes, data warehouses, data lakehouses, data hubs, and data operating systems. What factors are most important when building a data management ecosystem?

Data Management

Data Management Management Data Lake Data Warehouse

Accelerate Development Of Enterprise Analytics With The Coalesce Visual Workflow Builder

Data Engineering Podcast

APRIL 3, 2022

Summary The flexibility of software oriented data workflows is useful for fulfilling complex requirements, but for simple and repetitious use cases it adds significant complexity. Coalesce is a platform designed to reduce repetitive work for common workflows by adopting a visual pipeline builder to support your data warehouse transformations.

Data Warehouse

Data Warehouse Data Workflow Data Architecture Software Engineer

Revisit The Fundamental Principles Of Working With Data To Avoid Getting Caught In The Hype Cycle

Data Engineering Podcast

DECEMBER 18, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you're ready to build your next pipeline, or want to test out the projects you hear about on the show, you'll need somewhere to deploy it, so check out our friends at Linode. or any other destination you choose.

Data Lake

Data Lake Data Warehouse Data Pipeline MongoDB

Data Warehouse Migration Best Practices

Monte Carlo

FEBRUARY 6, 2023

So, you’re planning a cloud data warehouse migration. But be warned, a warehouse migration isn’t for the faint of heart. As you probably already know if you’re reading this, a data warehouse migration is the process of moving data from one warehouse to another. A worthy quest to be sure.

Data Warehouse

Data Warehouse AWS Data Validation Data

Easily Build Advanced Similarity Search With The Pinecone Vector Database

Data Engineering Podcast

MAY 25, 2021

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. How is Pinecone implemented?

Database

Database Building Data Warehouse Machine Learning

The Pros and Cons of Leading Data Management and Storage Solutions

The Modern Data Company

MAY 8, 2023

Data lakes, data warehouses, data hubs, data lakehouses, and data operating systems are data management and storage solutions designed to meet different needs in data analytics, integration, and processing. However, data warehouses can experience limitations and scalability challenges.

Data Management

Data Management Management Data Lake Data Governance

The Pros and Cons of Leading Data Management and Storage Solutions

The Modern Data Company

MAY 8, 2023

Data lakes, data warehouses, data hubs, data lakehouses, and data operating systems are data management and storage solutions designed to meet different needs in data analytics, integration, and processing. However, data warehouses can experience limitations and scalability challenges.

Data Management

Data Management Management Data Lake Data Governance

Adopting Real-Time Data At Organizations Of Every Size

Data Engineering Podcast

DECEMBER 4, 2022

In this episode Arjun Narayan explains how the technical barriers to adopting real-time data in your analytics and applications have become surmountable by organizations of all sizes. Datafold integrates with all major data warehouses as well as frameworks such as Airflow & dbt and seamlessly plugs into CI workflows.

Data Lake

Data Lake MongoDB MySQL Data Warehouse

The Pros and Cons of Leading Data Management and Storage Solutions

The Modern Data Company

MAY 8, 2023

Data lakes, data warehouses, data hubs, data lakehouses, and data operating systems are data management and storage solutions designed to meet different needs in data analytics, integration, and processing. However, data warehouses can experience limitations and scalability challenges.

Data Management

Data Management Management Data Lake Data Governance

Unlocking The Power of Data Lineage In Your Platform with OpenLineage

Data Engineering Podcast

MAY 18, 2021

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

Metadata

Metadata Kafka Data Warehouse Hadoop

Run Your Applications Worldwide Without Worrying About The Database With Planetscale

Data Engineering Podcast

DECEMBER 11, 2022

Summary One of the most critical aspects of software projects is managing its data. Managing the operational concerns for your database can be complex and expensive, especially if you need to scale to large volumes of data, high traffic, or geographically distributed usage. or any other destination you choose.

Database

Database MySQL Data Lake MongoDB

Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase

Data Engineering Podcast

NOVEMBER 27, 2022

He also discusses the improvements that have been incorporated into FeatureBase to simplify integration with the rest of your data stack, and the SQL interface that was added to make working with the product easier. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold. or any other destination you choose.

Data Lake

Data Lake Data Warehouse MongoDB MySQL

Making Data Pipelines Self-Serve For Everyone With Shipyard

Data Engineering Podcast

JUNE 1, 2021

This is an interesting conversation about how to make data more accessible and more useful by improving the user experience of the tools that we create. RudderStack’s smart customer data pipeline is warehouse-first. Mention that you’re a Data Engineering Podcast listener, and they’ll send you a free t-shirt.

Data Pipeline

Data Pipeline Data Warehouse Data Workflow Data

Operational Analytics At Speed With Minimal Busy Work Using Incorta

Data Engineering Podcast

APRIL 24, 2022

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. Missing data?

Data Warehouse

Data Warehouse Data Lake BI Data Pipeline

Building Real-Time Data Platforms For Large Volumes Of Information With Aerospike

Data Engineering Podcast

OCTOBER 2, 2021

If you need to deal with massive data, at high velocities, in milliseconds, then Aerospike is definitely worth learning about. Datafold integrates with all major data warehouses as well as frameworks such as Airflow & dbt and seamlessly plugs into CI workflows. Can you describe what Aerospike is and the story behind it?

Building

Building BI Data Architecture Data Warehouse

Connecting To The Next Frontier Of Computing With Quantum Networks

Data Engineering Podcast

APRIL 17, 2022

In this episode Prineha Narang, co-founder and CTO of Aliro, explains how these systems work, the capabilities that they can offer, and how you can start preparing for a post-quantum future for your data systems. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold.

Data Warehouse

Data Warehouse SQL Data Engineer Data Engineering

Shining A Light on Shadow IT In Data And Analytics

Data Engineering Podcast

FEBRUARY 24, 2020

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

IT

IT Data Lake Data Pipeline Data Warehouse

Introducing Agent Bricks: Auto-Optimized Agents Using Your Data

Scale Your Analytics On The Clickhouse Data Warehouse

Webinars

Trending Sources

What Is a Lakebase?

Webinars

Secure Data Sharing and Interoperability Powered by Iceberg REST Catalog

Making The Total Cost Of Ownership For External Data Manageable With Crux

Leading The Charge For The ELT Data Integration Pattern For Cloud Data Warehouses At Matillion

An Agile Approach To Master Data Management with Mark Marinelli - Episode 46

AWS at Databricks Data + AI Summit 2025

Mosaic AI Announcements at Data + AI Summit 2025

Using Product Driven Development To Improve The Productivity And Effectiveness Of Your Data Teams

Simple And Scalable Encryption Of Data In Use For Analytics And Machine Learning With Opaque Systems

Introducing Databricks One

Data Lake vs. Data Warehouse vs. Data Lakehouse

Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

A Primer On Enterprise Data Curation with Todd Walter - Episode 49

Combining The Simplicity Of Spreadsheets With The Power Of Modern Data Infrastructure At Canvas

Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet

Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt

Modernizing Data Platforms for AI/ML and Generative AI: The Case for Migrating from Hadoop to Teradata Vantage

Low Friction Data Governance With Immuta

Bring Geospatial Analytics Across Disparate Datasets Into Your Toolkit With The Unfolded Platform

Combining Transactional And Analytical Workloads On MemSQL with Nikita Shamgunov - Episode 51

Modern Data Management Essentials: Exploring Data Fabric

How to Choose the Right Data Management Solution

Building A New Foundation For CouchDB

Digital Transformation is a Data Journey From Edge to Insight

How to Choose the Right Data Management Solution

How to Choose the Right Data Management Solution

Accelerate Development Of Enterprise Analytics With The Coalesce Visual Workflow Builder

Revisit The Fundamental Principles Of Working With Data To Avoid Getting Caught In The Hype Cycle

Data Warehouse Migration Best Practices

Easily Build Advanced Similarity Search With The Pinecone Vector Database

The Pros and Cons of Leading Data Management and Storage Solutions

The Pros and Cons of Leading Data Management and Storage Solutions

Adopting Real-Time Data At Organizations Of Every Size

The Pros and Cons of Leading Data Management and Storage Solutions

Unlocking The Power of Data Lineage In Your Platform with OpenLineage

Run Your Applications Worldwide Without Worrying About The Database With Planetscale

Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase

Making Data Pipelines Self-Serve For Everyone With Shipyard

Operational Analytics At Speed With Minimal Busy Work Using Incorta

Building Real-Time Data Platforms For Large Volumes Of Information With Aerospike

Connecting To The Next Frontier Of Computing With Quantum Networks

Shining A Light on Shadow IT In Data And Analytics

Stay Connected