article thumbnail

Level Up Your Data Platform With Active Metadata

Data Engineering Podcast

Summary Metadata is the lifeblood of your data platform, providing information about what is happening in your systems. In order to level up their value a new trend of active metadata is being implemented, allowing use cases like keeping BI reports up to date, auto-scaling your warehouses, and automated data governance.

Metadata 130
article thumbnail

Is Apache Iceberg the New Hadoop? Navigating the Complexities of Modern Data Lakehouses

Data Engineering Weekly

This ecosystem includes: Catalogs: Services that manage metadata about Iceberg tables (e.g., Compute Engines: Tools that query and process data stored in Iceberg tables (e.g., Maintenance Processes: Operations that optimize Iceberg tables, such as compacting small files and managing metadata. Trino, Spark, Snowflake, DuckDB).

Hadoop 57
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Collecting And Retaining Contextual Metadata For Powerful And Effective Data Discovery

Data Engineering Podcast

report having current investments in automation, 85% of data teams plan on investing in automation in the next 12 months. The Ascend Data Automation Cloud provides a unified platform for data ingestion, transformation, orchestration, and observability. In fact, while only 3.5% That’s where our friends at Ascend.io

Metadata 100
article thumbnail

Manufacturing Data Ingestion into Snowflake

Snowflake

Working with our partners, this architecture includes MQTT-based data ingestion into Snowflake. This provides a highly scalable, fast, flexible (OT data published by exception from edge to cloud), and secure communication to Snowflake. Stay tuned for more insights on Industry 4.0 and supply chain in the coming months.

article thumbnail

Data Engineering Zoomcamp – Data Ingestion (Week 2)

Hepta Analytics

DE Zoomcamp 2.2.1 – Introduction to Workflow Orchestration Following last weeks blog , we move to data ingestion. We already had a script that downloaded a csv file, processed the data and pushed the data to postgres database. This week, we got to think about our data ingestion design.

article thumbnail

Data Engineering Weekly #213

Data Engineering Weekly

link] LinkedIn: Journey of next-generation control plane for data systems LinkedIn writes about the evolution of Nuage, its internal control plane framework for managing data infrastructure resources. link] Grab: Improving Hugo's stability and addressing oncall challenges through automation.

article thumbnail

Improved Ascend for Databricks, New Lineage Visualization, and Better Incremental Data Ingestion

Ascend.io

Instead, it is a Sankey diagram driven by the same dynamic metadata that runs the Ascend control plane. Other data ingestion enhancements include: Incremental read for MS SQL can now be based on a datetime column Native data types support in our Salesforce Read Connector and support for the new Hubspot API token.