Sun.Oct 27, 2024

article thumbnail

Data Engineering Weekly #195

Data Engineering Weekly

Astasia Myers: The three components of the unstructured data stack LLMs and vector databases significantly improved the ability to process and understand unstructured data. I never thought of PDF as a self-contained document database, but that seems a reality that we can’t deny. The blog is an excellent summary of the existing unstructured data landscape.

article thumbnail

Announcing General Availability: Publish to Microsoft Power BI Service from Unity Catalog

databricks

We're excited to announce the General Availability of Publish to Microsoft Power BI Service from Unity Catalog, an integration that makes it easy.

BI 119
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

DataMesh: How Uber laid the foundations for the data lake cloud migration

Uber Engineering

Learn how Uber is streamlining the Cloud migration of its massive Data Lake by incorporating key Data Mesh principles.

article thumbnail

Sparkle: Standardizing Modular ETL at Uber

Uber Engineering

Discover how Uber’s in-house ETL framework helps standardize modular ETL development, improving developer productivity and ensuring reliable data pipelines.

article thumbnail

A Guide to Debugging Apache Airflow® DAGs

In Airflow, DAGs (your data pipelines) support nearly every use case. As these workflows grow in complexity and scale, efficiently identifying and resolving issues becomes a critical skill for every data engineer. This is a comprehensive guide with best practices and examples to debugging Airflow DAGs. You’ll learn how to: Create a standardized process for debugging to quickly diagnose errors in your DAGs Identify common issues with DAGs, tasks, and connections Distinguish between Airflow-relate

article thumbnail

Genie: Uber’s Gen AI On-Call Copilot

Uber Engineering

Explore how Uber is leveraging Genie, its Generative AI-powered On-Call CoPilot, to transform on-call operations and empower engineering teams.

article thumbnail

Preon: Presto Query Analysis for Intelligent and Efficient Analytics

Uber Engineering

Discover how to enable intelligent and efficient data analytics at Uber scale with Preon, a Presto Query Analysis service that unlocks insights for deduplicating queries, creating efficient table layouts, and more.