article thumbnail

Managing Uber’s Data Workflows at Scale

Uber Engineering

At Uber’s scale, thousands of microservices serve millions of rides and deliveries a day, generating more than a hundred petabytes of raw data. Internally, engineering and data teams across the company leverage this data to improve the Uber experience.

article thumbnail

11 Data Engineering Best Practices To Streamline Your Data Workflows

ProjectPro

These practices are crucial for building robust and scalable data pipelines, maintaining data quality, and enabling data-driven decision-making. Let us dive into some of the crucial best practices for data engineering that data engineers must implement in their data workflows and projects.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

New Fivetran connector streamlines data workflows for real-time insights

ThoughtSpot

The pathway from ETL to actionable analytics can often feel disconnected and cumbersome, leading to frustration for data teams and long wait times for business users. And even when we manage to streamline the data workflow, those insights aren’t always accessible to users unfamiliar with antiquated business intelligence tools.

article thumbnail

DBT vs. Airflow-Which is The Best Tool for Your Data Workflows?

ProjectPro

Airflow and DBT both have the overall purpose of helping teams in providing reliable data to the users with whom they interact by using a standard interface.

article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. With the 3.0

article thumbnail

Introducing WorkflowGuard: The Workflow Governance and Observability System That Oversees over 120,000 Data Workflows

Uber Engineering

Our Data Workflow Platform team introduces WorkflowGuard: a new service to govern executions, prioritize resources, and manage life cycle for repetitive data jobs. Check out how it improved workflow reliability and cost efficiency while bringing more observability to users.

article thumbnail

Scale Unstructured Text Analytics with Batch LLM Inference

Snowflake

And to create significant technology and team efficiencies, organizations need to consider opportunities to integrate LLM pipelines with existing structured data workflows. This unification can also empower data engineers, who already manage structured pipelines, to easily onboard and maintain unstructured data workflows.