Sun.Jun 02, 2024

article thumbnail

Practical First Steps In Data Governance For Long Term Success

Data Engineering Podcast

Summary Modern businesses aspire to be data driven, and technologists enjoy working through the challenge of building data systems to support that goal. Data governance is the binding force between these two parts of the organization. Nicola Askham found her way into data governance by accident, and stayed because of the benefit that she was able to provide by serving as a bridge between the technology and business.

article thumbnail

Introducing the Open Variant Data Type in Delta Lake and Apache Spark

databricks

We are excited to announce a new data type called variant for semi-structured data. Variant provides an order of magnitude performance improvements compared.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data Engineering Weekly #174

Data Engineering Weekly

Data Engineering Weekly is sponsored by Astronomer—Enterprise-Grade Apache Airflow. Deliver data on time with the speed and scale your application demands. Learn More → AI Verify Foundation: Model AI Governance Framework for Generative AI Several countries are working on building governance rules for Gen AI. Data sovereignty will play a vital role as countries formulate regulations.

article thumbnail

Pride 2024: Pride is a verb, not just a noun by Caitlin Salt

Scott Logic

It’s June! It’s Pride month! Rainbows! Love is love! We’re your ally! Buy stuff with rainbows on! Let’s come to your Pride parade, but make sure you tone it down a bit! More rainbows! Buy our products! Look, we’ve put a rainbow on it! We love everyone ! We love absolutely everyone, in a very non-specific way! We definitely love sparkly unicorn rainbows!

article thumbnail

A Guide to Debugging Apache Airflow® DAGs

In Airflow, DAGs (your data pipelines) support nearly every use case. As these workflows grow in complexity and scale, efficiently identifying and resolving issues becomes a critical skill for every data engineer. This is a comprehensive guide with best practices and examples to debugging Airflow DAGs. You’ll learn how to: Create a standardized process for debugging to quickly diagnose errors in your DAGs Identify common issues with DAGs, tasks, and connections Distinguish between Airflow-relate

article thumbnail

Hosting an internal Engineering Conference

Zalando Engineering

Introduction Our Data Science colleagues had been hosting an internal Data Science Days event for a few years. For our 2,000+ Engineers, we had been missing a similar community event. For several years we wanted to organize one, but got distracted by other priorities and external factors. Finally, in 2022 we decided to commit to hosting an internal Engineering Conference every year and included this commitment in our Engineering Strategy.

article thumbnail

What’s a Data Infrastructure Engineer? Skills, Role, Future & Salary

Monte Carlo

Every day, an uncountable amount of data flows through millions of businesses. Data Infrastructure Engineers are the professionals who ensure that this data flows smoothly and reliably. But what does a data infrastructure engineer do exactly? Table of Contents What is a Data Infrastructure Engineer? Difference Between a Data Infrastructure Engineer and a Data Science Engineer Skill Sets for Data Infrastructure Engineers Technical Expertise Day-to-Day Operations Collaboration with Cross-Functiona