Sun.Jun 02, 2024

article thumbnail

Practical First Steps In Data Governance For Long Term Success

Data Engineering Podcast

Summary Modern businesses aspire to be data driven, and technologists enjoy working through the challenge of building data systems to support that goal. Data governance is the binding force between these two parts of the organization. Nicola Askham found her way into data governance by accident, and stayed because of the benefit that she was able to provide by serving as a bridge between the technology and business.

article thumbnail

Introducing the Open Variant Data Type in Delta Lake and Apache Spark

databricks

We are excited to announce a new data type called variant for semi-structured data. Variant provides an order of magnitude performance improvements compared.

article thumbnail

Data Engineering Weekly #174

Data Engineering Weekly

Data Engineering Weekly is sponsored by Astronomer—Enterprise-Grade Apache Airflow. Deliver data on time with the speed and scale your application demands. Learn More → AI Verify Foundation: Model AI Governance Framework for Generative AI Several countries are working on building governance rules for Gen AI. Data sovereignty will play a vital role as countries formulate regulations.

article thumbnail

Pride 2024: Pride is a verb, not just a noun by Caitlin Salt

Scott Logic

It’s June! It’s Pride month! Rainbows! Love is love! We’re your ally! Buy stuff with rainbows on! Let’s come to your Pride parade, but make sure you tone it down a bit! More rainbows! Buy our products! Look, we’ve put a rainbow on it! We love everyone ! We love absolutely everyone, in a very non-specific way! We definitely love sparkly unicorn rainbows!

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Hosting an internal Engineering Conference

Zalando Engineering

Introduction Our Data Science colleagues had been hosting an internal Data Science Days event for a few years. For our 2,000+ Engineers, we had been missing a similar community event. For several years we wanted to organize one, but got distracted by other priorities and external factors. Finally, in 2022 we decided to commit to hosting an internal Engineering Conference every year and included this commitment in our Engineering Strategy.

article thumbnail

What’s a Data Infrastructure Engineer? Skills, Role, Future & Salary

Monte Carlo

Every day, an uncountable amount of data flows through millions of businesses. Data Infrastructure Engineers are the professionals who ensure that this data flows smoothly and reliably. But what does a data infrastructure engineer do exactly? Table of Contents What is a Data Infrastructure Engineer? Difference Between a Data Infrastructure Engineer and a Data Science Engineer Skill Sets for Data Infrastructure Engineers Technical Expertise Day-to-Day Operations Collaboration with Cross-Functiona