Sat.Mar 03, 2018 - Fri.Mar 09, 2018

article thumbnail

The Future Data Economy with Roger Chen - Episode 21

Data Engineering Podcast

Summary Data is an increasingly sought after raw material for business in the modern economy. One of the factors driving this trend is the increase in applications for machine learning and AI which require large quantities of information to work from. As the demand for data becomes more widespread the market for providing it will begin transform the ways that information is collected and shared among and between organizations.

Raw Data 100
article thumbnail

Altus SDX: Shared services for cloud-based analytics

Cloudera

The real power in machine learning and analytics is when multiple analytics disciplines are able to work together in concert, sharing data in service of solving more complex and more valuable questions. That’s what Cloudera SDX (Shared Data Experience) enables for our customers and why we’re so excited to introduce it today for Cloudera Altus.

Cloud 40
article thumbnail

How to Spot a Bad Product

Zalando Engineering

Red flags to look out for in badly written projects. Let’s talk about common red flags or alternatively, how to define badly-written project. Many of us have experienced a project which is crying and begging for something drastic to change, or even for it to be put out of its misery altogether, but alas; we don’t have the heart or the resources to “pull the plug” as it were.

Project 40
article thumbnail

Just Run a Game Day

Zalando Engineering

Scaling operational excellence Zalando is iterating our production incident handling process. The previous process had a dedicated Tier One 24/7 team who coordinated the incident response communication while escalating to the service-owning Tier Two team(s) for a resolution. That has been rationalized to those service-owning teams handling their incident flow, from alert to post-mortem to reduce time to resolution.

AWS 40
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.