Sat.Jul 28, 2018 - Fri.Aug 03, 2018

article thumbnail

Databook: Turning Big Data into Knowledge with Metadata at Uber

Uber Engineering

From driver and rider locations and destinations, to restaurant orders and payment transactions, every interaction on Uber’s transportation platform is driven by data. Data powers Uber’s global marketplace, enabling more reliable and seamless user experiences across our products for riders, … The post Databook: Turning Big Data into Knowledge with Metadata at Uber appeared first on Uber Engineering Blog.

Metadata 110
article thumbnail

Mobile Data Collection And Analysis Using Ona And Canopy With Peter Lubell-Doughtie - Episode 41

Data Engineering Podcast

Summary With the attention being paid to the systems that power large volumes of high velocity data it is easy to forget about the value of data collection at human scales. Ona is a company that is building technologies to support mobile data collection, analysis of the aggregated information, and user-friendly presentations. In this episode CTO Peter Lubell-Doughtie describes the architecture of the platform, the types of environments and use cases where it is being employed, and the value of s

article thumbnail

A New Era in Data Warehousing

Cloudera

How do you know when your Data Warehousing solution is working well? Surprisingly, when you fail to notice it. Here are some interesting observations that are often taken for granted: Credit card transactions are handled safely. True – millions of credit card transactions are processed within minutes for consistency, fraud and compliance, using petabytes of historical transactions as reference data.

article thumbnail

Recap of Hadoop News for July 2018

ProjectPro

News on Hadoop - July 2018 Hadoop data governance services surface in wake of GDPR.TechTarget.com, July 2, 2018. GDPR has turned out to be a strong motivator that would bring greater governance to big data. At the recent DataWorks Summit 2018 , though most of the attention was focussed on how Hadoop pioneer Hortonworks is all set to expand its service in the cloud, there was great interest and importance put on managing data privacy as well.

Hadoop 52
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Agile Principles Over Frameworks

Zalando Engineering

Embracing the diverse in working agile Very often I get asked what agile working looks like at Zalando. Do we use scrum? Do we use Kanban? Do we work with LeSS? Do we use SaFE? The answer to all of these is, “Yes”. As Agile Coaches we value principles more than frameworks. The principles are derived out of these diverse frameworks and they evolve over time.

article thumbnail

Azure Marketplace features Cloudera Customer 360 offering

Cloudera

Cloudera’s diverse and expansive partner ecosystem includes major tech companies constantly redefining the industry, consultancies guiding some of the most comprehensive digital transformations, fast-emerging ISVs challenging status-quo, and cloud companies providing unparalleled flexibility and scalability. Individually, these companies deliver great value to customers, so imagine the business outcomes and customer benefits made possible when two or more of these companies develop a joint offer