Sat.Mar 02, 2019 - Fri.Mar 08, 2019

article thumbnail

Open Sourcing Peloton, Uber’s Unified Resource Scheduler

Uber Engineering

First introduced by Uber in November 2018, Peloton , a unified resource scheduler, manages resources across distinct workloads, combining separate compute clusters. Peloton is designed for web-scale companies like Uber with millions of containers and tens of thousands of nodes. … The post Open Sourcing Peloton, Uber’s Unified Resource Scheduler appeared first on Uber Engineering Blog.

article thumbnail

Customer Analytics At Scale With Segment

Data Engineering Podcast

Summary Customer analytics is a problem domain that has given rise to its own industry. In order to gain a full understanding of what your users are doing and how best to serve them you may need to send data to multiple services, each with their own tracking code or APIs. To simplify this process and allow your non-engineering employees to gain access to the information they need to do their jobs Segment provides a single interface for capturing data and routing it to all of the places that you

article thumbnail

CloudBank’s Journey from Mainframe to Streaming with Confluent Cloud

Confluent

Cloud is one of the key drivers for innovation. Innovative companies experiment with data to come up with something useful. It usually starts with the opening of a firehose that continuously broadcasts tons of events before they start mining it to create music out of simply noise. Today, companies from all around the world are witnessing an explosion of event generation coming from everywhere, including their own internal systems.

Cloud 89
article thumbnail

How Data Inspires Building a Scalable, Resilient and Secure Cloud Infrastructure At Netflix

Netflix Tech

Netflix’s engineering culture is predicated on Freedom & Responsibility, the idea that everyone (and every team) at Netflix is entrusted with a core responsibility and they are free to operate with freedom to satisfy their mission. This freedom allows teams and individuals to move fast to deliver on innovation and feel responsible for quality and robustness of their delivery.

Cloud 74
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Using Machine Learning to Ensure the Capacity Safety of Individual Microservices

Uber Engineering

Reliability engineering teams at Uber build the tools, libraries, and infrastructure that enable engineers to operate our thousands of microservices reliably at scale. At its essence, reliability engineering boils down to actively preventing outages that affect the mean time between … The post Using Machine Learning to Ensure the Capacity Safety of Individual Microservices appeared first on Uber Engineering Blog.

article thumbnail

Building a Diverse and Inclusive Teradata

Teradata

Teradata CEO Oliver Ratzesberger celebrates International Women's Day.

More Trending

article thumbnail

Advancing Analytics at #SQLBits 2019

Advancing Analytics: Data Engineering

Today is my first day back in the office after attending SQLBits in Manchester last week. SQLBits is the UK's largest Microsoft Data Platform conference. What makes this event special is that it is organised for the community, by the community and is not for profit - All the proceeds go in to funding the event and in particular, the awesome Friday night party.

article thumbnail

Themes and Conferences per Pacoid, Episode 7

Domino Data Lab: Data Engineering

Paco Nathan covers recent research on data infrastructure as well as adoption of machine learning and AI in the enterprise.

article thumbnail

Is Your Data Scientist Team Contributing to Your Company’s ROI?

Teradata

Find out how data scientists can contribute to solving real-life business problems, as well as how Teradata Vantage can help data scientists enable a better ROI for their company.

Data 49
article thumbnail

Trace Event, Chrome and More Profile Formats on FlameScope

Netflix Tech

FlameScope sub-second heatmap visualization. Less than a year ago, FlameScope was released as a proof of concept for a new profile visualization. Since then, it helped us, and many other users, to easily find and fix performance issues, and allowed us to see patterns that we had never noticed before in our profiles. As a tool, FlameScope was limited.

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

MezzFS?—?Mounting object storage in Netflix’s media processing platform

Netflix Tech

MezzFS?—?Mounting object storage in Netflix’s media processing platform By Barak Alon (on behalf of Netflix’s Media Cloud Engineering team) MezzFS (short for “Mezzanine File System”) is a tool we’ve developed at Netflix that mounts cloud objects as local files via FUSE. It’s used extensively in our media processing platform, which includes services like Archer and runs features like video encoding and title image generation on tens of thousands of Amazon EC2 instances.

Media 88
article thumbnail

Design Principles for Mathematical Engineering in Experimentation Platform

Netflix Tech

Design Principles for Mathematical Engineering in Experimentation Platform at Netflix Jeffrey Wong, Senior Modeling Architect, Experimentation Platform Colin McFarland, Director, Experimentation Platform At Netflix, we have data scientists coming from many backgrounds such as neuroscience, statistics and biostatistics, economics, and physics; each of these backgrounds has a meaningful contribution to how experiments should be analyzed.