October, 2017

article thumbnail

Introducing AthenaX, Uber Engineering’s Open Source Streaming Analytics Platform

Uber Engineering

Uber facilitates seamless and more enjoyable user experiences by channeling data from a variety of real-time sources. These insights range from in-the-moment traffic conditions that provide guidance on trip routes to the Estimated Time of Delivery (ETD) of an UberEATS … The post Introducing AthenaX, Uber Engineering’s Open Source Streaming Analytics Platform appeared first on Uber Engineering Blog.

article thumbnail

Deep Learning in Cloudera

Cloudera

Deep learning is in the news. It’s changing the game. It’s changing your life. It’s changing everything. It will change the world. It’s good to see people excited about technology. But deep learning is a tool that enterprises use to solve practical problems. Nothing more, and nothing less. In this blog, we provide a few examples that show how organizations put deep learning to work.

article thumbnail

Reattaching Kafka EBS in AWS

Zalando Engineering

At Zalando we’ve created Nakadi , a distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues. It helps to provide an available, durable, and fault tolerant publish/subscribe messaging system for simple microservices communication. A Kafka cluster is able to grow to a huge amount of data stored on the disks. Hosting Kafka requires support of instance termination (on purpose or just because the “cloud provider” decided to terminate the instance), which in our cas

Kafka 40
article thumbnail

Rethinking Data Marts in the Cloud

Cloudera

Published originally on O’Reilly.com. Become more agile with business intelligence and data analytics. Clouds (source: Pexels ). Check out Greg Rahn’s session, “ Rethinking data marts in the cloud: Common architectural patterns for analytics ” at the Strata Data Conference in Singapore, December 4-7, 2017, to learn how to architect analytic workloads in the cloud and the core elements of data governance.

Cloud 44
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

San Francisco Business Times ‘Fast 100’ List of Bay Area’s Fastest-Growing Private Companies – We Made the List!

Cloudera

Cloudera was recently named to the San Francisco Business Times “Fast 100” list of the fastest-growing private companies in the Bay Area. We were selected based on our pre-IPO revenue growth driven by demand for our machine learning and analytic platform. This year’s winners were selected based on percent growth in revenue from fiscal years 2014 to 2016. .

article thumbnail

Big Data Forecast: Cloudy, with Increasing Chances of Success (Part 1)

Cloudera

Today, public cloud is a compelling proposition for businesses and government organizations seeking to be more agile. Increasingly, self-service is seen as the most effective way to scale user access to data for analytics and operations. Cloud elasticity, combined with the right user applications, can reduce the friction of waiting for IT to fulfill requests and provision resources and data.

More Trending

article thumbnail

Cloudera CDH 5.13 Delivers Enhanced Unified Platform and Security Capabilities

Cloudera

Your enterprise big data and machine learning initiatives can be delayed, fail, or risk security breach unless you choose the most mature, most tightly integrated platform. The good news is that enterprise big data and machine learning initiatives will benefit from increased agility and decreased risk with the recent release of CDH 5.13. One primary goal was making multi-disciplinary analytics workloads run smoothly and efficiently in the cloud when backed by our new Shared Data Experience.

article thumbnail

Internet of Things in Healthcare – Three Examples of How IoT is Ushering in Advanced Healthcare

Cloudera

Most of us have seen the news stories and forecasts about the Internet of Things (IoT) and what a vast market and field of opportunity it will be. Hundreds of the world’s largest enterprises now use IoT in ways so innovative, they’re disrupting their own industries. What they’ve found is that by intelligently deploying IoT solutions, they’re able to drive operational efficiencies, introduce new products and services, improve the customer experience and create wholly new business models.

article thumbnail

Announcing the Cloudera Partner Impact Awards!

Cloudera

With an extensive and vibrant partner ecosystem, Cloudera continues to provide an open data platform for machine learning and analytics that transforms how businesses manage data. In fact, a large part of how customers work with Cloudera, is through our diverse partner community. That said, we are proud to announce our inaugural Cloudera Partner Impact Awards where we will highlight partners that set themselves apart by their solutions and business excellence.

article thumbnail

Zalando's Smart Product Platform

Zalando Engineering

Fashion meets tech in our Dublin hub At the Fashion Insights Centre in Dublin, one of the core tech products being developed is the Smart Product Platform (SPP). The fashion products we sell are the fundamental building blocks of what we do as a business. How to manage and represent these products and their associated data in today's competitive fashion marketplace is challenging.

Media 40
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

All Systems Go

Zalando Engineering

Zalando Flies the Fashion Flag at RecSys 2017 RecSys, the annual ACM Recommender Systems Conference held its 11th session this year in the gorgeous city of Como, Italy. As part of our platform strategy, it’s vital that we fully engage with the wider tech community, and so we brought a full team to soak up the great learnings and bring some of our own.

Systems 40
article thumbnail

A Plea For Small Pull Requests

Zalando Engineering

Pull Requests (PRs) are the norm today when it comes to common software development practices in teams. It is the right way to submit code changes so that your peers can check them out, add in their thoughts and help you create the best code you can - i.e. PRs allow us to easily introduce code review to our development process and enable a great deal of teamwork, while also decreasing the number of bugs our software contains.

Coding 40
article thumbnail

On the Road to Full Stack Responsibility

Zalando Engineering

Programming is hard, and being part of an engineering team is even harder. Depending on requirements, cross-functional teams are not equally formed with frontend and backend engineers in most organizations. Also, they are neither stable nor do people have an equal amount of experience. People come and go but software stays on, so we need to buckle up and maintain it.

Scala 40
article thumbnail

Event First Development - Moving Towards Kafka Pipeline Applications

Zalando Engineering

A Challenge Shortly after joining Zalando, I, along with a small number of other new colleagues (in a newly opened Dublin office), was entrusted with the task of building an important part of the new Fashion Platform - in particular, the core services around the Article data of Zalando. This task came with several interesting challenges, not least of which was ensuring the new platform provided not just sufficient capacity/throughput for existing workloads, but also had capacity for longer term

Kafka 40
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.