Sat.Mar 16, 2019 - Fri.Mar 22, 2019

article thumbnail

SOA vs. EDA: Is Not Life Simply a Series of Events?

Confluent

When should you use an API? When should you use an event? Most contemporary software architectures are some mix of these two approaches. I will attempt to articulate in layman’s terms what an event-driven architecture (EDA) is and contrast it with service-oriented architecture (SOA). In essence, this is an attempt to differentiate and/or associate APIs with events.

article thumbnail

A DataOps vs DevOps Cookoff In The Data Kitchen

Data Engineering Podcast

Summary Delivering a data analytics project on time and with accurate information is critical to the success of any business. DataOps is a set of practices to increase the probability of success by creating value early and often, and using feedback loops to keep your project on course. In this episode Chris Bergh, head chef of Data Kitchen, explains how DataOps differs from DevOps, how the industry has begun adopting DataOps, and how to adopt an agile approach to building your data platform.

article thumbnail

Teradata Has Been Named One of the World's Most Ethical Companies 2019

Teradata

Teradata is thrilled to be named one the of the World’s Most Ethical Companies, for the tenth consecutive year.

99
article thumbnail

Netflix Public Bug Bounty, 1 year later

Netflix Tech

by Astha Singhal (Netflix Application Security) As Netflix continues to create entertainment people love, the security team continues to keep our members, partners, and employees secure. The security research community has partnered with us to improve the security of the Netflix service for the past few years through our responsible disclosure and bug bounty programs.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Kafka Streams’ Take on Watermarks and Triggers

Confluent

Back in May 2017, we laid out why we believe that Kafka Streams is better off without a concept of watermarks or triggers , and instead opts for a continuous refinement model. This article explains how we are fundamentally sticking with this model, while also opening the door for use cases that are incompatible with continuous refinement. By continuous refinement , I mean that Kafka Streams emits new results whenever records are updated.

Kafka 106
article thumbnail

Serverless Data Management: A SQL Search and Analytics Engine

Rockset

When we started Rockset, we envisioned building a powerful cloud data management system that was really easy to use. Making the data stack simpler is fundamental to making data usable by developers and data scientists. Simplifying the Data Stack To that end, we incorporated user-friendly features that alleviate the pain we personally experienced as data practitioners.

SQL 52

More Trending

article thumbnail

What Is Your Value Curve?

Cloudera

Are you floundering in a Red Ocean of competition? Modern enterprises operate in an extremely competitive (red ocean) and turbulent, hyperconnected white water world. To better appreciate the competitive environment in which your company is operating, ask yourself the following questions: Are you confronted with increased competition both domestically and internationally?

Cloud 48
article thumbnail

Running Apache Flink on Kubernetes

Zalando Engineering

Recently, I was developing a small stream processing application using Apache Flink. Zalando uses Kubernetes as the default deployment target, so naturally I wanted to deploy Flink and the developed job to our Kubernetes cluster. I learned a lot about Flink and Kubernetes along the way, which I want to share in this article. Challenges Compliance - At Zalando, all code running in production has to be reviewed by at least two people and all deployed artifacts have to be traceable to a git commit.

article thumbnail

Case Study: The Path to Better Pollution Forecasting Goes Through Nested JSON

Rockset

Think about the steel industry in the US, and you’ll likely think of Pittsburgh. Known as the “Steel City” for leading the nation in steel production in the first half of the 20th century, Pittsburgh also went by the moniker “the Smoky City,” due to the air pollution from steel and other heavy industries. With increased regulation and the decline of the steel industry, Pittsburgh has gotten much cleaner since its darkest, smokiest days in the 1940s, but it still hasn’t shed all the vestiges of s

article thumbnail

Case Study: Fynd Uses Kafka and Rockset to Respond to E-Commerce Consumer Behavior in Real Time

Rockset

Fynd is an online to offline (O2O) fashion e-commerce portal that brings in-store fashion products from retail brands to an online audience. Fynd pulls real-time streams of inventory data from over 9,000 stores in India to provide its 17 million customers up-to-date information on the latest offers and trends in fashion. Data and technology are at the heart of Fynd’s business.

Kafka 40
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Case Study: Implementing Real-Time IoT Analytics Simply and Efficiently - An MIT Smart City Project

Rockset

A group of MIT students traveled to Instituto Alpha Lumen , a school in São José dos Campos, Brazil, in early 2019 to assist in the formation of a smart city initiative. The school offers a rigorous educational program for talented youth, providing students many opportunities to train in science and technology fields and preparing them for studies at top universities internationally and in Brazil.

Project 40
article thumbnail

Open Source: February Updates - Release new projects, join Google Summer of Code Program

Zalando Engineering

Project Highlights Kube Metrics Adapter gained community attention as it was featured in a medium post 'Kubernetes autoscaling with Istio metrics'. Users provided very positive feedback on the project. Kube Metrics Adapter is currently maintained by Developer Productivity team at Zalando. It is a general purpose metrics adapter for Kubernetes that can collect and serve custom and external metrics for Horizontal Pod Autoscaling.