Sat. May 25, 2019 - Fri. May 31, 2019

Data Lineage For Your Pipelines

Data Engineering Podcast

Summary: Some problems in data are well defined and benefit from a ready-made set of tools. For everything else, there's Pachyderm, the platform for data science that is built to scale. In this episode Joe Doliner, CEO and co-founder, explains how Pachyderm started as an attempt to make data provenance easier to track, how the platform is architected and used today, and gives examples of how the underlying principles manifest in the workflows of data engineers and data scientists as they collaborate.
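
Pachyderm expresses that lineage model through pipeline specs that bind versioned input data to containerized code. As a rough, hypothetical sketch only (the repo name, image, and command are invented, not taken from the episode), such a spec built as a Python dict might look like:

```python
import json

# Hypothetical minimal Pachyderm pipeline spec; names, image, and command are
# placeholders for illustration and a real project's spec would differ.
pipeline_spec = {
    "pipeline": {"name": "word-count"},
    "transform": {
        "image": "python:3.8-slim",
        "cmd": ["python", "/code/count.py"],  # script assumed to exist in the image
    },
    "input": {
        # Every file matching the glob in the input repo becomes a datum,
        # which is what lets Pachyderm track provenance per output.
        "pfs": {"repo": "raw-text", "glob": "/*"}
    },
}

# Serialized, this is roughly the document you would hand to `pachctl create pipeline`.
print(json.dumps(pipeline_spec, indent=2))
```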

3 Easy Ways to Turn Data into Actionable Answers

Teradata

Rob Armstrong explains three critical ways to get better answers from your data.

Deploying Kafka Streams and KSQL with Gradle – Part 2: Managing KSQL Implementations

Confluent

In part 1, we discussed an event streaming architecture that we implemented for a customer using Apache Kafka®, KSQL from Confluent, and Kafka Streams. Now in part 2, we'll discuss the challenges we faced developing, building, and deploying the KSQL portion of our application and how we used Gradle to address them. In part 3, we'll explore using Gradle to build and deploy KSQL user-defined functions (UDFs) and Kafka Streams microservices.
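
The series drives this with Gradle, but the deployment step ultimately amounts to submitting KSQL statements to the KSQL server's REST API. Here is a minimal Python sketch of that idea, with a placeholder server URL and stream definition rather than anything from the article:

```python
import requests

KSQL_SERVER = "http://localhost:8088"  # placeholder; point at your KSQL server

# Placeholder persistent query; the article's actual KSQL scripts will differ.
statement = """
CREATE STREAM pageviews_enriched AS
  SELECT userid, pageid, UCASE(userid) AS userid_upper
  FROM pageviews;
"""

# Submit the statement to the KSQL server's /ksql endpoint.
response = requests.post(
    f"{KSQL_SERVER}/ksql",
    headers={"Content-Type": "application/vnd.ksql.v1+json"},
    json={"ksql": statement, "streamsProperties": {}},
)
response.raise_for_status()
print(response.json())
```

A build tool like Gradle mainly adds ordering, versioning, and environment handling around calls like this one.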

Making our Android Studio Apps Reactive with UI Components & Redux

Netflix Tech

By Juliano Moraes, David Henry, Corey Grunewald & Jim Isaacs. Recently Netflix has started building mobile apps to bring technology and innovation to our Studio Physical Productions, the portion of the business responsible for producing our TV shows and movies. Our very first mobile app is called Prodicle and was built for Android & iOS using the same reactive architecture on both platforms, which allowed us to build 2 apps from scratch in 3 months with 4 software engineers.

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you're creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: an overview of ETL vs. ELT…
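
For flavor, here is a minimal, hypothetical ETL DAG sketch (assuming Airflow 2.x; task logic is stubbed and all names are invented):

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract(**_):
    # Stub: pull raw records from a source system.
    return [{"id": 1, "amount": 10}, {"id": 2, "amount": 20}]


def transform(ti, **_):
    # Stub: double each amount; reads the upstream result via XCom.
    records = ti.xcom_pull(task_ids="extract")
    return [{**r, "amount": r["amount"] * 2} for r in records]


def load(ti, **_):
    # Stub: "load" by printing; a real DAG would write to a warehouse.
    records = ti.xcom_pull(task_ids="transform")
    print(f"loading {len(records)} records")


with DAG(
    dag_id="simple_etl",
    start_date=datetime(2019, 5, 25),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    t_extract = PythonOperator(task_id="extract", python_callable=extract)
    t_transform = PythonOperator(task_id="transform", python_callable=transform)
    t_load = PythonOperator(task_id="load", python_callable=load)

    # Declare the extract -> transform -> load ordering.
    t_extract >> t_transform >> t_load
```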

Cloudera Data Science Workbench: where innovation meets security, compliance and scale on the road to industrialized AI

Cloudera

Gartner states that “By 2022, 75% of new end-user solutions leveraging machine learning (ML) and AI techniques will be built with commercial instead of open source platforms.”¹ Spoiler alert: it's not because data scientists will stop relying on open source for the latest innovation in ML algorithms and development environments. Rather, as businesses look to operationalize machine learning capabilities at scale, they'll turn increasingly to commercial platforms, with connectors to open source…

How to Drive Marketing Personalization in an Increasingly Non-Personal World

Teradata

Tom Casey discusses marketing personalization and why it's important to the modern customer experience.

Using Tableau for Live Dashboards on Event Data

Rockset

Live dashboards can help organizations make sense of their event data and understand what's happening in their businesses in real time. Marketing managers constantly want to know how many signups there were in the last hour, day, or week. Product managers are always looking to understand which product features are working well and most heavily utilized.
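
The tiles on such dashboards typically boil down to time-bucketed aggregations over an events table. As an illustration only, with SQLite standing in for whatever store actually backs the dashboard, hourly signup counts might be computed like this:

```python
import sqlite3

# Stand-in event store: an in-memory SQLite table of signup events.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE signups (user_id TEXT, signed_up_at TEXT)")
conn.executemany(
    "INSERT INTO signups VALUES (?, ?)",
    [
        ("u1", "2019-05-27 09:15:00"),
        ("u2", "2019-05-27 09:45:00"),
        ("u3", "2019-05-27 10:05:00"),
    ],
)

# Hourly signup counts, the kind of aggregation a live dashboard tile would run.
query = """
SELECT strftime('%Y-%m-%d %H:00', signed_up_at) AS hour,
       COUNT(*) AS signups
FROM signups
GROUP BY hour
ORDER BY hour
"""
for hour, count in conn.execute(query):
    print(hour, count)
```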

How we release open source projects

Zalando Engineering

This blog post describes how we manage the process of proposing, reviewing, and approving projects to become open source, while at the same time ensuring that project code follows our compliance rules and that the maintainers of the projects are aware of their responsibilities. See our formal release guidelines. The process involves five steps that take the project from internal source code, through a review phase to our incubator, which eventually results in the project being graduated into our…

17 Ways to Mess Up Self-Managed Schema Registry

Confluent

Part 1 of this blog series by Gwen Shapira explained the benefits of schemas, contracts between services, and compatibility checking for schema evolution. In particular, Confluent Schema Registry makes it easy for developers to use schemas, and it is designed to be highly available. But it's important to configure it properly from the start and manage it well, or else the schemas may not be available to the applications that need them.
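
Many of those mistakes surface where applications register schemas and set compatibility levels. As a hedged sketch (the registry URL, subject name, and schema below are placeholders), the underlying REST calls look roughly like this in Python:

```python
import json
import requests

SCHEMA_REGISTRY = "http://localhost:8081"  # placeholder URL
SUBJECT = "orders-value"                   # placeholder subject name

# A toy Avro schema; a real application would evolve this over time.
order_schema = {
    "type": "record",
    "name": "Order",
    "fields": [
        {"name": "id", "type": "string"},
        {"name": "amount", "type": "double"},
    ],
}

headers = {"Content-Type": "application/vnd.schemaregistry.v1+json"}

# Register the schema under the subject.
resp = requests.post(
    f"{SCHEMA_REGISTRY}/subjects/{SUBJECT}/versions",
    headers=headers,
    json={"schema": json.dumps(order_schema)},
)
resp.raise_for_status()
print("registered schema id:", resp.json()["id"])

# Pin the subject's compatibility level so incompatible changes are rejected.
resp = requests.put(
    f"{SCHEMA_REGISTRY}/config/{SUBJECT}",
    headers=headers,
    json={"compatibility": "BACKWARD"},
)
resp.raise_for_status()
print("compatibility:", resp.json()["compatibility"])
```

Setting an explicit compatibility level per subject is one of the easiest ways to avoid the "mess ups" the post catalogs, because incompatible schema changes are then rejected at registration time rather than discovered by consumers.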