Sat.Sep 19, 2020 - Fri.Sep 25, 2020

article thumbnail

Apache Kafka DevOps with Kubernetes and GitOps

Confluent

Operating critical Apache Kafka® event streaming applications in production requires sound automation and engineering practices. Streaming applications are often at the center of your transaction processing and data systems, requiring […].

Kafka 143
article thumbnail

Five Steps Towards Delivering Better Analytic Outcomes

Teradata

Get tips on how to cast a more critical eye on the seemingly endless amount of data-driven conclusions presented to us. Learn more.

Data 106
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Cutting Through The Noise And Focusing On The Fundamentals Of Data Engineering With The Data Janitor

Data Engineering Podcast

Summary Data engineering is a constantly growing and evolving discipline. There are always new tools, systems, and design patterns to learn, which leads to a great deal of confusion for newcomers. Daniel Molnar has dedicated his time to helping data professionals get back to basics through presentations at conferences and meetups, and with his most recent endeavor of building the Pipeline Data Engineering Academy.

article thumbnail

Operational Database Security – Part 2

Cloudera

In this blogpost, we are going to take a look at some of the OpDB related security features of a CDP Private Cloud Base deployment. We are going to talk about auditing, different security levels, security features of Data Catalog, and Client Considerations. You can find part 1 of this series, here. . Auditing. Comprehensive auditing is provided to enable enterprises to effectively and efficiently meet their compliance requirements by auditing access and other types of operations across OpDB (thr

article thumbnail

A Guide to Debugging Apache Airflow® DAGs

In Airflow, DAGs (your data pipelines) support nearly every use case. As these workflows grow in complexity and scale, efficiently identifying and resolving issues becomes a critical skill for every data engineer. This is a comprehensive guide with best practices and examples to debugging Airflow DAGs. You’ll learn how to: Create a standardized process for debugging to quickly diagnose errors in your DAGs Identify common issues with DAGs, tasks, and connections Distinguish between Airflow-relate

article thumbnail

Building a Machine Learning Logging Pipeline with Kafka Streams at Twitter

Confluent

Twitter, one of the most popular social media platforms today, is well known for its ever-changing environment—user behaviors evolve quickly; trends are dynamic and versatile; and special and emergent events […].

article thumbnail

Accelerate Your Path to a Modern Analytics Architecture

Teradata

A modern analytics architecture means something different to everyone. What does it mean for your organization? Find out more.

More Trending

article thumbnail

Cloudera Data Platform in AWS Marketplace Simplifies and Accelerates Cloud Adoption

Cloudera

As organizations look to optimize the speed and cost of their cloud journey in today’s rapidly evolving economy, Cloudera is delighted to announce the availability of Cloudera Data Platform (CDP) Public Cloud in AWS Marketplace. Now customers can easily, confidently and cost-effectively discover, procure and deploy the world’s first Enterprise Data Cloud, powered by AWS, for faster time-to-insight from their advanced analytics and machine learning services.

AWS 83
article thumbnail

Infrastructure Modernization with Google Anthos and Apache Kafka

Confluent

The promise of cloud computing is simplicity, speed, and cost savings. But what about workloads that can’t move to the cloud? Are they stuck using expensive legacy tooling and practices? […].

Kafka 59
article thumbnail

Scala 3: New Types Quickly Explained

Rock the JVM

Explore the Game-Changing New Types in Scala 3: What We're Eagerly Anticipating

Scala 52
article thumbnail

Building a Data Science Platform in 10 days

Afterpay Tech

Photo by Pietro Jeng on Unsplash By Letian Wang Context At Afterpay, we are generating lots of data from customer transactions, website views and consumer referrals every day. Being able to derive insights from this data, and to use those insights to  improve our consumer experience and provide value to our merchants and consumers, is a critical competitive differentiator for Afterpay.

article thumbnail

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

Speaker: Tamara Fingerlin, Developer Advocate

article thumbnail

Choosing the right Data Warehouse SQL Engine: Apache Hive LLAP vs Apache Impala

Cloudera

Aren’t two superheroes better than one? Some of the most powerful results come from combining complementary superpowers, and the “dynamic duo” of Apache Hive LLAP and Apache Impala, both included in Cloudera Data Warehouse , is further evidence of this. Both Impala and Hive can operate at an unprecedented and massive scale, with many petabytes of data.

article thumbnail

Exports is not a function

Grouparoo

I have been working on the Salesforce integration. That experience will be its own story. In the process, though, I found something tricky that I might be uniquely experiencing given the combinatorics of the modern Node/Javascript/Typescript world. Grouparoo connects with sources, processes the data from them, and sends that data to destinations. When data comes from a source, we call it an import.

Coding 52
article thumbnail

Customer Journey Analytics & Real-Time Marketing: Lessons Learned from Those That Got it Right

Teradata

Learn about the early adopters for both Customer Journey Analytics and Real-Time Marketing who overcame initial hurdles and realized superior business outcomes.

IT 52
article thumbnail

Build a Slack Dashboard (Part 1): Extracting Data Using Meltano

Preset

Build a beautiful Slack dashboard using open source tools Meltano and Superset. Part 1 of 3.

article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Using Data to Drive Meaningful Diversity and Inclusion Efforts

Cloudera

A conversation with civil rights activist and author Dr. Mary Frances Berry about the importance of data in diversity and inclusions initiatives. . COVID-19 has forced businesses to change in ways we didn’t know was possible, and at a speed many had never imagined. The summer of 2020 made something else very clear; while we’ve been agile on digital transformation in business, we have failed to tap data to help us confront deep-seated social justice issues.

Data 68
article thumbnail

Today’s ‘Breakfast Roll People’ Will Change How Energy Retail Operates

Teradata

What will it take for energy retailers to transition to world-class segment leaders? The answer is millions of modest improvements, implemented by business users themselves.

Retail 52