Sat.Sep 19, 2020 - Fri.Sep 25, 2020

article thumbnail

Apache Kafka DevOps with Kubernetes and GitOps

Confluent

Operating critical Apache Kafka® event streaming applications in production requires sound automation and engineering practices. Streaming applications are often at the center of your transaction processing and data systems, requiring […].

Kafka 143
article thumbnail

Five Steps Towards Delivering Better Analytic Outcomes

Teradata

Get tips on how to cast a more critical eye on the seemingly endless amount of data-driven conclusions presented to us. Learn more.

Data 106
article thumbnail

Cutting Through The Noise And Focusing On The Fundamentals Of Data Engineering With The Data Janitor

Data Engineering Podcast

Summary Data engineering is a constantly growing and evolving discipline. There are always new tools, systems, and design patterns to learn, which leads to a great deal of confusion for newcomers. Daniel Molnar has dedicated his time to helping data professionals get back to basics through presentations at conferences and meetups, and with his most recent endeavor of building the Pipeline Data Engineering Academy.

article thumbnail

Cloudera Data Platform in AWS Marketplace Simplifies and Accelerates Cloud Adoption

Cloudera

As organizations look to optimize the speed and cost of their cloud journey in today’s rapidly evolving economy, Cloudera is delighted to announce the availability of Cloudera Data Platform (CDP) Public Cloud in AWS Marketplace. Now customers can easily, confidently and cost-effectively discover, procure and deploy the world’s first Enterprise Data Cloud, powered by AWS, for faster time-to-insight from their advanced analytics and machine learning services.

AWS 88
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Building a Machine Learning Logging Pipeline with Kafka Streams at Twitter

Confluent

Twitter, one of the most popular social media platforms today, is well known for its ever-changing environment—user behaviors evolve quickly; trends are dynamic and versatile; and special and emergent events […].

article thumbnail

Accelerate Your Path to a Modern Analytics Architecture

Teradata

A modern analytics architecture means something different to everyone. What does it mean for your organization? Find out more.

More Trending

article thumbnail

Operational Database Security – Part 2

Cloudera

In this blogpost, we are going to take a look at some of the OpDB related security features of a CDP Private Cloud Base deployment. We are going to talk about auditing, different security levels, security features of Data Catalog, and Client Considerations. You can find part 1 of this series, here. . Auditing. Comprehensive auditing is provided to enable enterprises to effectively and efficiently meet their compliance requirements by auditing access and other types of operations across OpDB (thr

article thumbnail

Infrastructure Modernization with Google Anthos and Apache Kafka

Confluent

The promise of cloud computing is simplicity, speed, and cost savings. But what about workloads that can’t move to the cloud? Are they stuck using expensive legacy tooling and practices? […].

Kafka 59
article thumbnail

Building a Data Science Platform in 10 days

Afterpay Tech

Photo by Pietro Jeng on Unsplash By Letian Wang Context At Afterpay, we are generating lots of data from customer transactions, website views and consumer referrals every day. Being able to derive insights from this data, and to use those insights to  improve our consumer experience and provide value to our merchants and consumers, is a critical competitive differentiator for Afterpay.

article thumbnail

3 Ways to Offload Read-Heavy Applications from MongoDB

Rockset

According to over 40,000 developers, MongoDB is the most popular NOSQL database in use right now. The tool’s meteoric rise is likely due to its JSON structure which makes it easy for Javascript developers to use. From a developer perspective, MongoDB is a great solution for supporting modern data applications. Nevertheless, developers sometimes need to pull specific workflows out of MongoDB and integrate them into a secondary system while continuing to track any changes to the underlying MongoDB

MongoDB 52
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Choosing the right Data Warehouse SQL Engine: Apache Hive LLAP vs Apache Impala

Cloudera

Aren’t two superheroes better than one? Some of the most powerful results come from combining complementary superpowers, and the “dynamic duo” of Apache Hive LLAP and Apache Impala, both included in Cloudera Data Warehouse , is further evidence of this. Both Impala and Hive can operate at an unprecedented and massive scale, with many petabytes of data.

article thumbnail

Exports is not a function

Grouparoo

I have been working on the Salesforce integration. That experience will be its own story. In the process, though, I found something tricky that I might be uniquely experiencing given the combinatorics of the modern Node/Javascript/Typescript world. Grouparoo connects with sources, processes the data from them, and sends that data to destinations. When data comes from a source, we call it an import.

Coding 52
article thumbnail

Customer Journey Analytics & Real-Time Marketing: Lessons Learned from Those That Got it Right

Teradata

Learn about the early adopters for both Customer Journey Analytics and Real-Time Marketing who overcame initial hurdles and realized superior business outcomes.

IT 52
article thumbnail

Build a Slack Dashboard (Part 1): Extracting Data Using Meltano

Preset

Build a beautiful Slack dashboard using open source tools Meltano and Superset. Part 1 of 3.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Using Data to Drive Meaningful Diversity and Inclusion Efforts

Cloudera

A conversation with civil rights activist and author Dr. Mary Frances Berry about the importance of data in diversity and inclusions initiatives. . COVID-19 has forced businesses to change in ways we didn’t know was possible, and at a speed many had never imagined. The summer of 2020 made something else very clear; while we’ve been agile on digital transformation in business, we have failed to tap data to help us confront deep-seated social justice issues.

Data 72
article thumbnail

Today’s ‘Breakfast Roll People’ Will Change How Energy Retail Operates

Teradata

What will it take for energy retailers to transition to world-class segment leaders? The answer is millions of modest improvements, implemented by business users themselves.

Retail 52