Sat. Apr 17, 2021 - Fri. Apr 23, 2021

What’s New in Apache Kafka 2.8

Confluent

I’m proud to announce the release of Apache Kafka 2.8.0 on behalf of the Apache Kafka® community. The 2.8.0 release contains many new features and improvements. This blog post highlights […].

Kafka 138

Drinking our own champagne – Cloudera upgrades to CDP Private Cloud

Cloudera

Like most of our customers, Cloudera’s internal operations rely heavily on data. For more than a decade, Cloudera has built internal tools and data analysis primarily on a single production CDH cluster. This cluster runs workloads for every department – from real-time user interfaces for Support to providing recommendations in the Cloudera Data Platform (CDP) Upgrade Advisor to analyzing our business and closing our books.

Cloud 122

Moving Machine Learning Into The Data Pipeline at Cherre

Data Engineering Podcast

Summary: Most of the time when you think about a data pipeline or ETL job, what comes to mind is a purely mechanistic progression of functions that move data from point A to point B. Sometimes, however, one of those transformations is actually a full-fledged machine learning project in its own right. In this episode Tal Galfsky explains how he and the team at Cherre tackled the problem of messy address data by building a natural language processing and entity resolution system that is served

Reshaping the supermarket post-pandemic

Retail Insight

Social distancing and a life lived largely online have been the reality for over a year. But, as the world gradually emerges from lockdown, has the shape of retail really changed forever?

Retail 52

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

Monitoring Your Event Streams: Tutorial for Observability Into Apache Kafka Clients

Confluent

Why should you monitor your Apache Kafka® client applications? Apart from the usual reasons for monitoring any application, such as ensuring uptime SLAs, there are a few specific reasons for […].

Kafka 72

Relationship intelligence will shape the workplace of the future

Cloudera

Our latest Influential Women in Data session featured Brenda Le Sueur from Cambridge Assessments. Brenda has worked across many organisations and continents, but what has always been crucial to her is relationships – how we cultivate them, how we nurture them and how they, in turn, define us. I sat down with Brenda to ask her about her journey as a woman in tech and understand more about the impact of relationships on our career.

More Trending

Understanding Types with SQLite and Node.js

Grouparoo

Two fun facts about SQLite: The initial release was more than 20 years ago! It is the most widely used database (and likely one of the most widely deployed pieces of software). And here are a few of my opinions on SQLite: It's super cool. We don't talk about it enough. It's actually really easy to use (which is likely why it's so widely used).

Bytes 52

Hyper-Personalization: Understanding Customers Using Digital Payments Data

Teradata

Hyper-personalization is a must-have for businesses today. But how do digital payments data help? By bringing granularity to your personalization strategies.

Data 52

Apache Ozone and Dense Data Nodes

Cloudera

This post was co-authored by two Cisco employees as well: Karthik Krishna and Silesh Bijjahalli. Today’s enterprise data analytics teams are constantly looking to get the best out of their platforms. Storage plays one of the most important roles in a data platform's strategy: it provides the basis on which all compute engines and applications are built.

The battle to combat data sprawl: what CIOs need to do now

DataKitchen

The post The battle to combat data sprawl: what CIOs need to do now first appeared on DataKitchen.

Data 52

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

How to Approach Your Data Engineering Transformation

Silectis

Should you build your own tooling, take a “best of breed” approach, or buy a turnkey data engineering platform? We’ve got you covered. Data Engineering Platforms: Build, Best of Breed, or Buy? Every company wants to be data-driven. Modern organizations that thrive based on data have a common strength: a solid data engineering practice.

Welcome, Pedro!

Grouparoo

Building an open source tool to connect data to many different services means a lot of integrations. It can be pretty tricky, so we were lucky to meet Pedro S Lopez a few weeks back when he started adding several plugins to that integration list. He has now come aboard officially and will work more on the core product. Pedro makes the Grouparoo team an international one.

What is Streaming Analytics?

Cloudera

What is Streaming Analytics? Streaming Analytics is a type of data analysis that processes data streams for real-time analytics. It continuously processes data from multiple streams and performs simple calculations to complex event processing for delivering sophisticated use cases. The primary purpose is to present the most up-to-date operational events for the user to stay on top of the business needs and take action as changes happen in real-time.

Kafka 100

The Worst of Times - The Best of Times

Teradata

As customer behavior changes rapidly, the challenges & opportunities for fast, flexible, agile, and future fit improvements for retailers are huge. Read more.

Retail 52

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

How Resident Reduced Data Issues by 90% with Monte Carlo

Monte Carlo

Many data leaders tell us that their data scientists and engineers spend 40 percent or more of their time tackling data issues instead of working on projects that actually move the needle. It doesn’t have to be this way. Here’s how the data engineering team at Resident, a house of direct-to-consumer furnishings brands, reduced their data incidents by 90% with data observability at scale.

Making the Remote Onboarding a Success

Zalando Engineering

When the pandemic started in 2020, many Zalando employees began working from home. It changed our working habits and many other things, and Zalando published remote working guidelines to support its employees. These concentrate only on remote working, but what happens if you change companies during the pandemic? Joining a new company and getting onboarded can already be pretty tough in normal times.

Deep Learning with Nvidia GPUs in Cloudera Machine Learning

Cloudera

Introduction. In our previous blog post in this series, we explored the benefits of using GPUs for data science workflows, and demonstrated how to set up sessions in Cloudera Machine Learning (CML) to access NVIDIA GPUs for accelerating Machine Learning Projects. While the time-saving potential of using GPUs for complex and large tasks is massive, setting up these environments and tasks such as wrangling NVIDIA drivers, managing CUDA versions and deploying custom engines for your specific proje

Data Analyst Responsibilities-What does a data analyst do?

ProjectPro

Are you passionate about numbers and algebraic functions? Does the idea of evaluating, processing, analyzing, and interpreting statistical data make you roll up your sleeves and get the job done? Do you love to distinguish the trends and patterns in data? Do you enjoy sharing your work and communicating your knowledge with others in the team? Do you have the attitude of self-learning and can figure things out on your own?

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

HDFS Data Encryption at Rest on Cloudera Data Platform

Cloudera

Introduction: Encryption of Data at Rest is a highly desirable or sometimes mandatory requirement for data platforms in a range of industry verticals including HealthCare, Financial & Government organizations. The capability increases security and protects sensitive data from various kinds of attack that could be internal or external to the platform.

MySQL 73

#ClouderaLife Spotlight: Bogi Egyed, Engineering Manager

Cloudera

Meet Boglarka Egyed, also known as “Bogi” to her colleagues. She’s a 5-year Clouderan who recently transitioned into the role of Engineering Manager. Bogi originally graduated from college with her degree in Applied Mathematics but has spent her career as a Software Engineer. “Mathematics provided me with solid fundamentals to use in this field but programming was what really caught my attention due to its creative nature while being able to get results fast.”

The Intersection of Climate and Capital Markets

Cloudera

Happy Earth Day! Earth Day was introduced in 1970 and has celebrated various milestone achievements including expanding globally and leveraging the power of social media to expand climate awareness and action. A great summary and history can be found on earthday.org/history. For those in financial services, climate initiatives are another major market event with far-reaching impact on capital adequacy and compliance regulations.