Sat.Apr 17, 2021 - Fri.Apr 23, 2021

article thumbnail

What’s New in Apache Kafka 2.8

Confluent

I’m proud to announce the release of Apache Kafka 2.8.0 on behalf of the Apache Kafka® community. The 2.8.0 release contains many new features and improvements. This blog post highlights […].

Kafka 138
article thumbnail

Moving Machine Learning Into The Data Pipeline at Cherre

Data Engineering Podcast

Summary Most of the time when you think about a data pipeline or ETL job what comes to mind is a purely mechanistic progression of functions that move data from point A to point B. Sometimes, however, one of those transformations is actually a full-fledged machine learning project in its own right. In this episode Tal Galfsky explains how he and the team at Cherre tackled the problem of messy data for Addresses by building a natural language processing and entity resolution system that is served

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Drinking our own champagne – Cloudera upgrades to CDP Private Cloud

Cloudera

Like most of our customers, Cloudera’s internal operations rely heavily on data. For more than a decade, Cloudera has built internal tools and data analysis primarily on a single production CDH cluster. This cluster runs workloads for every department – from real-time user interfaces for Support to providing recommendations in the Cloudera Data Platform (CDP) Upgrade Advisor to analyzing our business and closing our books.

Cloud 121
article thumbnail

Reshaping the supermarket post-pandemic

Retail Insight

Social distancing and a life lived largely online have been the reality for over a year. But, as the world gradually emerges from lockdown, ha s the shape of retail really changed forever?

Retail 52
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Monitoring Your Event Streams: Tutorial for Observability Into Apache Kafka Clients

Confluent

Why should you monitor your Apache Kafka® client applications? Apart from the usual reasons for monitoring any application, such as ensuring uptime SLAs, there are a few specific reasons for […].

Kafka 72
article thumbnail

Understanding Types with SQLite and Node.js

Grouparoo

Two fun facts about SQLite : The initial release was more than 20 years ago! It is the most widely used database (and likely one of the most widely deployed pieces of software). And here are a few of my opinions on SQLite: It's super cool. We don't talk about it enough. It's actually really easy to use (which is likely why it's so widely used).

Bytes 52

More Trending

article thumbnail

How to Approach Your Data Engineering Transformation

Silectis

Should you build your own tooling, take a “best of breed” approach, or buy a turnkey data engineering platform? We’ve got you covered. Data Engineering Platforms: Build, Best of Breed, or Buy? Every company wants to be data-driven. Modern organizations that thrive based on data have a common strength: a solid data engineering practice.

article thumbnail

The Worst of Times - The Best of Times

Teradata

As customer behavior changes rapidly, the challenges & opportunities for fast, flexible, agile, and future fit improvements for retailers are huge. Read more.

Retail 52
article thumbnail

Welcome, Pedro!

Grouparoo

Building an open source tool to connect data to many different services means a lot of integrations. It can be pretty tricky, so we were lucky to meet Pedro S Lopez a few weeks back when he started adding several plugins to that integration list. He has now come aboard officially and will work more on the core product. Pedro makes the Grouparoo team an international one.

article thumbnail

Apache Ozone and Dense Data Nodes

Cloudera

This post was co-authored by two Cisco Employees as well: Karthik Krishna, Silesh Bijjahalli. Today’s enterprise data analytics teams are constantly looking to get the best out of their platforms. Storage plays one of the most important roles in the data platforms strategy, it provides the basis for all compute engines and applications to be built on top of it.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

The battle to combat data sprawl: what CIOs need to do now

DataKitchen

The post The battle to combat data sprawl: what CIOs need to do now first appeared on DataKitchen.

Data 52
article thumbnail

Hyper-Personalization: Understanding Customers Using Digital Payments Data

Teradata

Hyper-personalization is a must-have for businesses today. But how do digital payments data help? By bringing granularity to your personalization strategies.

Data 52
article thumbnail

Cats Effect 3: Introduction to Fibers

Rock the JVM

An Introduction to Asynchronous Computations with Fibers in Cats Effect 3, Tailored for Scala 3

Scala 52
article thumbnail

What is Streaming Analytics?

Cloudera

What is Streaming Analytics? Streaming Analytics is a type of data analysis that processes data streams for real-time analytics. It continuously processes data from multiple streams and performs simple calculations to complex event processing for delivering sophisticated use cases. The primary purpose is to present the most up-to-date operational events for the user to stay on top of the business needs and take action as changes happen in real-time.

Kafka 95
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

How Resident Reduced Data Issues by 90% with Monte Carlo

Monte Carlo

Many data leaders tell us that their data scientists and engineers spend 40 percent or more of their time tackling data issues instead of working on projects that actually move the needle. It doesn’t have to be this way. Here’s how the data engineering team at Resident, a house of direct-to-consumer furnishings brands, reduced their data incidents by 90% with data observability at s cale.

article thumbnail

Making the Remote Onboarding a Success

Zalando Engineering

When the pandemic started in 2020 many Zalando employees went into home office. It changed our working habits and many other things and Zalando published remote working guidelines to support their employees. This concentrates only on remote working, but what happens if you change companies during the pandemic? Joining a new company and getting onboarded can be already pretty tough during normal times.

article thumbnail

Data Analyst Responsibilities-What does a data analyst do?

ProjectPro

Are you passionate about numbers and algebraic functions? Does the idea of evaluating, processing, analyzing, and interpreting statistical data makes you roll up your sleeves and get the job done? Do you love to distinguish the trends and patterns in data? Do you enjoy sharing your work and communicating your knowledge with others in the team? Do you have the attitude of self-learning and can figure things out on your own?

article thumbnail

Deep Learning with Nvidia GPUs in Cloudera Machine Learning

Cloudera

Introduction. In our previous blog post in this series , we explored the benefits of using GPUs for data science workflows, and demonstrated how to set up sessions in Cloudera Machine Learning (CML) to access NVIDIA GPUs for accelerating Machine Learning Projects. While the time-saving potential of using GPUs for complex and large tasks is massive, setting up these environments and tasks such as wrangling NVIDIA drivers, managing CUDA versions and deploying custom engines for your specific proje

article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

article thumbnail

HDFS Data Encryption at Rest on Cloudera Data Platform

Cloudera

Introduction: Encryption of Data at Rest is a highly desirable or sometimes mandatory requirement for data platforms in a range of industry verticals including HealthCare, Financial & Government organizations. The capability increases security and protects sensitive data from various kinds of attack that could be internal or external to the platform.

MySQL 73
article thumbnail

#ClouderaLife Spotlight: Bogi Egyed, Engineering Manager

Cloudera

Meet Boglarka Egyed, also known as “Bogi” to her colleagues. . She’s a 5-year Clouderan who recently transitioned into the role of Engineering Manager. . Bogi originally graduated from college with her degree in Applied Mathematics but has spent her career as a Software Engineer. “Mathematics provided me with solid fundamentals to use in this field but programming was what really caught my attention due to its creative nature while being able to get results fast.” .

article thumbnail

The Intersection of Climate and Capital Markets

Cloudera

Happy Earth Day! Earth Day was introduced in 1970 and has celebrated various milestone achievements including expanding globally and leveraging the power of social media to expand climate awareness and action. A great summary and history can be found on earthday.org/history. For those in financial services, climate initiatives are another major market event with far-reaching impact on capital adequacy and compliance regulations.