Sat. Feb 20, 2021 - Fri. Feb 26, 2021

Lessons Learned from Running Apache Kafka at Scale at Pinterest

Confluent

Apache Kafka® is at the heart of the data transportation layer at Pinterest. The amount of data that runs through Kafka has constantly grown over the years. This growth sometimes […].

Kafka 145

Why Data Capabilities Follow Up a Digital Transformation

Team Data Science

Companies can now make data useful to elevate decision making and to optimise products and processes. But what organizational capabilities are necessary, and how do you get started? It's currently easy to acquire data strategically. First, consider that smartphones function like questionnaires that customers are frequently filling out in a passive or active manner [1].

The rise and fall of the Agile Spotify Model

François Nguyen

If you are working in the tech field, I think you have already heard of Squads, Tribes, Chapters or Guilds. They come from Spotify, a Swedish audio streaming company. If you are organizing #datateams, it could be tempting to copy/paste. You really should not! The Spotify Model and Engineering Culture: if you want to go back to the original article, it is here.

#ClouderaLife Spotlight: Kevin Smith, Staff Customer Operations Engineer

Cloudera

Meet Kevin Smith, a Staff Customer Operations Engineer within the US Public Sector support team. He sums up his day-to-day by saying he works directly with clients on technical cases and provides support and guidance as they troubleshoot unexpected behavior. He also serves as a member of several project teams focusing on upgrade experiences, internal tools, product testing, training, and documentation.

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

Self Service Open Source Data Integration With AirByte

Data Engineering Podcast

Summary: Data integration is a critical piece of every data pipeline, yet it is still far from being a solved problem. There are a number of managed platforms available, but the list of options for an open source system that supports a large variety of sources and destinations is still embarrassingly short. The team at Airbyte is adding a new entry to that list with the goal of making robust and easy to use data integration more accessible to teams who want or need to maintain full control of thei

Apache Kafka and SAP Integration with the Kafka Connect ODP Source Connector

Confluent

SAP is a German multinational software corporation that develops and markets enterprise software to manage business operations and customer relations. SAP is most famous for its enterprise resource planning (ERP) […].

Kafka 88

More Trending

Black History Month 2021: Be the light

Cloudera

Be the light – Accepting the call to become the change we seek. As Black History Month comes to a close, global communities and companies alike are left reflecting on recent historical events with shock, awe and a commitment to drive change. We find ourselves faced with the unhealed wounds of our past, a defining moment for our future and an opportunity to become the change we seek as citizens and professionals.

Teradata Has Been Named One of the World's Most Ethical Companies 2021

Teradata

Teradata has again been recognized as one of the World’s Most Ethical Companies, for the 12th consecutive year! Read more.


How to Manage Secrets for Confluent Platform with Kubernetes and HashiCorp Vault

Confluent

This blog post walks through an end-to-end demo that uses the Confluent Operator to deploy Confluent Platform to Kubernetes. We will deploy a connector that watches for commits to a […].

Packaging award-winning shows with award-winning technology

Netflix Tech

By Cyril Concolato. Introduction: In previous blog posts, our colleagues at Netflix have explained how 4K video streams are optimized, how even legacy video streams are improved, and more recently how new audio codecs can provide better aural experiences to our members. In all these cases, prior to being delivered through our content delivery network Open Connect, our award-winning TV shows, movies and documentaries like The Crown need to be packaged to enable crucial features for our members.

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

Building loyalty with data and analytics

Cloudera

In 1969, my aunt graduated from university and joined IBM, the dominant player in the nascent tech industry at the time. She remained at “Big Blue” where she met and married my uncle, and rose up through the management ranks, until their joint semi-retirement exactly 30 years later. She recently told me, “the only way you could get fired in those days was to murder someone, embezzle or steal”.

Knowing Me, Knowing You: Data-Driven CX in Retail

Teradata

Today's retailers need to focus on using data to create scenarios that encourage the customer to engage with them, and then ensure that they act appropriately when they do.

Retail 59

Data-driven performance improvements: Football and retail execution

Retail Insight

When I left school to start a professional football career, I understood very little about data – I did keep a note of the goals I scored, the assists I made and, most likely, the keepie-ups I could perform, but that was about it.

Retail 52

Containerized Testing with Kerberos and SSH

Confluent

Kerberos authentication is widely used in today’s client/server applications; however, getting started with Kerberos may be a daunting task if you don’t have prior experience. Information on setting up Kerberos […].

Kafka 52

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

Change The Way You Do ML With Applied ML Prototypes

Cloudera

Today’s enterprise data science teams have one of the most challenging, yet most important roles to play in your business’s ML strategy. In our current landscape, businesses that have adopted a successful ML strategy are outperforming their competitors by over 9%. The implications of ML on the future of business are clear. However, only 4% of enterprise executives today report seeing success from their ML investment.

Declarative Data Sync

Grouparoo

Developers have been using the Grouparoo UI to set up automated data movement from their databases to Mailchimp, Marketo, Salesforce, and more. While having these integrations already written for them saved plenty of time, there was something they missed: their normal developer workflow. Grouparoo now supports declarative data models and integrations to continuously sync your data to all of your cloud-based tools.

Data 52

Knowing Your Data Starts with Data Lineage

Silectis

Data lineage can be a tremendously useful tool for data engineering and analytics, but is often treated as an afterthought both because of the challenges in implementation and the fact that it has not been broadly available within organizations. Many practitioners have never had access to data lineage information and may not know what they are missing.

RippleNet Engineering's Inclusive Language Initiative: Part 1

Ripple Engineering

In 2020, Ripple accelerated our efforts to enhance diversity and inclusion throughout the company. As part of this commitment, we are sharing RippleNet Engineering's initiative to replace language in our codebase that does not align with the reality in which we collectively want to live. This project was inspired especially by the protests last summer denouncing police brutality against Black citizens and the long fight against systemic racism in the United States.

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Sample applications for Cloudera Operational Database

Cloudera

Cloudera Operational Database is an operational database-as-a-service that brings ease of use and flexibility to Apache HBase. Cloudera Operational Database enables developers to quickly build future-proof applications that are architected to handle data evolution. In the previous blog posts, we looked at application development concepts and how Cloudera Operational Database (COD) interacts with other CDP services.

Intro to databases on Azure: Basics for aspiring data engineers

A Cloud Guru: Data Engineering

How do you get started with an Azure database? As a database novice or someone new to Microsoft Azure, there are so many options it can be hard to know where to begin. Which is right for you as you get started on the path to becoming a data engineer? Let’s turn the question around […]

Data Observability: How Blinkist Prevents Broken Data Pipelines at Scale with Monte Carlo

Monte Carlo

Companies spend upwards of $15 million annually tackling data downtime, in other words, periods of time where data is missing, broken, or otherwise erroneous, and over 88 percent of U.S. businesses have lost money as a result of data quality issues. Fortunately, there’s hope in the next frontier of data engineering: observability. Here’s how the data engineering team at Blinkist, a book-summarizing subscription service, increases cost savings, collaboration, and productivity with data observ

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

Elasticsearch or Rockset for Real-Time Analytics: How Much Query Flexibility Do You Have?

Rockset

It’s difficult to create data analytics systems that can easily query across your various data sources while maintaining fast performance and real-time capabilities. In an attempt to mitigate these challenges, many companies are turning to more modern database solutions. Two of these real-time analytics solutions are Elasticsearch and Rockset. Elasticsearch, originally developed for text search, has recently tried to push into the data analytics space.

SQL 40

Semantic Versioning Starting in Apache Superset™ 1.1

Preset

Semantic versioning is a common strategy for handling releases. We discuss why the Apache Superset™ community is adopting this approach.


Integration tests with Testcontainers

Zalando Engineering

In this article, I will show how teams at Zalando Marketing Services are using integration tests in Java-based backend applications. We will walk through the idea of integration tests: the main concept and the attributes of a good integration test. Then, we will discuss an example based on the Testcontainers library used in the Spring environment. Integration tests: There are many definitions of integration testing.

Monte Carlo Recognized as a 2021 Enterprise Tech 30 Company

Monte Carlo

We’re excited to share that Monte Carlo was today named to the third-annual 2021 Enterprise Tech 30, a prestigious list of early-stage, mid-stage, and late-stage private companies identified by top VCs and analysts as the most promising in enterprise tech. Organized by Wing Venture Capital, the Enterprise Tech 30 considers over 15,000 companies each year for this celebrated award, narrowing it down to just 30.

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

Five Trends for the Financial Services Industry to Track in 2021

Cloudera

With a new year ahead, it’s time for financial services to pause, take stock of the “new normal,” and plan a path forward. COVID-19 forced nearly every industry to adapt to a new reality, and the financial services industry was no exception. Consumer habits shifted drastically. Suddenly, many people started working from home. Employee and customer needs changed.

Banking 62

Machine Learning Engineer Salary-The Ultimate Guide for 2023

ProjectPro

Wondering how much a machine learning engineer earns? Well, we have got you covered. In this article, you’ll get some insider expert advice, including helpful resources, to help determine the machine learning engineer's average salary for your location, skills, and experience level. So, let’s get started!

The Multifaceted Value Proposition of the Cloudera Data Platform

Cloudera

The Cloudera Data Platform (CDP) represents a paradigm shift in modern data architecture by addressing all existing and future analytical needs. It builds on a foundation of technologies from CDH (Cloudera Data Hub) and HDP (Hortonworks Data Platform) and delivers a holistic, integrated data platform from Edge to AI, helping clients accelerate complex data pipelines and democratize data assets.