Sat.May 15, 2021 - Fri.May 21, 2021

article thumbnail

The Architecture of Uber’s API gateway

Uber Engineering

API gateways are an integral part of microservices architecture in recent years. An API gateway provides a single point of entry for all our apps and provides an interface to access data, logic, or functionality from back-end microservices. It also … The post The Architecture of Uber’s API gateway appeared first on Uber Engineering Blog.

article thumbnail

NVIDIA RAPIDS in Cloudera Machine Learning

Cloudera

Introduction. In the previous blog post in this series, we walked through the steps for leveraging Deep Learning in your Cloudera Machine Learning (CML) projects. This year, we expanded our partnership with NVIDIA , enabling your data teams to dramatically speed up compute processes for data engineering and data science workloads with no code changes using RAPIDS AI.

article thumbnail

Confluent CLI Launches Exciting New Features and an Intuitive UI

Confluent

With so many technologies in the modern development ecosystem, a common complaint is having to go through the mental gymnastics of adopting new products and keeping up with ever-expanding feature […].

article thumbnail

A Holistic Approach To Data Governance Through Self Reflection At Collibra

Data Engineering Podcast

Summary Data governance is a phrase that means many different things to many different people. This is because it is actually a concept that encompasses the entire lifecycle of data, across all of the people in an organization who interact with it. Stijn Christiaens co-founded Collibra with the goal of addressing the wide variety of technological aspects that are necessary to realize such an important and expansive process.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Twelve Thoughts About the Data Mesh

Teradata

The concept of Data Mesh is abuzz in the industry right now. Find out why we're so enthusiastic about it.

Data 97
article thumbnail

Streaming Market Data with Flink SQL Part II: Intraday Value-at-Risk

Cloudera

This article is the second in a multipart series to showcase the power and expressibility of FlinkSQL applied to market data. In case you missed it, part I starts with a simple case of calculating streaming VWAP. Code and data for this series are available on github. Speed matters in financial markets. Whether the goal is to maximize alpha or minimize exposure, financial technologists invest heavily in having the most up-to-date insights on the state of the market and where it is going.

SQL 104

More Trending

article thumbnail

Unlocking The Power of Data Lineage In Your Platform with OpenLineage

Data Engineering Podcast

Summary Data lineage is the common thread that ties together all of your data pipelines, workflows, and systems. In order to get a holistic understanding of your data quality, where errors are occurring, or how a report was constructed you need to track the lineage of the data from beginning to end. The complicating factor is that every framework, platform, and product has its own concepts of how to store, represent, and expose that information.

Metadata 100
article thumbnail

Thirteen Thoughts About the Data Mesh

Teradata

The concept of Data Mesh is abuzz in the industry right now. Find out why we're so enthusiastic about it.

Data 72
article thumbnail

#ClouderaLife Spotlight: Kathleen Merto, Early Talent Program Manager

Cloudera

Meet Kathleen Merto. To her colleagues, she’s Kat. . She works on our Emerging Talent team managing the hiring process for Interns and entry level roles. It’s a job she feels passionately about, so much so that she was eager to give her whole team a shout out! . Kat fell into the perfect career path for her. Growing up, she witnessed her mom, a nurse of 41 years now, dedicate so much of herself to helping others.

article thumbnail

Kafka Summit Europe 2021 Recap

Confluent

And that’s a wrap on Kafka Summit Europe 2021, the first of three global Kafka Summits this year. We’ve seen 17,000 registrations from over 7,000 companies and 137 different countries. […].

Kafka 123
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Cloud Migration Series (Step 4 of 5): Adopt a Cloud-First Mindset

Cloud Academy

This is part 4 of a 5-part series on best practices for enterprise cloud migration. Released weekly from the end of April to the end of May 2021, each article will cover a new phase of a business’s transition to the cloud, what to be on the lookout for, and how to ensure the journey is a success. Be sure to subscribe to our blog to be notified when new content goes live!

Cloud 52
article thumbnail

A visual guide to Azure Data Factory

A Cloud Guru: Data Engineering

A while back we published the Visual Guide to Azure Fundamentals on A Cloud Guru. The post got a lot of positive feedback so we thought we’d do another one — this time focused on Azure Data Factory! What is a visual guide? Visual guides are hi-resolution “sketchnotes.” They summarize a given topic or content […] The post A visual guide to Azure Data Factory appeared first on A Cloud Guru.

Data 52
article thumbnail

Key considerations when making a decision on a Cloud Data Warehouse

Cloudera

Making a decision on a cloud data warehouse is a big deal. Beyond there being a number of choices each with very different strengths, the parameters for your decision have also changed. Modernizing your data warehousing experience with the cloud means moving from dedicated, on-premises hardware focused on traditional relational analytics on structured data to a modern platform.

article thumbnail

Your smart frontend is doing too much

Grouparoo

The rise in popularity of frontend libraries and frameworks like React, Vue and Angular make it easier than ever before to build rich and interactive web apps. Pair these powerful libraries with a nice API to pull some data, and you can pretty quickly build out complex use cases. However, the ability to do so many things on the client side doesn't always mean you should.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Why CEOs Must Lead a New Relationship with Data

Teradata

CEOs need a new relationship with data if they are to successfully transition to the hyper-personalized, hyper-localized future most recognize as today’s immediate imperative.

Data 52
article thumbnail

Introducing Cluster RBAC, Audit Logs, and BYOK for Enterprise-Grade Security

Confluent

When it comes to launching your next app with data in motion, few things pose the same risk to going live as meeting requirements for data security and compliance. Doing […].

article thumbnail

What’s new in CDP Private Cloud 1.2?

Cloudera

CDP (Cloudera Data Platform) Private Cloud 1.2 was recently released and builds on the success of CDP Private Cloud Base (see the 7.1.6 release blog ). While Private Cloud Base is the ideal modernization of both CDH and HDP deployments for traditional workloads, Private Cloud adds cloud-native capabilities. In this blog, we’ll cover the complete range of new capabilities and updates for CDP Private Cloud as a whole (the platform) as well as for both the CDW (Cloudera Data Warehouse) and CML (Clo

Cloud 90
article thumbnail

Data-driven performance improvements in grocery retail: Pursuing the 1%

Retail Insight

Elite sport is a results business. T he difference between winning and losing often com es down to the finest of margins. As Al Pacino said in Any Given Sunday , it is all about the ‘inches’.

Retail 52
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Why CEOs Must Lead a New Relationship with Data

Teradata

CEOs need a new relationship with data if they are to successfully transition to the hyper-personalized, hyper-localized future most recognize as today’s immediate imperative.

Data 52
article thumbnail

Data Makes Your Tools Smarter

Grouparoo

When I was in charge of Product/Engineering at TaskRabbit, it was always challenging to prioritize integrations being requested by our Marketing, Sales, and Customer Success teams. First and foremost, most engineers just hate working on these kinds of integrations. Often, this preference alone is the deciding factor in organizations for what gets prioritized or not.

Data 52
article thumbnail

Coffee With Cloudera Partners: AWS

Cloudera

Enterprises are adopting a hybrid cloud approach. While more Cloudera customers want to move apps and data to the cloud, they also want to continue using their data centers for security and governance. By having both on-premises and cloud environments, organizations increase their agility, and hybrid model is gaining momentum. A hybrid approach benefits many organizations as it allows them to make best use of on-premises infrastructure while taking advantage of additional compute capacity and l

AWS 65
article thumbnail

How do thread priorities affect your Android app?

Booking.com Engineering

Introduction Threads are essential for responsive UI applications. When programming in Android, we make sure that any kind of work that could cause the slightest lagging is scheduled to a separate thread, other than the one responsible for the UI updates. And even though there are various high level constructs available for the developer’s convenience, how threading works at a very low level leaks from all these abstractions nonetheless.

Java 52
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Kafka to Delta Lake, as fast as possible

Scribd Technology

Streaming data from Apache Kafka into Delta Lake is an integral part of Scribd’s data platform, but has been challenging to manage and scale. We use Spark Structured Streaming jobs to read data from Kafka topics and write that data into Delta Lake tables. This approach gets the job done but in production our experience has convinced us that a different approach is necessary to efficiently bring data from Kafka to Delta Lake.

Kafka 52
article thumbnail

Listen Carefully

Teradata

The data-driven, digital-first era has multiplied the complexity of customer conversations – but it has also provided the means to generate and act on real insight.

IT 52
article thumbnail

How XOps Is Hoping to Unite All the Disparate Ops Disciplines Under One Banner

DataKitchen

The post How XOps Is Hoping to Unite All the Disparate Ops Disciplines Under One Banner first appeared on DataKitchen.

52
article thumbnail

Scala Testing with ScalaTest: A Beginner's Guide to Testing Styles

Rock the JVM

In this article, we explore the main testing styles in Scala and ScalaTest: understanding what terms like 'FunSuite' and 'FlatSpec' really mean

Scala 52
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

RudderStack Product News Vol. #006 - Better Data Reporting

RudderStack

Our latest sprint focused on improving data reporting in multiple views of the product and adding several marketing integrations.

Data 40
article thumbnail

Popular Use Cases for Real-Time Analytics

Rockset

In 2008, Dominos Pizza released its pizza tracker so that fans could monitor in real time if their pizza was in the oven or out for delivery. By 2019, 65% of Dominos’ sales came through digital channels including home devices and emoji texts, reimagining the brand for the digital era. The Dominos’ Pizza Tracker is the quintessential example of real-time analytics.

Retail 40
article thumbnail

Blinkist Chooses Monte Carlo to Deliver More Reliable Data Pipelines Through Data Observability

Monte Carlo

Monte Carlo today announced Berlin-based microlearning app Blinkist has selected Monte Carlo to achieve more reliable data through data observability. As a high-growth company with over 16 million users worldwide, Blinkist leverages paid performance marketing to fuel customer acquisition — and those channels rely on accurate behavioral data to optimize campaign spend.

article thumbnail

Connecting Your Data Mesh with DataOps

DataKitchen

The post Connecting Your Data Mesh with DataOps first appeared on DataKitchen.

Data 40
article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.