Sat. Aug 08, 2020 - Fri. Aug 14, 2020

Teradata Vantage: Born for Cloud Before Cloud Was Born

Teradata

Teradata Workload Management enables Vantage to be fully optimized for cloud & hybrid deployments & to efficiently deliver the lowest cost for enterprise analytics.

Cloud 124

Confluent Announces Offer for Nonprofits Providing COVID-19 Relief

Confluent

In March, I wrote about Confluent’s commitment to our customers, employees, and community during the COVID-19 pandemic. In some respects, it’s hard to believe that only a few months have […].

105

Closing The Loop On Event Data Collection With Iteratively

Data Engineering Podcast

Summary: Event-based data is a rich source of information for analytics, unless none of the event structures are consistent. The team at Iteratively is building a platform to manage the end-to-end flow of collaboration around what events are needed, how to structure the attributes, and how they are captured. In this episode founders Patrick Thompson and Ondrej Hrebicek discuss the problems that they have experienced as a result of inconsistent event schemas, how the Iteratively platform integrat

Improving our video encodes for legacy devices

Netflix Tech

by Mariana Afonso, Anush Moorthy, Liwei Guo, Lishan Zhu, Anne Aaron Netflix has been one of the pioneers of streaming video-on-demand content (we announced our intention to stream video over 13 years ago, in January 2007) and have only increased both our device and content reach since then. Given the global nature of the service and Netflix’s commitment to creating a service that members enjoy, it is not surprising that we support a wide variety of streaming devices, from set-top-boxes and

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

Apache Ozone Fault Injection Framework

Cloudera

One of the key challenges of building an enterprise-class robust scalable storage system is to validate the system under duress and failing system components. This includes, but is not limited to: failed networks, failed or failing disks, arbitrary delays in the network or IO path, network partitions, and unresponsive systems. Apache Ozone fault injection framework is designed to validate Ozone under heavy stress and failed or failing system components.

Hadoop 96

Multi-Threaded Message Consumption with the Apache Kafka Consumer

Confluent

Multithreading is “the ability of a central processing unit (CPU) (or a single core in a multi-core processor) to provide multiple threads of execution concurrently, supported by the operating system.” […].

Kafka 105

More Trending

Computational Causal Inference at Netflix

Netflix Tech

Jeffrey Wong, Colin McFarland Every Netflix data scientist, whether their background is from biology, psychology, physics, economics, math, statistics, or biostatistics, has made meaningful contributions to the way Netflix analyzes causal effects. Scientists from these fields have made many advancements in causal effects research in the past few decades, spanning instrumental variables, forest methods, heterogeneous effects, time-dynamic effects, quantile effects, and much more.

Analytics-on-the-fly: from batch to real-time user engagement

Rockset

It was the winter of 2007 when I logged into my newly created Facebook account for the very first time and I was amazed to see Facebook immediately show me three of my friends with whom I had lost touch since elementary school. One of them was working in London in a multinational bank, the other one was an engineer at Google in their Silicon Valley office and the third one was running a restaurant in my town of Guwahati, a sleepy town on the India-Myanmar border.

Hadoop 52

Streaming Heterogeneous Databases with Kafka Connect – The Easy Way

Confluent

Building a Cloud ETL Pipeline on Confluent Cloud shows you how to build and deploy a data pipeline entirely in the cloud. However, not all databases can be in the […].

Database 103

Answers in the Cloud, No Matter Where Your Data Is

Teradata

Vantage on Azure provides enterprise-grade real-time business intelligence through a comprehensive solution that combines analytics, data lakes, & data warehouse technologies.

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

Telltale: Netflix Application Monitoring Simplified

Netflix Tech

By Andrei U., Seth Katz, Janak Ramachandran, Jeff Butsch, Peter Lau, Ram Vaithilingam, and Greg Burrell Our Telltale Vision An alert fires and you get paged in the middle of the night. A metric crossed a threshold. You’re half awake and wondering, “Is there really a problem or is this just an alert that needs tuning? When was the last time somebody adjusted our alert thresholds?

Type-Level Programming in Scala: Part 1 - Numbers and Comparisons

Rock the JVM

Harness the full power of Scala's type system: let the compiler infer complex type relationships for you at compile time

Scala 52

The Curious Incident of the State Store in Recovery in ksqlDB

Confluent

When operating cloud infrastructure, “time is money” is more than a cliché—it is interpreted literally as every processing second stacks up on the monthly bill. ksqlDB strives to reduce these […].

Cloud 85

Accelerating Innovation in the Analytic Ecosystem: Flexibility

Teradata

In part 1 of this 3 part series on reducing conflict between business & IT to accelerate innovation, we focus on enabling flexibility for tools, languages & libraries.

IT 52

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

Integration: Apache Kafka & Nifi

RandomTrees

By Anshul Ghogre Introduction Apache NiFi is designed to automate the flow of data between software systems. It is based on the “NiagaraFiles” software previously developed by the NSA, and it supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic. Apache Kafka is used for building real-time data pipelines and streaming apps.

Kafka 52

Superset 0.37, Viz Plugins, Row-Level Security, Better Code Quality

Preset

Summary of Superset 0.

Coding 40

The Future is Serverless: What About Your Data Stack?

Rockset

Originally published on July 8, 2020. Yesterday I read an analyst report that the serverless architecture market will be $21B by 2025. I also recently met with Alex DeBrie, author of the DynamoDB book and enjoyed learning about his serverless philosophy. He wrote a great post about the key factors for choosing serverless databases here, and we had a fascinating conversation about serverless indexing systems that complement them.

BI 40

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Power BI Template App for Stripe

FreshBI

So, what is a Power BI Template App? A Power BI Template App is a published Power BI solution that can be used by any company that has the data platform for which the Template App was created. Can you imagine picking your entire Power BI Solution off the shelf, one crafted for your specific business needs and your specific data structure? Power BI Template Apps are designed to be such an out-of-the-box solution, and this blog post presents one such Power BI Solution for Stripe.

BI 52

How Nielsen Scaled Access To Data Analytics Using Apache Superset

Preset

Learn why Nielsen migrated to Superset for visualization and dashboards.

Case Study: eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data

Rockset

From business communications and financial transactions to trip planning and activity tracking, much of our lives run through smartphones today. eGoGames will help you add competitive esports to that list. As the first European esports platform for mobile devices, eGoGames offers head-to-head, league, and tournament competition for skill-based mobile games.

BI 40

Rapid Experimentation and Growth Using Real-Time Analytics

Rockset

You may often hear that the world is moving from batch to real-time. While traditional “business intelligence” has come a long way in the past 20 years, the world of real-time analytics is still in its early days. Traditional BI had its Renaissance moments with the advent of Big Data technologies such as Hadoop, and cloud data lakes and warehouses have since brought everyone to the Modern era.

BI 40

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.