Sat. Jan 25, 2020 - Fri. Jan 31, 2020

Streaming Machine Learning with Tiered Storage and Without a Data Lake

Confluent

The combination of streaming machine learning (ML) and Confluent Tiered Storage enables you to build one scalable, reliable, but also simple infrastructure for all machine learning tasks using the Apache […].

Data Privacy and Why it Matters to Our Customers

Teradata

People want control over their personal data, but are also willing to trade it away for convenience. When does the exploitation of our data become unethical? Read more!

IT 115

Pay Down Technical Debt In Your Data Pipeline With Great Expectations

Data Engineering Podcast

Summary: Data pipelines are complicated and business-critical pieces of technical infrastructure. Unfortunately, they are also complex and difficult to test, leading to a significant amount of technical debt, which contributes to slower iteration cycles. In this episode, James Campbell describes how he helped create the Great Expectations framework to help you gain control and confidence in your data delivery workflows, the challenges of validating and monitoring the quality and accuracy of your data […]

Case Study: Standard Cognition Uses Rockset to Deliver Data APIs and Real-Time Metrics for Vision AI

Rockset

Walk into a store, grab the items you want, and walk out without having to interact with a cashier or even use a self-checkout system. That’s the no-hassle shopping experience of the future you’ll get at the Standard Store, a demonstration store showcasing the AI-powered checkout pioneered by Standard Cognition. The company uses computer vision to remove the need for checkout lines of any sort in physical retail locations.

Retail 40

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

Norfolk Southern Corporation

Teradata

Rail transportation analytics optimizes network planning and operations for greater visibility into customer demand for rail traffic.