Sat.Dec 29, 2018 - Fri.Jan 04, 2019

Simplifying Continuous Data Processing Using Stream Native Storage In Pravega with Tom Kaitchuck - Episode 63

Data Engineering Podcast

Summary: As more companies and organizations work to gain a real-time view of their business, they are increasingly turning to stream processing technologies to fulfill that need. However, the storage requirements for continuous, unbounded streams of data are markedly different from those of batch-oriented workloads. To address this shortcoming, the team at Dell EMC has created the open-source Pravega project.

The New Cloudera

Cloudera

A new year is always an opportunity for change. This year, we’re making a big one. On January 3, we closed the merger of Cloudera and Hortonworks — the two leading companies in the big data space — creating a single new company that is the leader in our category. We are well positioned to deliver even more innovation and success than we have independently over the last decade.

Hadoop 75

How Data Privacy Can Be Good for Your Business

Teradata

Regulations like GDPR are an opportunity for many organizations. Reiner Kappenberger explains how data privacy can be good for your business.

Data 63

How OCR Can Help Employees Fight Through Most Mundane Tasks

InData Labs

These days, office employees need an AI hero. Can you imagine the number of hours wasted on handling a paper-based workflow? Isn’t it time to save employees from piles of paper? No one is saying it will be easy to eliminate paper documents promptly. For instance, in the legal sphere where the cost of a. The post How OCR Can Help Employees Fight Through Most Mundane Tasks first appeared on InData Labs.

IT 52

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

The Circle and Square, All You Need to Know About Data and Analytics

Teradata

Rob Armstrong uses the simple analogy of shapes to explain the complicated topic of data and analytics.

Data 40