Sat.Sep 08, 2018 - Fri.Sep 14, 2018

article thumbnail

Keep Your Data And Query It Too Using Chaos Search with Thomas Hazel and Pete Cheslock - Episode 47

Data Engineering Podcast

Summary Elasticsearch is a powerful tool for storing and analyzing data, but when using it for logs and other time oriented information it can become problematic to keep all of your history. Chaos Search was started to make it easy for you to keep all of your data and make it usable in S3, so that you can have the best of both worlds. In this episode the CTO, Thomas Hazel, and VP of Product, Pete Cheslock, describe how they have built a platform to let you keep all of your history, save money, a

IT 100
article thumbnail

And the winners are…. Congratulations to the Sixth Annual Data Impact Awards winners

Cloudera

It’s a big week for us, as many Clouderans descend on New York for the Strata Data Conference. The week is typically filled with exciting announcements from Cloudera and many partners and others in the data management, machine learning and analytics industry. Last night we kicked it off with the sixth annual Data Impact Awards Celebration. These awards recognize organizations that transform complex data into actionable insights and illustrate impact to technology, science, health, lifestyle, and

article thumbnail

Shop the Look with Deep Learning

Zalando Engineering

Retrieving fashion products based on a query image Have you ever seen a picture on Instagram and thought, “Oh, wow! I want these shoes”? or been inspired by your favourite fashion blogger and looked for similar products (for example, on Zalando)? Visual search for fashion, the task of identifying fashion articles in an image and finding them in an online store, has been the subject of an ever growing body of scientific literature over the last few years (see for example [1-11]).

article thumbnail

Cloudera Data Warehouse – A Partner Perspective

Cloudera

Among the many reasons that a majority of large enterprises have adopted Cloudera Data Warehouse as their modern analytic platform of choice is the incredible ecosystem of partners that have emerged over recent years. In this new blog series, we will take a closer look at some of the most innovative partners, and how the Cloudera platform is helping them deliver groundbreaking solutions to our customers.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Building an Open Data Processing Pipeline for IoT

Cloudera

Authors: David Bericat, Global Technical Lead, Internet of Things, Red Hat and Jonathan Cooper-Ellis, Solutions Architect, Cloudera. Last week Cloudera introduced an open end-to-end architecture for IoT and the different components needed to help satisfy today’s enterprise needs regarding operational technology (OT), information technology (IT), data analytics and machine learning (ML), along with modern and traditional application development, deployment, and integration.

article thumbnail

Boosting enterprise machine learning with automated feature engineering

Cloudera

Machine learning. The very name suggests there’s little involvement required from actual people. It’s a bit surprising to note, then, that perhaps the most limiting factor in data science and machine learning today is people. People add complexity. People add the risk of error. And people add a lot of time. However, we’ll always need people to come up with the overarching prediction problems to solve and to make the ultimate choices to solve them, but there is a lot of data science work now that

article thumbnail

Altus Data Warehouse

Cloudera

We are proud to announce the general availability of Cloudera Altus Data Warehouse , the only cloud data warehousing service that brings the warehouse to the data. Cloudera’s modern data warehouse runs wherever it makes the most sense for your business – on-premises, public cloud, hybrid cloud, or even multi-cloud. Modern data warehousing for the cloud.