Sat.Feb 02, 2019 - Fri.Feb 08, 2019

Cleaning And Curating Open Data For Archaeology

Data Engineering Podcast

Summary: Archaeologists collect and create a variety of data as part of their research and exploration. Open Context is a platform for cleaning, curating, and sharing this data. In this episode Eric Kansa describes how they process, clean, and normalize the data that they host, the challenges that they face with scaling ETL processes which require domain specific knowledge, and how the information contained in connections that they expose is being used for interesting projects.

Machine Learning with Python, Jupyter, KSQL and TensorFlow

Confluent

Building a scalable, reliable and performant machine learning (ML) infrastructure is not easy. It takes much more effort than just building an analytic model with Python and your favorite machine learning framework. After all, machine learning with Python requires the use of algorithms that allow computer programs to constantly learn, but building that infrastructure is several levels higher in complexity.

Protecting a Story’s Future with History and Science

Netflix Tech

By Kylee Peña, Chris Clark, and Mike Whipple. [Photo: Kylee’s parents after their wedding in 1978.] I, Kylee, have two photos from my parents’ wedding. Just two. This year they celebrated 40 years of marriage, so both photos were shot on film. Both capture a joy and awkwardness that come with young weddings. They’re fresh and full of life, candid captures from another era.

Introducing Cloudera DataFlow (CDF)

Cloudera

Late last year, the news of the merger between Hortonworks and Cloudera shook the industry and gave birth to the new Cloudera – the combined company with a focus on being an Enterprise Data Cloud leader and a product offering that spans from edge to AI. One of the most promising technology areas in this merger that already had a high growth potential and is poised for even more growth is the Data-in-Motion platform called Hortonworks DataFlow (HDF).

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Aggregator Leaf Tailer: An Alternative to Lambda Architecture for Real-Time Analytics

Rockset

Aggregator Leaf Tailer (ALT) is the data architecture favored by web-scale companies, like Facebook, LinkedIn, and Google, for its efficiency and scalability. In this blog post, I will describe the Aggregator Leaf Tailer architecture and its advantages for low-latency data processing and analytics. When we started Rockset, we set out to implement a real-time analytics engine that made the developer's job as simple as possible.

Open Source: January Updates - Celebrate 'I Love Free Software Day'

Zalando Engineering

Project Highlights: Lionel Montrieux brought Nakadi to FOSDEM 2019. This is one of the largest open source projects released by Zalando. Nakadi is a distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues. It is used in production by over a hundred teams daily and handles over 100 TB of data every day. Try out Nakadi!

More Trending

How ATB Financial is Utilizing Hybrid Cloud to Reduce the Time to Value for Big Data Analytics by 90 Percent

Cloudera

ATB Financial is Alberta’s largest homegrown financial institution, and prides itself on its customer obsession, putting over 750,000 Albertans at the centre of all that it does. As a result, ATB is constantly transforming to ensure it can continue to deliver unparalleled value to Albertans. A key pillar in the transformation journey is robust data operations that can help ATB deliver timely, relevant and delightful service.

The First Mistake of a CDO: Proposing Business Value

Teradata

Kevin Lewis explains the role of the chief data officer.

Data 56

On the Effectiveness of Online Marketing

Zalando Engineering

Measuring the incremental effect of online marketing to optimize advertising investment. One of the core values at Zalando is to be Customer Obsessed, and this applies to online marketing as well. For many Zalando customers, their experience starts with a catchy ad. Therefore, in Personalized Marketing, our mission is to reach customers with a personalized message and suggest products tailored to their needs or wants.

Scala 40

Defining a company policy to handle harassment in open source

Zalando Engineering

Open Source Participation: When you as a Zalando employee engage in open source communities as part of your work, you will interact with the wider open source communities outside Zalando. This is generally a good experience, and collaborating with many different types of developers with different backgrounds is generally a positive input to your personal development.

Coding 40

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

Simpler Is Better. Until It Isn’t.

Teradata

Teradata was recently named a leader in the Gartner Magic Quadrant for Data Management Solutions for Analytics.

IT 40