Sat. Jan 19, 2019 - Fri. Jan 25, 2019

Building Enterprise Big Data Systems At LEGO

Data Engineering Podcast

Summary: Building internal expertise around big data in a large organization is a major competitive advantage. However, it can be a difficult process due to compliance needs and the need to scale globally on day one. In this episode Jesper Søgaard and Keld Antonsen share the story of starting and growing the big data group at LEGO. They discuss the challenges of being at global scale from the start, hiring and training talented engineers, prototyping and deploying new systems in the cloud, and wh

Big Data 100

Big Data Fabric Weaves Together Automation, Scalability, and Intelligence

Cloudera

Today’s data landscape is characterized by exponentially increasing volumes of data, comprising a variety of structured, unstructured, and semi-structured data types originating from an expanding number of disparate data sources located on-premises, in the cloud, and at the edge. Alongside this evolving data ecosystem come business demands for reliable, trustworthy, up-to-date data to enable real-time, actionable insights.

Using Data to Answer the Key Challenge to Enterprise Reinforcement Learning

Teradata

Applying deep reinforcement learning to real-world problems has the potential to revolutionize how businesses tackle many of their core challenges.

Data 45

Live Dashboards with Redash and Rockset

Rockset

Redash is a powerful open source query and visualization tool that helps you make sense of your data. It connects to a variety of data sources and also includes a native connector for Rockset. In this post we will demonstrate how to use Redash to build live dashboards on Rockset data sets. Configure: If you've never used Redash before, you need to set it up first.

SQL 40

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

A Day in the Life of a Frontend Engineer at Zalando

Zalando Engineering

You’ve probably never had the same day twice at your current job. At Zalando it’s no different. Here, it not only depends on the product you're currently working on but also on your peers. Actually, what's expected from a frontend engineer can vary according to a company's philosophy or your own previous experience: a frontend engineer is usually seen as a Swiss army knife, whereas at Zalando, for example, we see them as masters of trades.

Running Fast SQL on DynamoDB Tables

Rockset

Have you ever wanted to run SQL queries on Amazon DynamoDB tables without impacting your production workloads? Wouldn't it be great to do so without needing to set up an ETL job and then having to manually monitor that job? In this blog, I will discuss how Rockset integrates with DynamoDB and continuously updates a collection automatically as new objects are added to a DynamoDB table.

SQL 40

More Trending

How Painful is it (Really) to Switch Cloud Providers?

Teradata

Ron Luebke discusses the pains of switching cloud providers.

Cloud 40

Rockset adds Excel spreadsheet support: Use SQL across XLSX files and join with other JSON, CSV or Parquet data

Rockset

An incredible amount of business data is floating around in Excel spreadsheets - so data scientists often need to analyze data across multiple worksheets or even multiple spreadsheets using SQL. Additionally, this data may need to be joined with other data sets that are in JSON, CSV or Parquet formats. Microsoft Excel currently has some basic SQL support in place: Use SQL for connecting to an external database like Access or SQL Server, parsing field or table contents and importing the data.

SQL 40