Sat. Aug 18, 2018 - Fri. Aug 24, 2018


Graph Databases In Production At Scale Using DGraph with Manish Jain - Episode 44

Data Engineering Podcast

Summary: The way that you store your data can have a huge impact on the ways that it can be practically used. For a substantial number of use cases, the optimal format for storing and querying that information is as a graph. However, databases architected around that use case have historically been difficult to use at scale or for serving fast, distributed queries.

Database 100

Making slow queries fast with composite indexes in MySQL

nodeSWAT

This post expects some basic knowledge of SQL. Examples were made using MySQL 5.7.18 and run on my mid-2014 MacBook Pro. Query execution times are based on multiple executions so index caching can kick in. The use case came from a real application and the solution is used in production. So you have inserted preliminary data into your database and run a simple COUNT(*) query against it with a simple WHERE clause and… the spinner is still run

MySQL 52

Zalando at the DatSci Awards 2018

Zalando Engineering

Building data science products in multidisciplinary teams. For the last three years, I have been working on different data science projects at Zalando, helping our more than 24 million customers find the most relevant items in our assortment. Along the way, I have learned how to scale data science, or how to build a new personalization product from scratch.


Top 5 Reasons to Learn AWS

ProjectPro

“Cloud is now what we call the new normal. It’s no longer an experiment, it’s no longer an after-thought,” said Vincent Quah, regional head of education, research, healthcare and non-profit organizations at AWS. Why should I learn AWS? Cloud computing is taking the tech world by storm, and so is the need to learn cloud computing.

AWS 40

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.