article thumbnail

Learning System Design: Top 5 Essential Reads

KDnuggets

Explore system design with these expert-recommended books.

Designing 150
article thumbnail

Data Engineering Interview Series #2: System Design

Start Data Engineering

Pipeline design] Design data pipelines to populate your data models 2.5. [Requirements gathering] Make sure you clearly understand the requirements & business use case 2.2. Understand source data] Know what you have to work with 2.3. Model your data] Define data models for historical analytics 2.4.

Designing 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Designing Data Transfer Systems That Scale

Data Engineering Podcast

Data transfer systems are a critical component of data enablement, and building them to support large volumes of information is a complex endeavor. With Datafold, you can seamlessly plan, translate, and validate data across systems, massively accelerating your migration project. When is DoubleCloud Data Transfer the wrong choice?

Systems 130
article thumbnail

Establishing a Large Scale Learned Retrieval System at Pinterest

Pinterest Engineering

Modern large-scale recommendation systems usually include multiple stages where retrieval aims at retrieving candidates from billions of candidate pools, and ranking predicts which item a user tends to engage from the trimmed candidate set retrieved from early stages [2]. General multi-stage recommendation system design in Pinterest.

Systems 67
article thumbnail

LLMs in Production: Tooling, Process, and Team Structure

Speaker: Dr. Greg Loughnane and Chris Alexiuk

However, during development – and even more so once deployed to production – best practices for operating and improving generative AI applications are less understood.

article thumbnail

8 Essential Data Pipeline Design Patterns You Should Know

Monte Carlo

That’s where data pipeline design patterns come in. So, why does choosing the right data pipeline design matter? In this guide, we’ll explore the patterns that can help you design data pipelines that actually work. Table of Contents Common Data Pipeline Design Patterns Explained 1. Batch Processing Pattern 2.

article thumbnail

Designing A Non-Relational Database Engine

Data Engineering Podcast

In this episode Oren Eini, CEO and creator of RavenDB, explores the nuances of relational vs. non-relational engines, and the strategies for designing a non-relational database. When designing and building a database, what are the initial set of questions that need to be answered? Can you describe what constitutes a NoSQL database?

article thumbnail

Leading the Development of Profitable and Sustainable Products

Speaker: Jason Tanner

A sustainable business model contains a system of interrelated choices made not once but over time. Discover how to design and evolve profit streams over time, focusing on solution sustainability, economic sustainability, and relationship sustainability.