CockroachDB In Depth with Peter Mattis - Episode 35

Data Engineering Podcast

Summary: With the increased ease of gaining access to servers in data centers across the world has come the need for supporting globally distributed data storage. With the first wave of cloud-era databases, the ability to replicate information geographically came at the expense of transactions and familiar query languages.

Reflections On Designing A Data Platform From Scratch

Data Engineering Podcast

If you’re a Data Engineering Podcast listener, you get credits worth $3000 on an annual subscription. TimescaleDB, from your friends at Timescale, is the leading open-source relational database with support for time-series data. Time-series data is timestamped so you can measure how a system is changing.

Data Engineering Weekly #175

Data Engineering Weekly

OpenAI: Model Spec. LLMs are slowly emerging as the intelligent data storage layer. Similar to how data modeling techniques emerged during the burst of relational databases, we are starting to see similar strategies for fine-tuning and prompt templates. Will they co-exist or fight with each other?

Hadoop vs Spark: Main Big Data Tools Explained

AltexSoft

Master Nodes control and coordinate two key functions of Hadoop: data storage and parallel processing of data. Worker or Slave Nodes are the majority of nodes used to store data and run computations according to instructions from a master node. Data storage options. Data management and monitoring options.

Big Data Technologies that Everyone Should Know in 2024

Knowledge Hut

Each of these technologies has its own strengths and weaknesses, but all of them can be used to gain insights from large data sets. As organizations continue to generate more and more data, big data technologies will become increasingly essential. Let's explore the technologies available for big data.

Graph Databases In Production At Scale Using DGraph with Manish Jain - Episode 44

Data Engineering Podcast

There are a few ways that graph structures and properties can be implemented, including the ability to store data in the edges connecting nodes and the structures that can be contained within the nodes themselves. How do the query interface and data storage in DGraph differ from other options?

Unpacking Fauna: A Global Scale Cloud Native Database

Data Engineering Podcast

Summary: One of the biggest challenges for any business trying to grow and reach customers globally is how to scale its data storage. FaunaDB is a cloud-native database built by the engineers behind Twitter’s infrastructure and designed to serve the needs of modern systems.
