article thumbnail

Inside Agoda’s Private Cloud - Exclusive

The Pragmatic Engineer

In a previous two-part series , we dived into Uber’s multi-year project to move onto the cloud , away from operating its own data centers. But there’s no “one size fits all” strategy when it comes to deciding the right balance between utilizing the cloud and operating your infrastructure on-premises.

Cloud 251
article thumbnail

On-Premise vs Cloud: Where Does the Future of Data Storage Lie?

Monte Carlo

Generative AI and machine learning Data teams are acutely aware of the GenAI wave , and many industry watchers suspect that this emerging technology is driving a huge wave of infrastructure modernization and utilization.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Building Meta’s GenAI Infrastructure

Engineering at Meta

Storage Storage plays an important role in AI training, and yet is one of the least talked-about aspects. As the GenAI training jobs become more multimodal over time, consuming large amounts of image, video, and text data, the need for data storage grows rapidly.

Building 145
article thumbnail

How To Future-Proof Your Data Pipelines

Ascend.io

This elasticity allows data pipelines to scale up or down as needed, optimizing resource utilization and cost efficiency. Ensure the provider supports the infrastructure necessary for your data needs, such as managed databases, storage, and data pipeline services.

article thumbnail

Data Engineering Weekly #206

Data Engineering Weekly

DeepSeek development involves a unique training recipe that generates a large dataset of long chain-of-thought reasoning examples, utilizes an interim high-quality reasoning model, and employs large-scale reinforcement learning (RL). It employs a two-tower model approach to learn query and item embeddings from user engagement data.

article thumbnail

Why Open Table Format Architecture is Essential for Modern Data Systems

phData: Data Engineering

The world we live in today presents larger datasets, more complex data, and diverse needs, all of which call for efficient, scalable data systems. Though basic and easy to use, traditional table storage formats struggle to keep up. Track data files within the table along with their column statistics. Contact phData Today!

article thumbnail

Turbocharging Atlas: How we reduced server initialization time to less than 2 minutes

ThoughtSpot

It stores all the metadata created within a ThoughtSpot instance to enable efficient querying, retrieval, and management of data objects. While Atlas operates as an in-memory graph database for speed and performance, it uses PostgreSQL as its persistent storage layer to ensure durability and long-term data storage.