Data Architecture, Data Process and Pipeline-centric

The Race For Data Quality in a Medallion Architecture

DataKitchen

NOVEMBER 5, 2024

The Medallion architecture pattern is gaining traction among data teams. It is a layered approach to managing and transforming data. It sounds great, but how do you prove the data is correct at each layer? How do you ensure data quality in every layer?

Architecture

Architecture Raw Data Pipeline-centric Data Ingestion
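
As an illustration of checking data quality at each layer, here is a minimal Python sketch. It assumes the bronze, silver, and gold layer names commonly used with the Medallion pattern; the check functions, the `id` field, and the `LAYER_CHECKS` mapping are hypothetical and are not taken from the DataKitchen article.

```python
from typing import Callable

Row = dict[str, object]
Check = Callable[[list[Row]], bool]


def not_empty(rows: list[Row]) -> bool:
    """The layer received at least one row."""
    return len(rows) > 0


def no_null_ids(rows: list[Row]) -> bool:
    """Every row carries a non-null `id` (hypothetical key column)."""
    return all(row.get("id") is not None for row in rows)


def unique_ids(rows: list[Row]) -> bool:
    """No two rows share the same `id`."""
    ids = [row.get("id") for row in rows]
    return len(ids) == len(set(ids))


# Assumption: checks get stricter as data moves from raw to curated layers.
LAYER_CHECKS: dict[str, list[Check]] = {
    "bronze": [not_empty],
    "silver": [not_empty, no_null_ids],
    "gold": [not_empty, no_null_ids, unique_ids],
}


def run_layer_checks(layer: str, rows: list[Row]) -> list[str]:
    """Return the names of the checks that failed for the given layer."""
    return [check.__name__ for check in LAYER_CHECKS[layer] if not check(rows)]


sample = [{"id": 1}, {"id": 1}, {"id": None}]
for layer in ("bronze", "silver", "gold"):
    print(layer, run_layer_checks(layer, sample))
```

Returning the names of failed checks, rather than raising on the first failure, lets a pipeline report every problem in a layer before deciding whether to promote the data to the next one.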

Microsoft Fabric vs. Snowflake: Key Differences You Need to Know

Edureka

APRIL 22, 2025

Snowflake's multi-cluster shared data architecture is one of its primary features. Additionally, Fabric has deep integrations with Power BI for visualization and Microsoft Purview for governance, resulting in a smooth experience for both business users and data professionals.

BI Pipeline-centric Data Lake Google Cloud

Webinars

Precision in Motion: Why Process Optimization Is the Future of Manufacturing

Airflow Best Practices for ETL/ELT Pipelines

The Rise of Streaming Data Architectures: What You Need to Know

Precisely

JANUARY 6, 2025

Customers expect immediate responses and personalized interactions, and streaming data architectures help you meet these expectations. Integrated and scalable architectures drive business agility. Your ability to deliver seamless, personalized, and timely experiences is key to success in our modern customer-centric landscape.

Data Architecture

Data Architecture Architecture Pipeline-centric Banking

The Top Snowflake Integrations Every Data Team Should Know

Monte Carlo

JULY 28, 2025

The main benefits of a well-integrated Snowflake environment include automation of repetitive tasks, scalability that grows with your data volumes, data + AI observability that catches issues before they impact users, compliance features that satisfy regulators, and data democratization that puts insights in everyone’s hands.

BI Pipeline-centric Data Ingestion Government

How to Crack Amazon Data Engineer Interview in 2025?

ProjectPro

JUNE 6, 2025

This section will cover the most commonly asked questions for an Amazon Data Engineer interview. Candidates should focus on Data Modelling, ETL Processes, Data Warehousing, Big Data Technologies, Programming Skills, AWS services, data processing technologies, and real-world problem-solving scenarios.

Data Engineering

Data Engineering Data Engineer Engineering NoSQL

7 Best Data Warehousing Tools for Efficient Data Storage Needs

ProjectPro

JUNE 6, 2025

Google BigQuery pricing is based on a pay-as-you-go model, primarily determined by the amount of data processed during queries and the storage capacity used. Query costs are calculated per terabyte processed, with a free monthly tier available. Moving data in and out can be time-consuming.

Data Storage

Data Storage PostgreSQL Data Warehouse AWS
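
The pay-as-you-go model described in the BigQuery excerpt above can be sketched as simple arithmetic. The function below is a hypothetical estimator, not part of any Google API, and the rates passed in the example are placeholder values chosen for illustration only; actual prices and free-tier sizes should be taken from Google's current pricing page.

```python
def estimate_monthly_cost(
    tb_processed: float,
    storage_gb: float,
    price_per_tb: float,
    price_per_gb_month: float,
    free_tb: float = 0.0,
) -> float:
    """Estimate a monthly bill from query volume and storage.

    All rates are caller-supplied assumptions, not real BigQuery prices.
    """
    # Only the terabytes beyond the free monthly tier are billed.
    billable_tb = max(0.0, tb_processed - free_tb)
    query_cost = billable_tb * price_per_tb
    storage_cost = storage_gb * price_per_gb_month
    return query_cost + storage_cost


# Placeholder rates for illustration only.
cost = estimate_monthly_cost(
    tb_processed=3.0,
    storage_gb=100.0,
    price_per_tb=5.0,
    price_per_gb_month=0.02,
    free_tb=1.0,
)
print(f"Estimated monthly cost: {cost:.2f}")
```

Keeping the rates as parameters makes the dependence on query volume explicit: under this model, reducing the data scanned per query lowers the bill directly, while storage is charged regardless of usage.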

Centralize Your Data Processes With a DataOps Process Hub

DataKitchen

NOVEMBER 4, 2021

Data organizations often have a mix of centralized and decentralized activity. DataOps concerns itself with the complex flow of data across teams, data centers and organizational boundaries. It expands beyond tools and data architecture and views the data organization from the perspective of its processes and workflows.

Process

Process Data Process Pharmaceutical Data Lake

Building a Scalable Search Architecture

Confluent

JUNE 18, 2019

It involves many moving parts, from data preparation to building indexing and query pipelines. Luckily, this task looks a lot like the way we tackle problems that arise when connecting data. Building an indexing pipeline at scale with Kafka Connect. It is a natural evolution from the initial application-centric setup.

Architecture

Architecture Building Kafka Database-centric

The Good and the Bad of Apache Spark Big Data Processing

AltexSoft

JULY 18, 2023

Its flexibility allows it to operate on single-node machines and large clusters, serving as a multi-language platform for executing data engineering, data science, and machine learning tasks. Before diving into the world of Spark, we suggest you get acquainted with data engineering in general. Big data processing.

Big Data

Big Data Data Process Process Hadoop

How to Become a Data Engineer in 2024?

Knowledge Hut

DECEMBER 26, 2023

Data Engineering is typically a software engineering role that focuses deeply on data – namely, data workflows, data pipelines, and the ETL (Extract, Transform, Load) process. What is the role of a Data Engineer? They are required to have deep knowledge of distributed systems and computer science.

Data Engineering

Data Engineering Data Engineer Engineering Pipeline-centric
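
To make the ETL (Extract, Transform, Load) process mentioned in the excerpt above concrete, here is a minimal sketch using only the Python standard library. The CSV content, the `payments` table, and the function names are hypothetical and chosen for illustration; a production pipeline would read from real sources and typically run under an orchestrator.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; the second row has a missing amount.
RAW_CSV = "id,amount\n1,10.5\n2,\n3,7\n"


def extract(text: str) -> list[dict[str, str]]:
    """Extract: parse raw CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows: list[dict[str, str]]) -> list[tuple[int, float]]:
    """Transform: drop rows with a missing amount and cast column types."""
    return [(int(row["id"]), float(row["amount"])) for row in rows if row["amount"]]


def load(conn: sqlite3.Connection, records: list[tuple[int, float]]) -> None:
    """Load: write the cleaned records into a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS payments (id INTEGER PRIMARY KEY, amount REAL)"
    )
    conn.executemany("INSERT INTO payments VALUES (?, ?)", records)
    conn.commit()


def run_pipeline(text: str, conn: sqlite3.Connection) -> int:
    """Run the three stages in order and return the number of rows loaded."""
    records = transform(extract(text))
    load(conn, records)
    return len(records)


conn = sqlite3.connect(":memory:")
loaded = run_pipeline(RAW_CSV, conn)
print(f"Loaded {loaded} rows")
```

Keeping each stage as its own function means each one can be tested in isolation, which is one reason pipelines are often structured this way.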

Data Engineer Roles And Responsibilities 2022

U-Next

AUGUST 17, 2022

Data Engineers must be proficient in Python to create complicated, scalable algorithms. This language provides a solid basis for big data processing and is effective, flexible, and ideal for text analytics. To create autonomous data streams, Data Engineering teams use AWS. Responsibilities of a Data Engineer.

Data Engineering

Data Engineering Data Engineer Database-centric Pipeline-centric

Data Lineage Tools: Key Capabilities and 5 Notable Solutions

Databand.ai

JULY 19, 2023

This capability is useful for businesses, as it provides a clear and comprehensive view of their data’s history and transformations. Data lineage tools are not a new concept. In this article: Why Are Data Lineage Tools Important? One of the unique features of Atlan is its human-centric design.

Pipeline-centric

Pipeline-centric Data Governance Metadata Government

Azure Synapse vs Databricks: 2023 Comparison Guide

Knowledge Hut

SEPTEMBER 26, 2023

Organisations are constantly looking for robust and effective platforms to manage and derive value from their data in the constantly changing landscape of data analytics and processing. These platforms provide strong capabilities for data processing, storage, and analytics, enabling companies to fully use their data assets.

Data Lake

Data Lake Database-centric Pipeline-centric Machine Learning

How to Become an Azure Data Engineer? 2023 Roadmap

Knowledge Hut

NOVEMBER 17, 2023

The demand for data-related professions, including data engineering, has indeed been on the rise due to the increasing importance of data-driven decision-making in various industries. Becoming an Azure Data Engineer in this data-centric landscape is a promising career choice.

Data Engineering

Data Engineering Data Engineer Engineering Scala

Snowpark Offers Expanded Capabilities Including Fully Managed Containers, Native ML APIs, New Python Versions, External Access, Enhanced DevOps and More

Snowflake

JUNE 28, 2023

Snowpark is our secure deployment and processing of non-SQL code, consisting of two layers: Familiar Client Side Libraries – Snowpark brings deeply integrated, DataFrame-style programming and OSS compatible APIs to the languages data practitioners like to use. Previously, tasks could be executed as quickly as 1-minute.

Python

Python Accessible Accessibility Pipeline-centric

The Ultimate Modern Data Stack Migration Guide

phData: Data Engineering

JULY 18, 2023

Slow Response to New Information: Legacy data systems often lack the computation power necessary to run efficiently and can be cost-inefficient to scale. This typically results in long-running ETL pipelines that cause decisions to be made on stale or old data.

Data Warehouse

Data Warehouse Pipeline-centric Government Data

Azure Synapse vs. Databricks – What Are the Differences?

Edureka

JULY 4, 2024

Databricks runs on an optimized Spark version and gives you the option to select GPU-enabled clusters, making it more suitable for complex data processing. The platform’s massively parallel processing (MPP) architecture empowers you with high-performance querying of even massive datasets.

Data Lake

Data Lake Pipeline-centric Data Warehouse ETL Tools

The Top Data Analytics and Science Influencers and Content Creators on LinkedIn

Databand.ai

DECEMBER 20, 2022

Neelesh regularly shares his advice across channels, including as a recent guest on Databand’s MAD Data Podcast, where he spoke about how engineering can deliver better value for data science. On LinkedIn, he posts frequently about data engineering, data architecture, interview preparation, and career advice.

Data Analytics

Data Analytics Google Cloud Data Science Data Mining

Data Engineering Digest
