In today’s data-driven world, developer productivity is essential for organizations to build effective and reliable products, accelerate time to value, and fuel ongoing innovation. For a deeper dive into Snowflake’s Python API and other native Snowflake DevOps features, register for the Snowflake Data Cloud Summit 2024.
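As a taste of what driving Snowflake from Python looks like, here is a minimal sketch using the classic snowflake-connector-python package; note this is not the newer Snowflake Python API the article covers, and the connection values are placeholders.

```python
# A minimal sketch of programmatic Snowflake access with snowflake-connector-python.
# Account, user, and object names are hypothetical placeholders.
import snowflake.connector

conn = snowflake.connector.connect(
    account="my_account",   # placeholder account identifier
    user="my_user",
    password="...",         # prefer key-pair or SSO auth in practice
    warehouse="COMPUTE_WH",
)
try:
    cur = conn.cursor()
    # Create an object idempotently, then confirm it exists.
    cur.execute("CREATE SCHEMA IF NOT EXISTS analytics")
    cur.execute("SHOW SCHEMAS LIKE 'ANALYTICS'")
    print(cur.fetchall())
finally:
    conn.close()
```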
Summary: How much time do you spend maintaining your data pipeline? This was a fascinating conversation with someone who has spent his entire career working on simplifying complex data problems. How does the data-centric approach of DataCoral differ from the way that other platforms think about processing information?
The data generated was as varied as the departments relying on these applications. Some departments used IBM Db2, while others relied on VSAM files or IMS databases, creating complex data governance processes and costly data pipeline maintenance.
To tackle these challenges, we’re thrilled to announce CDP Data Engineering (DE), the only cloud-native service purpose-built for enterprise data engineering teams. It offers native Apache Airflow and robust APIs for orchestrating and automating job scheduling and delivering complex data pipelines anywhere.
He highlights the role of data teams in modern organizations and how Synq is empowering them to achieve this. Announcements: Hello and welcome to the Data Engineering Podcast, the show about modern data management. Data lakes are notoriously complex.
When data reaches the Gold layer, it is highly curated and structured, offering a single version of the truth for decision-makers across the organization. We have also seen a fourth layer, the Platinum layer, in some companies’ proposals, extending the data pipeline to OneLake and Microsoft Fabric.
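To make the Silver-to-Gold promotion concrete, here is an illustrative pandas sketch; the table names, columns, and aggregation are invented for the example and do not come from the article.

```python
# A minimal medallion-architecture sketch: aggregate event-grained Silver
# records into a curated, business-ready Gold table. All data is invented.
import pandas as pd

# Silver: cleaned but still event-grained data.
silver_orders = pd.DataFrame({
    "order_id": [1, 2, 3, 4],
    "region":   ["EU", "EU", "US", "US"],
    "amount":   [120.0, 80.0, 200.0, 50.0],
})

# Gold: aggregated "single version of the truth" for decision-makers.
gold_revenue_by_region = (
    silver_orders.groupby("region", as_index=False)["amount"]
    .sum()
    .rename(columns={"amount": "total_revenue"})
)
print(gold_revenue_by_region)
```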
With Astro, you can build, run, and observe your data pipelines in one place, ensuring your mission-critical data is delivered on time. Generative AI demands the processing of vast amounts of diverse, unstructured data.
impactdatasummit.com Thumbtack: What we learned building an ML infrastructure team at Thumbtack. Thumbtack shares valuable insights from building its ML infrastructure team. The blog emphasizes the importance of starting with a clear client focus to avoid over-engineering and ensure user-centric development.
In 2020, Snowflake announced a new global competition to recognize the work of early-stage startups building their apps — and their businesses — on Snowflake, offering up to $250,000 in investment as the top prize. Just as varied was the list of Snowflake tech that early-stage startups are using to drive their innovative entries.
Snowflake is completely managed, but its main focus is on the data warehouse layer, and users need to integrate with other tools for BI, ML, or ETL. Ideal for: business-centric workflows involving Fabric; Snowflake suits environments with a lot of developers and data engineers.
Segment created the Unify product to reduce the burden of building a comprehensive view of customers and synchronizing it to all of the systems that need it. In this episode Kevin Niparko and Hanhan Wang share the details of how it is implemented and how you can use it to build and maintain rich customer profiles.
TL;DR: After setting up and organizing the teams, we describe four topics to make data mesh a reality. How do we build data products? The next problem will be the diversity of these mini data platforms (because of the configuration), and you go even deeper into problems when managing different technologies or versions.
One paper suggests that there is a need for a re-orientation of the healthcare industry to be more "patient-centric". Furthermore, clean and accessible data, along with data driven automations, can assist medical professionals in taking this patient-centric approach by freeing them from some time-consuming processes.
In the modern world of data engineering, two concepts often find themselves in a semantic tug-of-war: data pipeline and ETL. Fast forward to the present day, and we now have data pipelines. Data ingestion is the first step of both ETL and data pipelines.
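Since ingestion is the shared first step, here is a minimal sketch of landing raw records before any transformation; the endpoint URL is a placeholder, not a real source.

```python
# A minimal ingestion sketch: fetch raw CSV rows and hand them off unchanged;
# downstream ETL or pipeline steps do the transformation.
import csv
import io
import urllib.request

SOURCE_URL = "https://example.com/export.csv"  # hypothetical source

def ingest(url: str) -> list[dict]:
    """Return raw rows exactly as the source provides them."""
    with urllib.request.urlopen(url) as resp:
        text = resp.read().decode("utf-8")
    return list(csv.DictReader(io.StringIO(text)))

# rows = ingest(SOURCE_URL)  # both ETL and broader data pipelines start here
```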
Going into the Data Pipeline Automation Summit 2023, we were thrilled to connect with our customers and partners and share the innovations we’ve been working on at Ascend. The summit explored the future of data pipeline automation and the endless possibilities it presents.
In the fast-paced world of software development, the efficiency of build processes plays a crucial role in maintaining productivity and code quality. At ThoughtSpot, while Gradle has been effective, the growing complexity of our projects demanded a more sophisticated approach to understanding and optimizing our builds.
Summary: Data engineers have typically left the process of data labeling to data scientists or other roles because of its nature as a manual, process-heavy undertaking, focusing instead on building automation and repeatable systems. Data stacks are becoming more and more complex.
NVIDIA released Eagle, a vision-centric multimodal LLM. Look at the example in the GitHub repo: given an image and a user input, the LLM is able to answer things like "Describe the image in detail" or "Which car in the picture is more aerodynamic?" based on a drawing.
Data engineering is typically a software engineering role that focuses deeply on data – namely, data workflows, data pipelines, and the ETL (Extract, Transform, Load) process. What is the role of a data engineer? Data scientists and data analysts depend on data engineers to build these data pipelines.
To enable LGIM to better utilize its wealth of data, LGIM required a centralized platform that made internal data discovery easy for all teams and could securely integrate external partners and third-party outsourced data pipelines.
Of course, this is not to imply that companies will become only software (there are still plenty of people in even the most software-centric companies), just that the full scope of the business is captured in an integrated, software-defined process. Our approach to building this platform is from the bottom up. Confluent’s mission.
DataOps is fundamentally about eliminating errors, reducing cycle time, building trust and increasing agility. The datapipelines must contend with a high level of complexity – over seventy data sources and a variety of cadences, including daily/weekly updates and builds.
This means moving beyond product-centric thinking to a data-driven customer experience model that’s consistent across all channels. Next, the wealth management industry is also shifting away from a product focus to a client-centric model. DataOS is the world’s first data operating system.
The Netflix video processing pipeline went live with the launch of our streaming service in 2007. By integrating with studio content systems, we enabled the pipeline to leverage rich metadata from the creative side and create more engaging member experiences like interactive storytelling.
Here is the agenda: 1) Data Application Lifecycle Management - Harish Kumar (PayPal). Hear from the team at PayPal on how they build the data product lifecycle management (DPLM) systems. 4) Building Data Products, and why should you? Part 1: Why did we need to build our own SIEM?
Take Astro (the fully managed Airflow solution) for a test drive today and unlock a suite of features designed to simplify, optimize, and scale your data pipelines. Try For Free → Conference Alert: Data Engineering for AI/ML. This is a virtual conference at the intersection of data and AI.
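For readers unfamiliar with the Airflow workflows Astro runs, here is a minimal Airflow 2.x DAG sketch; the task bodies and DAG id are placeholders, not an Astro or Astronomer API.

```python
# A minimal Airflow 2.x DAG: two Python tasks chained in a daily schedule.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pull data from source")

def load():
    print("write data to warehouse")

with DAG(
    dag_id="daily_pipeline",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",  # Airflow 2.4+ spelling; older versions use schedule_interval
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)
    extract_task >> load_task  # run extract before load
```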
Your IT organization may have a permanent data lake, but data analytics teams need the ability to rapidly create insight from data. The DataKitchen Platform serves as a process hub that builds temporary analytic databases for daily and weekly ad hoc analytics work. Figure 3: Example process hub for biologic launch.
A data scientist is only as good as the data they have access to. Most companies store their data in a variety of formats across databases and text files. This is where data engineers come in — they build pipelines that transform that data into formats that data scientists can use.
Business users are unable to find and access data assets critical to their workflows. Data engineers spend countless hours troubleshooting broken pipelines. The data team is constantly burning out and has a high employee turnover. Stakeholders fail to see the ROI behind expensive data initiatives.
Treating data as a product is more than a concept; it’s a paradigm shift that can significantly elevate the value that business intelligence and data-centric decision-making have on the business. Data pipelines, data integrity, data lineage, data stewardship, data catalog, data product costing: let’s review each one in detail.
The limited reusability of data assets further exacerbates this agility challenge. Already operating at capacity, data teams often find themselves repeating efforts, rebuilding similar data pipelines and models for each new project. As businesses grow and evolve, their data needs expand exponentially.
In a nutshell, DataOps engineers are responsible not only for designing and building data pipelines, but for iterating on them via automation and collaboration as well. So, does this mean you should choose DataOps engineering over data engineering when considering your next career move? What does a DataOps engineer do?
Data cascades are said to be pervasive and to lack immediate visibility, yet eventually to impact the world in a negative manner. Related to this neglect of data quality, it has been observed that much of the effort in AI has been model-centric; that is, mostly devoted to developing and improving models, given fixed data sets.
Key Partnership Benefits: Cost Optimization and Efficiency: The collaboration is poised to reduce IT and data management costs significantly, including up to a 68% reduction in data stack spend and the ability to build data pipelines 7.5x faster. About Ascend.io: Learn more at Ascend.io or follow us @ascend_io.
It can involve prompt engineering, vector databases like Pinecone, embedding vectors and semantic layers, data modeling, data orchestration, and data pipelines – all tailored for RAG. But when it’s done right, RAG can add an incredible amount of value to AI-powered data products. What is Fine Tuning?
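To show the core retrieval step of RAG, here is a self-contained toy sketch; the embeddings are fake random vectors and call_llm is a placeholder, not a real API. In practice you would use a real embedding model and a vector database such as the Pinecone service mentioned above.

```python
# A toy RAG sketch: embed documents, retrieve the closest one for a query by
# cosine similarity, and stuff it into a prompt. Everything here is fake data.
import numpy as np

rng = np.random.default_rng(0)
docs = ["refund policy text", "shipping policy text", "privacy policy text"]
doc_vecs = rng.normal(size=(len(docs), 8))  # stand-in document embeddings

def embed(text: str) -> np.ndarray:
    return rng.normal(size=8)               # stand-in for an embedding model

def retrieve(query: str, k: int = 1) -> list[str]:
    q = embed(query)
    sims = doc_vecs @ q / (np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q))
    return [docs[i] for i in np.argsort(-sims)[:k]]

def call_llm(prompt: str) -> str:            # placeholder LLM call
    return f"(answer grounded in: {prompt!r})"

context = retrieve("How do refunds work?")[0]
print(call_llm(f"Answer using this context: {context}"))
```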
ADF connects to various data sources, including on-premises systems, cloud services, and SaaS applications. It then gathers and relocates information to a centralized hub in the cloud using the Copy Activity within data pipelines. Transform and Enhance the Data: Once centralized, data undergoes transformation and enrichment.
At Snowflake, we’re helping data scientists, data engineers, and application developers build faster and more efficiently in the Data Cloud. Streamlit gives data scientists and Python developers the ability to quickly turn data and models into interactive, enterprise-ready applications.
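As a flavor of the data-to-app workflow Streamlit enables, here is a minimal app sketch; the dataset is invented. Save it as app.py and run it with `streamlit run app.py`.

```python
# A minimal Streamlit app: one chart plus an interactive filter over toy data.
import pandas as pd
import streamlit as st

df = pd.DataFrame({"day": [1, 2, 3, 4], "signups": [12, 30, 25, 41]})

st.title("Signups dashboard")
threshold = st.slider("Show days with at least", 0, 50, 20)
st.line_chart(df.set_index("day"))
st.dataframe(df[df["signups"] >= threshold])
```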
Creating business value from the onslaught of data can feel like captaining a high-tech vessel through uncharted waters. Data teams across business areas are cranking out data sets in response to impatient business requests, while simultaneously trying to build disparate DataOps processes from scratch.
The Nuances of Snowflake Costing: Snowflake’s pricing strategy exemplifies its user-centric approach: pay for what you use. The Predictability of Pipelines: In stark contrast to ad-hoc queries, pipelines are where cost optimization efforts can yield significant dividends.
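The predictability point lends itself to back-of-the-envelope arithmetic: a pipeline’s cost is roughly warehouse credits per hour times runtime times the credit price. The sketch below uses illustrative assumed rates, not published Snowflake prices.

```python
# A back-of-the-envelope model of pay-for-what-you-use pipeline cost.
# Credit rates and price are assumptions for illustration only.
CREDITS_PER_HOUR = {"XS": 1, "S": 2, "M": 4, "L": 8}  # assumed doubling pattern
PRICE_PER_CREDIT = 3.00  # assumed USD rate; varies by edition and region

def pipeline_cost(warehouse_size: str, runtime_hours: float) -> float:
    return CREDITS_PER_HOUR[warehouse_size] * runtime_hours * PRICE_PER_CREDIT

# A predictable daily pipeline: 0.5 h on a Medium warehouse, 30 days a month.
print(f"${pipeline_cost('M', 0.5) * 30:.2f}/month")  # 4 * 0.5 * 3 * 30 = $180
```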
Data Engineering Weekly is brought to you by RudderStack. RudderStack Profiles takes the SaaS guesswork and SQL grunt work out of building complete customer profiles, so you can quickly ship actionable, enriched data to every downstream team. Now every team can build a customer 360 in Snowflake with RudderStack Profiles.
This provided a nice overview of the breadth of topics that are relevant to data engineering, including data warehouses/lakes, pipelines, metadata, security, compliance, quality, and working with other teams. Open question: how to seed data in a staging environment? Test the system with an A/A test. Be adaptable.
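One common answer to that seeding question is to sample and mask production rows; the sketch below is only one possible approach, and its table shape and masking rule are invented for illustration.

```python
# A hedged staging-seed sketch: sample production rows, mask PII, load the
# result into staging. Data, sampling rate, and masking rule are assumptions.
import hashlib

import pandas as pd

def mask_email(email: str) -> str:
    # Deterministic pseudonym so joins still work across seeded tables.
    return hashlib.sha256(email.encode()).hexdigest()[:12] + "@example.com"

prod = pd.DataFrame({
    "user_id": [1, 2, 3],
    "email": ["a@x.com", "b@y.com", "c@z.com"],
    "spend": [10.0, 250.0, 99.0],
})

staging_seed = prod.sample(frac=0.67, random_state=42).assign(
    email=lambda df: df["email"].map(mask_email)
)
print(staging_seed)
```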
Comparing the two roles on several factors: data engineers create, maintain, and optimize data infrastructure for data. In addition, they are responsible for developing pipelines that turn raw data into formats that data consumers can use easily, and for assessing the needs and goals of the business.
For those aspiring to build a career within the Azure ecosystem, navigating the choices between Azure Data Engineers and Azure DevOps Engineers can be quite challenging. Azure Data Engineers and Azure DevOps Engineers are two critical components of the Azure ecosystem for different but interconnected reasons.
As data pipelines become increasingly complex, investing in a data quality solution is becoming an important priority for modern data teams. But should you build it—or buy it? And for those just getting started? For your Ubers, Airbnbs, and Netflixes of the world, this is no problem.
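For a sense of what the "build it" path starts from, here is a minimal sketch of hand-rolled quality checks; the column names and thresholds are assumptions for illustration.

```python
# A minimal build-it-yourself data quality sketch: a few assertions over a
# DataFrame. Columns and rules are invented for the example.
import pandas as pd

def check_quality(df: pd.DataFrame) -> list[str]:
    failures = []
    if df["order_id"].isna().any():
        failures.append("order_id contains nulls")
    if df["order_id"].duplicated().any():
        failures.append("order_id is not unique")
    if (df["amount"] < 0).any():
        failures.append("negative amounts found")
    return failures

df = pd.DataFrame({"order_id": [1, 2, 2], "amount": [10.0, -5.0, 7.0]})
print(check_quality(df) or "all checks passed")
```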