Introduction Data is the new oil of this century, and the database is the central element of a data science project. To generate actionable insights, the database must be centralized and organized efficiently. So, we are […] The post How to Normalize Relational Databases With SQL Code?
In this episode Oren Eini, CEO and creator of RavenDB, explores the nuances of relational vs. non-relational engines, and the strategies for designing a non-relational database. Datafold has recently launched data replication testing, providing ongoing validation for source-to-target replication.
Does the LLM capture all the relevant data and context required for it to deliver useful insights? (Not to mention the crazy stories about Gen AI making up answers without the data to back it up!) Are we allowed to use all the data, or are there copyright or privacy concerns? But simply moving the data wasn't enough.
Data storage has been evolving, from databases to data warehouses and expansive data lakes, with each architecture responding to different business and data needs. Traditional databases excelled at structured data and transactional workloads but struggled with performance at scale as data volumes grew.
Introduction Data normalization is the process of building a database according to what is known as a canonical form, where the final product is a relational database with no data redundancy. More specifically, normalization involves organizing data according to attributes assigned as part of a larger data model.
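To make the idea concrete, here is a minimal sketch (not from the article itself) using Python's built-in sqlite3, with hypothetical orders_flat, customers, and orders tables: the repeated customer attributes are moved into their own table and referenced by key.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Unnormalized: customer details repeat on every order row (redundancy).
cur.execute("""
    CREATE TABLE orders_flat (
        order_id INTEGER PRIMARY KEY,
        customer_name TEXT,
        customer_email TEXT,
        product TEXT
    )
""")

# Normalized: customer attributes live in exactly one place, and orders
# reference them by key, eliminating the redundancy.
cur.execute("""
    CREATE TABLE customers (
        customer_id INTEGER PRIMARY KEY,
        name TEXT,
        email TEXT UNIQUE
    )
""")
cur.execute("""
    CREATE TABLE orders (
        order_id INTEGER PRIMARY KEY,
        customer_id INTEGER REFERENCES customers(customer_id),
        product TEXT
    )
""")
conn.commit()
```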
Introduction Structured Query Language is a powerful language to manage and manipulate data stored in databases. SQL is widely used in the field of data science and is considered an essential skill to have if you work with data.
Introduction In this constantly growing technical era, big data is at its peak, and there is a need for a tool to import and export data between RDBMS and Hadoop. Apache Sqoop stands for "SQL to Hadoop," and is one such tool that transfers data between Hadoop (Hive, HBase, HDFS, etc.) and relational databases.
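As a rough illustration of how a Sqoop transfer is typically invoked, here is a hedged Python sketch that shells out to the sqoop CLI; the connection string, credentials, table, and HDFS path are placeholders, and Sqoop must already be installed on the host.

```python
import subprocess

# Hypothetical connection details for the source RDBMS and HDFS target.
cmd = [
    "sqoop", "import",
    "--connect", "jdbc:mysql://dbhost/shop",  # source database
    "--username", "etl",
    "--table", "orders",                      # table to copy into Hadoop
    "--target-dir", "/user/etl/orders",       # HDFS destination directory
    "--num-mappers", "4",                     # parallel map tasks
]
subprocess.run(cmd, check=True)
```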
It’s easy these days for an organization’s data infrastructure to begin looking like a maze, with an accumulation of point solutions here and there. Snowflake is committed to helping customers simplify how they architect their data infrastructure by continually adding features. Here’s a closer look.
Introduction SQL is a database programming language created for managing and retrieving data from relational databases like MySQL, Oracle, and SQL Server. SQL (Structured Query Language) is the common language for all databases. In other terms, SQL is a language that communicates with databases.
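A small, self-contained illustration of that communication (using Python's sqlite3 for portability; the users table and its rows are invented for the example):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, country TEXT)")
cur.executemany(
    "INSERT INTO users (name, country) VALUES (?, ?)",
    [("Ada", "UK"), ("Grace", "US")],
)

# The same SELECT syntax works, with minor dialect differences,
# against MySQL, Oracle, and SQL Server.
for row in cur.execute("SELECT name FROM users WHERE country = ?", ("UK",)):
    print(row[0])  # -> Ada
```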
Anyone who’s been roaming around the forest of Data Engineering has probably run into many of the newish tools that have been growing rapidly around the concepts of Data Warehouses, Data Lakes, and Lake Houses … the merging of the old relational database functionality with TB and PB level cloud-based file storage systems.
Relational databases like Postgres have been the backbone of enterprise data management for years. However, as data volumes grow and the need for flexibility, scalability, and advanced analytics increases, modern solutions like Apache Iceberg are becoming essential.
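As a hedged sketch of what such a target looks like in practice, here is a minimal PySpark session that creates and queries an Iceberg table; the catalog name, warehouse path, and table schema are assumptions for this example, and the iceberg-spark-runtime jar must be on the classpath.

```python
from pyspark.sql import SparkSession

# "local" is an arbitrary catalog name and /tmp/iceberg-warehouse an
# arbitrary location chosen for this sketch.
spark = (
    SparkSession.builder
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.local", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.local.type", "hadoop")
    .config("spark.sql.catalog.local.warehouse", "/tmp/iceberg-warehouse")
    .getOrCreate()
)

spark.sql("CREATE TABLE local.db.events (id BIGINT, ts TIMESTAMP) USING iceberg")
spark.sql("INSERT INTO local.db.events VALUES (1, current_timestamp())")
spark.sql("SELECT * FROM local.db.events").show()
```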
Big Data has become the dominant innovation in all high-performing companies. Notable businesses today focus their decision-making capabilities on knowledge gained from the study of big data. Big Data gives you a competitive advantage, and that is as true for businesses as it is for professionals working in the area of analytics.
Business transactions captured in relational databases are critical to understanding the state of business operations. Since the value of data quickly drops over time, organizations need a way to analyze data as it is generated. What is Change Data Capture?
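One simple way to see the idea is trigger-based capture, sketched below with Python's sqlite3; production CDC tools usually read the database's transaction log instead, and the accounts tables here are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
    CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance REAL);
    CREATE TABLE accounts_changes (
        id INTEGER, old_balance REAL, new_balance REAL,
        changed_at TEXT DEFAULT CURRENT_TIMESTAMP
    );
    -- Every update is captured as a change event for downstream analysis.
    CREATE TRIGGER accounts_cdc AFTER UPDATE ON accounts
    BEGIN
        INSERT INTO accounts_changes (id, old_balance, new_balance)
        VALUES (OLD.id, OLD.balance, NEW.balance);
    END;
""")
cur.execute("INSERT INTO accounts (id, balance) VALUES (1, 100.0)")
cur.execute("UPDATE accounts SET balance = 75.0 WHERE id = 1")
print(cur.execute("SELECT * FROM accounts_changes").fetchall())
```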
My personal take on justifying the existence of Data Mesh: a senior stakeholder at one of my projects mentioned that they wanted to decentralise their data platform architecture and democratise data across the organisation. When I heard the words ‘decentralised data architecture’, I was left utterly confused at first!
The world we live in today presents larger datasets, more complex data, and diverse needs, all of which call for efficient, scalable data systems. Open Table Format (OTF) architecture now provides a solution for efficient data storage, management, and processing while ensuring compatibility across different platforms.
Companies can now make data useful to elevate decision making and to optimise products and processes. It's currently easy to acquire data strategically. If you ever have to explain to friends or colleagues why data capabilities are crucial to navigating the future of work and innovation, try this storytelling tactic.
Experience Enterprise-Grade Apache Airflow: Astro augments Airflow with enterprise-grade features to enhance productivity, meet scalability and availability demands across your data pipelines, and more. Databricks and Snowflake offer a data warehouse on top of cloud providers like AWS, Google Cloud, and Azure.
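For orientation, a minimal Airflow pipeline might look like the following hedged sketch (Airflow 2.x is assumed; the DAG and task names are illustrative, not from any referenced project):

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pulling rows from the source system")

def load():
    print("writing rows to the warehouse")

# A hypothetical two-step daily pipeline.
with DAG(
    dag_id="example_etl",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",  # "schedule" requires Airflow 2.4+
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)
    extract_task >> load_task  # run extract before load
```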
Relational databases like Oracle have been the backbone of enterprise data management for years. However, as data volumes grow and the need for flexibility, scalability, and advanced analytics increases, modern solutions like Apache Iceberg are becoming essential.
What will the next important category of databases look like? For decades, relational databases were the undisputed home of data. They powered everything: from websites to analytics, from customer data […].
RDS AWS RDS is a managed service provided by AWS to run a relational database. We will see how to set up a Postgres instance using AWS RDS. Log in to your AWS account. Go to Services -> RDS. Click on Create Database. In the Create Database prompt, choose the Standard Create option with PostgreSQL as the engine type.
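The same steps can be scripted; here is a hedged boto3 sketch that mirrors the console's Standard Create flow, with a placeholder identifier, credentials, and region:

```python
import boto3

rds = boto3.client("rds", region_name="us-east-1")  # region is an assumption

# Equivalent to "Standard Create" with the PostgreSQL engine; the
# identifier, instance class, and credentials are placeholders.
rds.create_db_instance(
    DBInstanceIdentifier="demo-postgres",
    Engine="postgres",
    DBInstanceClass="db.t3.micro",
    AllocatedStorage=20,  # GiB
    MasterUsername="demo_admin",
    MasterUserPassword="change-me-please",
)
```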
Data can be your organization’s most valuable asset, but only if it’s data you can trust. When companies work with data that is untrustworthy for any reason, it can result in incorrect insights, skewed analysis, and reckless recommendations. That is where the distinction between data integrity and data quality comes in.
Summary Building a data platform is a complex journey that requires a significant amount of planning to do well. In this episode Tobias Macey, the host of the show, reflects on his plans for building a data platform and what he has learned from running the podcast that is influencing his choices.
How to use Kafka Streams to aggregate change data capture (CDC) messages from a relational database into transactional messages, powering a scalable microservices architecture.
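Kafka Streams itself is a Java library; as a rough Python stand-in for the same aggregation idea, the following sketch with confluent_kafka consumes hypothetical CDC events from an invented orders-cdc topic and keeps a running total per customer. The message shape (an "after" row image with customer_id and amount) is an assumption.

```python
import json
from collections import defaultdict

from confluent_kafka import Consumer

consumer = Consumer({
    "bootstrap.servers": "localhost:9092",  # placeholder broker address
    "group.id": "cdc-aggregator",
    "auto.offset.reset": "earliest",
})
consumer.subscribe(["orders-cdc"])

totals = defaultdict(float)  # running total per customer id
while True:
    msg = consumer.poll(1.0)
    if msg is None or msg.error():
        continue
    change = json.loads(msg.value())  # one CDC event per row change
    after = change["after"]           # post-change row image (assumed shape)
    totals[after["customer_id"]] += after["amount"]
```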
Summary Data warehouses have gone through many transformations, from standard relational databases on powerful hardware, to column-oriented storage engines, to the current generation of cloud-native analytical engines. How does it compare to the other available platforms for data warehousing?
Traditional relational database systems are ubiquitous in software systems. They are surrounded by a strong ecosystem of tools, such as object-relational mappers and schema migration helpers. Today’s businesses, however, want to process ever-increasing amounts of data. All of these are enforced by relational databases.
PostgreSQL is one of the most popular open-source choices for relational databases. It is loved by engineers for its powerful features, flexibility, efficient data retrieval mechanism, and, above all, its overall performance. There are several […]
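A quick taste of working with Postgres from Python; the connection parameters and the notes table are placeholders for this sketch, which assumes a local instance is running.

```python
import psycopg2

# Placeholder DSN for a local Postgres instance.
conn = psycopg2.connect("dbname=demo user=demo password=demo host=localhost")
with conn, conn.cursor() as cur:
    cur.execute(
        "CREATE TABLE IF NOT EXISTS notes (id SERIAL PRIMARY KEY, body TEXT)"
    )
    # RETURNING is one of the conveniences engineers tend to love:
    # insert and read back the generated key in a single round trip.
    cur.execute("INSERT INTO notes (body) VALUES (%s) RETURNING id", ("hello",))
    print(cur.fetchone()[0])
```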
Introduction A Data Engineer is responsible for managing the flow of data used to make better business decisions. A solid understanding of relational databases and the SQL language is a must-have skill, as is the ability to manipulate large amounts of data effectively. What is a data warehouse?
Hadoop and Spark are the two most popular platforms for Big Data processing. They both enable you to deal with huge collections of data no matter its format — from Excel tables to user feedback on websites to images and video files. Which Big Data tasks does Spark solve most effectively? How does it work?
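For a flavor of how Spark code reads in practice, here is a short PySpark sketch; the feedback.json file and its product/rating fields are invented for the example.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("demo").getOrCreate()

# Hypothetical user-feedback file; Spark can also read CSV, Parquet,
# ORC, and binary files through the same DataFrame API.
df = spark.read.json("feedback.json")

# Average rating per product, highest first.
(df.groupBy("product")
   .agg(F.avg("rating").alias("avg_rating"))
   .orderBy(F.desc("avg_rating"))
   .show())
```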
The journey toward achieving a robust data platform that secures all your data in one place can seem like a daunting one. But at Snowflake, we’re committed to making the first step the easiest — with seamless, cost-effective data ingestion to help bring your workloads into the AI Data Cloud with ease.
Summary Database indexes are critical to ensure fast lookups of your data, but they are inherently tied to the database engine. Pilosa is rewriting that equation by providing a flexible, scalable, performant engine for building an index of your data to enable high-speed aggregate analysis.
Summary Data observability is a term that has been co-opted by numerous vendors with varying ideas of what it should mean.
Summary Finding connections between data and the entities that they represent is a complex problem. Graph data models and the applications built on top of them are perfect for representing relationships and finding emergent structures in your information. If you hand a book to a new data engineer, what wisdom would you add to it?
TL;DR After setting up and organizing the teams, we describe 4 topics to make data mesh a reality. How do we build data products? How can we interoperate between the data domains?
As a business grows, the demand to efficiently handle and process the exponentially growing data also rises. A popular open-source relational database used by several organizations across the world is PostgreSQL.
Summary A large fraction of data engineering work involves moving data from one storage location to another in order to support different access and query patterns. Singlestore aims to cut down on the number of database engines that you need to run so that you can reduce the amount of copying that is required.
For more than 40 years, relational databases have been managed and modified using the programming language SQL (Structured Query Language). Given that it lets organizations efficiently store, retrieve, and analyze massive volumes of data, it has become an essential tool in their daily operations.
As long as there is ‘data’ in data scientist, Structured Query Language (or see-quel as we call it) will remain an important part of it. In this blog, let us explore data science and its relationship with SQL.
Data science is a field of study that works with large amounts of facts and uses cutting-edge tools and methods to uncover hidden patterns, extract useful data, and inform business decisions. Data scientists use complex machine learning techniques to develop prediction models. Why Should You Learn Data Science?
Introduction Data Analytics is an extremely important field in today’s business world, and it will only become more so as time goes on. By 2023, Data Analytics is projected to be worth USD 240.56 Data Analyst interview questions are very competitive and difficult. How to track changes in databases?
What is Cloudera Operational Database (COD)? Operational Database is a relational and non-relational database built on Apache HBase and is designed to support OLTP applications, which use big data. The operational database in Cloudera Data Platform has the following components:
Lior drove straight into the three reasons why data downtime happens: the three root causes of data downtime. For us, data quality monitoring is just the first step, something we all must move beyond.
Hum is harnessing frontier AI to transform content and audience data into actionable insights and personalized experiences. What problem does Hum aim to solve, and how are you using data to address the issue? To do that, they need rich data and powerful AI. Hum’s fast data store is built on Elasticsearch.
Being a data scientist means constantly growing, enabling businesses to become more data-driven, and learning newer trends and tools. There are various excellent resources in data science that can help you to develop your skillset. So, having the right knowledge of tools and technology is important for handling such data.