Database and SQL - Data Engineering Digest

How to Normalize Relational Databases With SQL Code?

Analytics Vidhya

FEBRUARY 27, 2023

The database is the major element of a data science project. To generate actionable insights, the database must be centralized and organized efficiently. If a corrupted, unorganized, or redundant database is used, the results of the analysis may become inconsistent and highly misleading.

Relational Database

Relational Database Database SQL Coding

SQL Injection: The Cyber Attack Hiding in Your Database

Analytics Vidhya

FEBRUARY 2, 2023

Introduction SQL injection is an attack in which a malicious user inserts arbitrary SQL code into a web application’s query, allowing them to gain unauthorized access to a database. An attacker can use this access to steal sensitive information or make unauthorized changes to the data stored in the database.
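To make the attack described in this excerpt concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The `users` table, its rows, and both function names are hypothetical and exist only for illustration; the excerpt itself does not prescribe any particular fix, and parameter binding is shown here as one commonly used defence.

```python
import sqlite3

# Hypothetical in-memory table used only for this illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT, secret TEXT)")
conn.executemany(
    "INSERT INTO users VALUES (?, ?)",
    [("alice", "a-secret"), ("bob", "b-secret")],
)


def find_user_unsafe(name):
    # Vulnerable: user input is pasted directly into the SQL text.
    query = f"SELECT name, secret FROM users WHERE name = '{name}'"
    return conn.execute(query).fetchall()


def find_user_safe(name):
    # Safer: the driver binds the value, so it is never parsed as SQL.
    return conn.execute(
        "SELECT name, secret FROM users WHERE name = ?", (name,)
    ).fetchall()


payload = "x' OR '1'='1"
leaked = find_user_unsafe(payload)  # the injected OR clause matches every row
blocked = find_user_safe(payload)   # no user has this literal name
```

With the string-built query, the payload turns the `WHERE` clause into a condition that is always true, so every row comes back. With the bound parameter, the same payload is treated as an ordinary (non-matching) name.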

Database

Database SQL Coding Accessible

MSSQL vs MySQL: Comparing Powerhouses of Databases

Analytics Vidhya

AUGUST 30, 2023

Introduction In the bustling arena of database management systems, two heavyweight contenders emerge, each carrying its arsenal of features and capabilities. In one corner, we have the suave and sophisticated Microsoft SQL Server (MSSQL), donned in the elegance of enterprise-level prowess.

MySQL

MySQL Database SQL Systems

Step-by-Step Roadmap to Learn SQL in 2023

Analytics Vidhya

FEBRUARY 28, 2023

Introduction Structured Query Language is a powerful language to manage and manipulate data stored in databases. SQL is widely used in the field of data science and is considered an essential skill to have if you work with data.

SQL

SQL Relational Database Data Science Database

Mirroring SQL Server Database to Microsoft Fabric

Striim

NOVEMBER 19, 2024

SQL2Fabric Mirroring is a new, fully managed service offered by Striim to mirror on-premises SQL Server databases. It’s a collaborative service between Striim and Microsoft, based on Fabric Open Mirroring, that enables real-time data replication from on-premises SQL Server databases to Fabric OneLake.

SQL

SQL Database Data Warehouse Data Pipeline

Introduction to Databases with SQL: Free Harvard Course

KDnuggets

OCTOBER 20, 2023

Want to learn SQL the Harvard way? Start learning today with CS50 SQL, a free course on databases with SQL from Harvard.

SQL

SQL Database

5 Free University Courses to Learn Databases and SQL

KDnuggets

MARCH 5, 2024

Looking to learn SQL and databases to level up your data science skills? Learn SQL, database internals, and much more with these free university courses.

SQL

SQL Database Data Science Data

Understanding the Basics of Database Normalization

Analytics Vidhya

MARCH 2, 2023

Introduction Data normalization is the process of building a database according to what is known as a canonical form, where the final product is a relational database with no data redundancy. More specifically, normalization involves organizing data according to attributes assigned as part of a larger data model.
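As a small illustration of removing redundancy, the sketch below splits a flat table into two related tables using Python's built-in `sqlite3` module. The table names, columns, and rows are all hypothetical, and this shows only one simple decomposition, not a full walk through the normal forms.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Hypothetical flat table: the customer's city is repeated on every order.
CREATE TABLE orders_flat (order_id INTEGER, customer TEXT, city TEXT, item TEXT);
INSERT INTO orders_flat VALUES
  (1, 'Ana', 'Lisbon', 'lamp'),
  (2, 'Ana', 'Lisbon', 'desk'),
  (3, 'Raj', 'Pune', 'chair');

-- Normalized layout: each customer is stored once and referenced by key.
CREATE TABLE customers (
  customer_id INTEGER PRIMARY KEY,
  name TEXT UNIQUE,
  city TEXT
);
CREATE TABLE orders (
  order_id INTEGER PRIMARY KEY,
  customer_id INTEGER REFERENCES customers(customer_id),
  item TEXT
);

INSERT INTO customers (name, city)
  SELECT DISTINCT customer, city FROM orders_flat;
INSERT INTO orders (order_id, customer_id, item)
  SELECT f.order_id, c.customer_id, f.item
  FROM orders_flat AS f JOIN customers AS c ON c.name = f.customer;
""")

customer_count = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]

# A join rebuilds the original rows, so no information was lost.
rebuilt = conn.execute("""
  SELECT o.order_id, c.name, c.city, o.item
  FROM orders AS o JOIN customers AS c USING (customer_id)
  ORDER BY o.order_id
""").fetchall()
```

After the split, a change to a customer's city touches one row in `customers` instead of every matching order.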

Database

Database Relational Database Building Process

Surveying The Market Of Database Products

Data Engineering Podcast

OCTOBER 29, 2023

Summary Databases are the core of most applications, whether transactional or analytical. In recent years the selection of database products has exploded, making the critical decision of which engine(s) to use even more difficult. What are the aspects of the database market that keep you interested as a VP of product?

Database

Database SQL BI Machine Learning

7 Modern SQL Database you Must Know in 2024

KDnuggets

JUNE 28, 2024

Explore the world of modern databases that are fast, secure, and cost-efficient, designed to tackle large-scale and diverse data challenges.

Database

Database SQL Designing Data

Back to Basics Week 2: Database, SQL, Data Management and Statistical Concepts

KDnuggets

NOVEMBER 13, 2023

This week, we delve into the vital world of Databases, SQL, Data Management, and Statistical Concepts in Data Science. Welcome back to Week 2 of KDnuggets’ "Back to Basics" series.

Database

Database SQL Data Management Management

SQL vs NoSQL: 7 Key Takeaways

KDnuggets

SEPTEMBER 5, 2022

People assume that NoSQL is a counterpart to SQL. Instead, it’s a different type of database designed for use-cases where SQL is not ideal. The differences between the two are many, although some are so crucial that they define both databases at their cores.

NoSQL

NoSQL SQL Database Design Database

The Three Levels of SQL Comprehension: What they are and why you need to know about them

dbt Developer Hub

JANUARY 22, 2025

The main thing I knew going in was "SDF understands SQL". For the next era of Analytics Engineering to be as transformative as the last, dbt needs to move beyond being a string preprocessor and into fully comprehending SQL. Today we're going to dig into what SQL comprehension actually means, since it's so critical to what comes next.

SQL

SQL Database Coding Technology

KDnuggets News, September 13: Getting Started with SQL in 5 Steps • Introduction to Databases in Data Science

KDnuggets

SEPTEMBER 13, 2023

Getting Started with SQL in 5 Steps • Introduction to Databases in Data Science • Time 100 AI: The Most Influential?

Data Science

Data Science Database SQL Data

10 GitHub Repositories to Master SQL

KDnuggets

JUNE 10, 2024

Learn SQL and databases through free courses, tutorials, tools, guides, books, practice exercises, projects, awesome lists, and other resources.

SQL

SQL Database Project

Using SQL with Python: SQLAlchemy and Pandas

KDnuggets

JUNE 12, 2024

A simple tutorial on how to connect to databases, execute SQL queries, and analyze and visualize data.

SQL

SQL Python Database Data

SQL Notes for Professionals: The Free eBook Review

KDnuggets

MAY 5, 2022

The free book is a combination of SQL cheat sheets and practical database examples. It provides bite-size information about every SQL function and attribute, with coding samples.

SQL

SQL Database Coding IT

Designing A Non-Relational Database Engine

Data Engineering Podcast

APRIL 14, 2024

Summary Databases come in a variety of formats for different use cases. The default association with the term "database" is relational engines, but non-relational engines are also used quite widely. Can you describe what constitutes a NoSQL database?

Non-relational Database

Non-relational Database Relational Database Database Designing

How to Write SQL in Native Python

KDnuggets

FEBRUARY 1, 2022

If the idea of being able to link with SQL databases and define, manipulate, and query using Python sounds appealing, check out the SQLModel library.

SQL

SQL Python Database

Tackling Real Time Streaming Data With SQL Using RisingWave

Data Engineering Podcast

FEBRUARY 4, 2024

Summary Stream processing systems have long been built with a code-first design, adding SQL as a layer on top of the existing framework. RisingWave is a database engine that was created specifically for stream processing, with S3 as the storage layer. Can you describe what RisingWave is and the story behind it?

SQL

SQL Data Lake High Quality Data Machine Learning

Reconciling The Data In Your Databases With Datafold

Data Engineering Podcast

MARCH 17, 2024

Summary A significant portion of data workflows involve storing and processing information in database engines. Validating that the information is stored and processed correctly can be complex and time-consuming, especially when the source and destination speak different dialects of SQL.

Database

Database Data Lake High Quality Data Data Workflow

Key-Value Databases, Explained

KDnuggets

OCTOBER 4, 2022

Among the four big NoSQL database types, key-value stores are probably the most popular due to their simplicity and fast performance. Let’s further explore how key-value stores work and what their practical uses are.

Database

Database NoSQL SQL

A Beginner’s Guide to ClickHouse Database

KDnuggets

SEPTEMBER 13, 2024

Learn how to install ClickHouse DBMS, create a database, and run SQL queries using native and Python clients.

Database

Database SQL Python Data Engineer

Why SQL is THE Language to Learn for Data Science

KDnuggets

OCTOBER 12, 2023

SQL is the essential data science language due to its universal database accessibility, efficient data cleaning capabilities, seamless integration with other languages, and requirement for most data science jobs.

Data Science

Data Science SQL Database Data

Find Out About The Technology Behind The Latest PFAD In Analytical Database Development

Data Engineering Podcast

FEBRUARY 25, 2024

Summary Building a database engine requires a substantial amount of engineering effort and time investment. In this episode he explains how he used the combination of Apache Arrow, Flight, Datafusion, and Parquet to lay the foundation of the newest version of his time-series database.

Database

Database Technology Data Lake High Quality Data

Building An Internal Database As A Service Platform At Cloudflare

Data Engineering Podcast

AUGUST 27, 2023

In the era of the cloud most developers rely on hosted services to manage their databases, but what if you are a cloud service? Why Postgres?

Database

Database Building PostgreSQL BI

Getting Started with Graph Database Queries, with Cheat Sheet!

KDnuggets

NOVEMBER 6, 2023

Graph databases are quickly becoming a core part of the analytics toolset for enterprise IT organizations. If you know SQL, you can easily learn Cypher and open up a huge opportunity for data analysis.

Database

Database SQL Data Analysis Data

Introduction to Databases in Data Science

KDnuggets

SEPTEMBER 8, 2023

Understand the relevance of databases in data science. Also learn the fundamentals of relational databases, NoSQL database categories, and more.

Database

Database Data Science NoSQL Relational Database

Interesting startup idea: benchmarking cloud platform pricing

The Pragmatic Engineer

OCTOBER 17, 2024

The current database includes 2,000 server types in 130 regions and 340 zones. Results are stored in git and their database, together with benchmarking metadata. Databases: SQLite files, used to publish data; DuckDB, to query these files in the public APIs; CockroachDB, used to collect and store historical data.

Cloud

Cloud AWS Metadata Cloud Computing

Troubleshooting Kafka In Production

Data Engineering Podcast

DECEMBER 24, 2023

Kafka

Kafka Data Lake High Quality Data SQL

How Meta discovers data flows via lineage at scale

Engineering at Meta

JANUARY 22, 2025

Data lineage refers to the process of tracing the journey of data as it moves through various systems, illustrating how data transitions from one data asset, such as a database table (the source asset), to another (the sink asset). In this blog, we will delve into an early stage in PAI implementation: data lineage. Hack, C++, Python, etc.)

Data Warehouse

Data Warehouse SQL Programming Language Data

dbt multi-project collaboration

Christophe Blefari

OCTOBER 19, 2023

With dbt, you can apply software engineering practices to SQL development. Managing your SQL patrimony has never been easier. So, yes, dbt is cool but there is a common pattern with it: you accumulate SQL queries. Fast forward to 2 years later, you find yourself with hundreds or thousands of SQL queries. See the doc.

Project

Project Finance SQL Government

An Overview Of The State Of Data Orchestration In An Increasingly Complex Data Ecosystem

Data Engineering Podcast

SEPTEMBER 10, 2023

BI

BI SQL Machine Learning Data

What Is the Difference Between SQL and Object-Relational Mapping (ORM)?

KDnuggets

FEBRUARY 24, 2022

Object-relational mapping, or ORM, is a technique that allows you to interact with databases using the object-oriented paradigm of the programming language of your choosing. How is that different from structured query language, though, and when do you use them?
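The contrast in this excerpt can be sketched with the standard library alone. The example below is a hand-rolled miniature, not a real ORM library: the `Book` class, the `books` table, and the `books_after` helper are hypothetical names chosen for illustration. It shows the same query once as raw SQL returning tuples and once behind a function that returns objects.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class Book:
    # Hypothetical model class; each instance mirrors one row of `books`.
    id: int
    title: str
    year: int


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE books (id INTEGER PRIMARY KEY, title TEXT, year INTEGER)")
conn.executemany(
    "INSERT INTO books (title, year) VALUES (?, ?)",
    [("Dune", 1965), ("Neuromancer", 1984)],
)

# Plain SQL: the caller writes the query text and receives tuples.
rows = conn.execute(
    "SELECT id, title, year FROM books WHERE year > ? ORDER BY id", (1970,)
).fetchall()


def books_after(year):
    # ORM-style: the same query hidden behind a function that returns objects.
    cursor = conn.execute(
        "SELECT id, title, year FROM books WHERE year > ? ORDER BY id", (year,)
    )
    return [Book(*row) for row in cursor]


recent = books_after(1970)
```

The object version lets calling code use `recent[0].title` instead of positional tuple indexes, at the cost of an extra mapping layer. Full ORM libraries generate the SQL as well, which this sketch does not attempt.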

SQL

SQL Programming Language Database Programming

Accelerate AI Development with Snowflake

Snowflake

NOVEMBER 11, 2024

However, scaling LLM data processing to millions of records can pose data transfer and orchestration challenges, easily addressed by the user-friendly SQL functions in Snowflake Cortex. Traditionally, SQL has been limited to structured data neatly organized in tables.

Unstructured Data

Unstructured Data SQL AWS Healthcare

Data News — Week 25.02

Christophe Blefari

JANUARY 11, 2025

Materialization of data warehouse layers: what to consider for each materialisation you could pick in your data warehouse layers, such as views, tables, and schemas vs. databases. The best code is the code you never wrote: every line of code is a form of debt, a liability that must be maintained and understood.

Data

Data Data Warehouse Coding Programming Language

How to get started with dbt

Christophe Blefari

MARCH 1, 2023

dbt Core is an open-source framework that helps you organise data warehouse SQL transformations. In simple words, dbt sits on top of your raw data to organise all the SQL queries that define your data assets. A macro is a Jinja function that either does something or returns SQL or partial SQL code.

Data Warehouse

Data Warehouse SQL Metadata Raw Data

Data News — Week 24.11

Christophe Blefari

MARCH 15, 2024

With yato you give a folder with SQL queries and it guesses the DAG and runs the queries in the right order. BigQuery supports DELETE to delete partitions in a SQL query. I'd like to do a bit of user research about yato, if you consider using it drop me a message please. Give a lot of insights on the market.
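The idea of inferring a DAG from a set of SQL queries and running them in dependency order can be sketched as follows. This is not yato's implementation: the query names, the regex-based table detection, and the `run_order` helper are all assumptions made for illustration, and a real tool would use a proper SQL parser.

```python
import re
from graphlib import TopologicalSorter

# Hypothetical folder contents: model name -> SQL text.
queries = {
    "daily_revenue": "SELECT day, SUM(amount) AS revenue FROM clean_orders GROUP BY day",
    "clean_orders": "SELECT * FROM raw_orders WHERE amount > 0",
    "top_days": "SELECT * FROM daily_revenue ORDER BY revenue DESC LIMIT 10",
}


def referenced_tables(sql):
    # Naive detection: identifiers that follow FROM or JOIN.
    pattern = r"\b(?:FROM|JOIN)\s+([A-Za-z_][A-Za-z0-9_]*)"
    return set(re.findall(pattern, sql, re.IGNORECASE))


def run_order(queries):
    # Keep only dependencies that are themselves models in the folder;
    # external sources such as raw_orders need no ordering.
    graph = {
        name: referenced_tables(sql) & set(queries)
        for name, sql in queries.items()
    }
    # TopologicalSorter takes a mapping of node -> predecessors.
    return list(TopologicalSorter(graph).static_order())


order = run_order(queries)
```

Each query is placed after every query it reads from, so executing them in `order` never reads a table before it has been built. `TopologicalSorter` raises `CycleError` if the queries depend on each other circularly.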

Metadata

Metadata Data Data Warehouse Software Engineer

Building Linked Data Products With JSON-LD

Data Engineering Podcast

SEPTEMBER 17, 2023

Building

Building SQL BI Python

OLTP Vs OLAP – What Is The Difference

Seattle Data Guy

MAY 8, 2023

Adding databases like MongoDB and CassandraDB only makes matters worse, since they’re not friendly to SQL, the language most analysts and data practitioners are used to. If you’re relying on your OLTP system to provide analytics, you might be in for a surprise.

MongoDB

MongoDB SQL Database Designing

A Deep Dive into Data Replication: Most Effective Way to Protect Your Data

Analytics Vidhya

FEBRUARY 22, 2023

Introduction Data replication, also known as database replication, is the copying of data to ensure that all information remains consistent across all data resources in real time. Data replication is like a safety net that keeps your information safe from disappearing or falling through the cracks. In most cases, data changes.

Database

Database Data NoSQL Datasets
