10 GitHub Repositories to Master SQL
KDnuggets
JUNE 10, 2024
Learn SQL and databases through free courses, tutorials, tools, guides, books, practice exercises, projects, awesome lists, and other resources.
This site uses cookies to improve your experience. To help us insure we adhere to various privacy regulations, please select your country/region of residence. If you do not select a country, we will assume you are from the United States. Select your Cookie Settings or view our Privacy Policy and Terms of Use.
Cookies and similar technologies are used on this website for proper function of the website, for tracking performance analytics and for marketing purposes. We and some of our third-party providers may use cookie data for various purposes. Please review the cookie settings below and choose your preference.
Used for the proper function of the website
Used for monitoring website traffic and interactions
Cookies and similar technologies are used on this website for proper function of the website, for tracking performance analytics and for marketing purposes. We and some of our third-party providers may use cookie data for various purposes. Please review the cookie settings below and choose your preference.
KDnuggets
JUNE 10, 2024
Learn SQL and databases through free courses, tutorials, tools, guides, books, practice exercises, projects, awesome lists, and other resources.
KDnuggets
DECEMBER 28, 2023
Discover a collection of best books to start your data career or master a new skill, all for free!
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Agent Tooling: Connecting AI to Your Tools, Systems & Data
How to Modernize Manufacturing Without Losing Control
Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration
KDnuggets
OCTOBER 31, 2023
Use this knowledge to upskill yourselves.
Data Engineering Podcast
DECEMBER 24, 2023
Elad Eldor has experienced these challenges first-hand, leading to his work writing the book "Kafka: : Troubleshooting in Production" In this episode he highlights the sources of complexity that contribute to Kafka's operational difficulties, and some of the main ways to identify and mitigate potential sources of trouble.
Start Data Engineering
DECEMBER 5, 2020
Analytical Data Processing in SQL (Click here)
KDnuggets
APRIL 27, 2022
A Brief Introduction to Papers With Code; Machine Learning Books You Need To Read In 2022; Building a Scalable ETL with SQL + Python; 7 Steps to Mastering SQL for Data Science; Top Data Science Projects to Build Your Skills.
Towards Data Science
MARCH 29, 2023
In this article, I want to focus on my on-again, off-again relationship with books and reading. I burned out spectacularly about a year into a PhD program, and my relationship with books ended for quite some time. Even if you haven’t read any of the books below, you’ve probably at least heard of some of them.
Data Engineering Podcast
NOVEMBER 19, 2023
RudderStack Profiles takes the SaaS guesswork and SQL grunt work out of building complete customer profiles so you can quickly ship actionable, enriched data to every downstream team. It’s the only true SQL streaming database built from the ground up to meet the needs of modern data products. Introducing RudderStack Profiles.
Data Engineering Podcast
JULY 16, 2023
In his recent book "Datapreneurs" he reflects on the people and businesses that he has known and worked with and how they relied on data to deliver valuable services and drive meaningful change. What are the most interesting, unexpected, or challenging lessons that you have learned while working on the Datapreneurs book?
Data Engineering Podcast
SEPTEMBER 7, 2020
If you hand a book to a new data engineer, what wisdom would you add to it? Your host is Tobias Macey and today I’m interviewing Martin Traverso about PrestoSQL, a distributed SQL engine that queries data in place Interview Introduction How did you get involved in the area of data management?
Data Engineering Weekly
FEBRUARY 9, 2025
Airbnb restricted the range of booking probabilities for map pins, which led to significant booking improvements. Further iterations included tiered map pins and a map re-centering algorithm based on booking probabilities. link] Fernando Borretti: Composable SQL One of the biggest challenges in SQL is the unit testing.
Cloudera
MAY 18, 2021
Flink SQL is a data processing language that enables rapid prototyping and development of event-driven and streaming applications. Flink SQL combines the performance and scalability of Apache Flink, a popular distributed streaming platform, with the simplicity and accessibility of SQL. You can view the code here.
Data Engineering Weekly
NOVEMBER 24, 2024
[link] JBarti: Write Manageable Queries With The BigQuery Pipe Syntax Our quest to simplify SQL is always an adventure. The blog narrates a few examples of Pipe Syntax in comparison with the SQL queries. BigQuery's pipe syntax seems exciting to watch, and it is an interesting approach to how it gets adopted.
Data Engineering Podcast
APRIL 14, 2024
For data engineers who battle to build and scale high quality data workflows on the data lake, Starburst powers petabyte-scale SQL analytics fast, at a fraction of the cost of traditional methods, so that you can meet all your data needs ranging from AI to data applications to complete analytics. Your first 30 days are free!
The Pragmatic Engineer
JUNE 13, 2023
Agoda is a leading online travel booking platform in Asia. It’s owned by Booking Holdings Inc, which also owns the popular travel sites, Kayak and Booking.com. For transactional databases, it’s mostly the Microsoft SQL Server, but also other databases like PostgreSQL, ScyllaDB and Couchbase.
Data Engineering Podcast
DECEMBER 26, 2021
By acting as a virtual hub for data assets ranging from tables and dashboards to SQL snippets & code, Atlan enables teams to create a single source of truth for all their data assets, and collaborate across the modern data stack through deep integrations with tools like Snowflake, Slack, Looker and more.
KDnuggets
JANUARY 15, 2020
This week: learn the 5 must-have data science skills for the new year; find out which book is THE book to get started learning machine learning; pick up some Python tips and tricks; learn SQL, but learn it the hard way; and find an introductory guide to learning common NLP techniques.
databricks
FEBRUARY 25, 2025
As North America's largest book. In today's fast-paced business environment, the ability to quickly access and analyze data is crucial for maintaining a competitive edge.
Knowledge Hut
JUNE 30, 2023
Whether you're a beginner looking to dive into the foundations or an experienced practitioner seeking advanced techniques, the right books can be your guiding light. Books on data engineering serve as essential resources to guide you through the vast terrain of data engineering. What is Data Engineering?
Data Engineering Weekly
MARCH 30, 2025
Airbnb defines and estimates three types of LTV: Baseline (total bookings over 365 days, predicted using machine learning), Incremental (baseline LTV adjusted for cannibalization from other listings, estimated via a production function) Marketing-induced incremental (additional LTV generated by Airbnb initiatives). Sonnet for assistance.
Knowledge Hut
DECEMBER 26, 2023
Budding aspirants and students are constantly looking for reliable data science s, research material, and the top data science books to kickstart their careers in this field. Be it as a beginner or an experienced learner; you need to know which book is a reliable source of knowledge and is suited to your personal level of understanding.
ProjectPro
FEBRUARY 16, 2023
ProjectPro has curated a list of the best Apache Spark books that cater to different levels of expertise, from beginner to pros. So sit back, grab a cup of coffee, and let's dive into the world of reading the top Apache Spark books. So sit back, grab a cup of coffee, and let's dive into the world of reading the top Apache Spark books.
Team Data Science
DECEMBER 17, 2020
Structured Query Language (SQL) SQL has been in existence for some time now, and database systems are still prevalent and important in many companies. Data scientists should acquire some basic SQL functionality. Getting a practical knowledge of data processing will also help. See you later.
Data Engineering Podcast
DECEMBER 9, 2018
Jean George Perrin has been so impressed by the versatility of Spark that he is writing a book for data engineers to hit the ground running. What was your motivation for writing a book about Spark? What have been some of the most interesting or useful lessons that you have learned in the process of writing a book about Spark?
KDnuggets
NOVEMBER 16, 2022
10 Cheat Sheets You Need To Ace Data Science Interview • 7 Free Platforms for Building a Strong Data Science Portfolio • The Complete Free PyTorch Course for Deep Learning • 3 Valuable Skills That Have Doubled My Income as a Data Scientist • 25 Advanced SQL Interview Questions for Data Scientists • A Data Science Portfolio That Will Land You The Job (..)
Knowledge Hut
OCTOBER 11, 2023
There are many books on the market where you can learn and understand Power BI. This post will discuss the Top 10 Power BI books for beginners and advanced-level readers. From understanding the basics to mastering advanced techniques, these books are your gateway to harnessing the full potential of Power BI.
Christophe Blefari
JANUARY 26, 2024
Data mesh by the book will not work, if you want to scale you can't just add more people in a central team. ClickHouse and the one billion row challenge — ClickHouse proposed a SQL solution with ClickHouse local to the a challenge consisting in aggregating 1B rows in a text file.
Christophe Blefari
MARCH 22, 2024
Commun Corpus — A HuggingFace dataset collection including public domain texts, newspapers and books in a lot of languages. This is a nice way to mix SQL and Python code. This is a visualisation of the hours spent by Erin reading books in 2023. Designing RAGs — A super long and detailed article about RAG.
Team Data Science
JANUARY 8, 2021
Data engineering function involve the fundamental understanding of data utilization skills such as coding, python, SQL database, relational database, AWS in the field of big data. It would even be an additional benefit for them to have expertise in computer networking as well. See you later.
Sync Computing
DECEMBER 10, 2024
Book a Demo! In fact, the DBU rate of a large SQL warehouse is 40 DBUs/hr. If youre interested in understanding whats right for your companys Databricks usage, feel free to book a time that works for you here ! The post Databricks Compute Comparison: Classic Jobs vs Serverless Jobs vs SQL Warehouses appeared first on Sync.
Data Engineering Podcast
DECEMBER 28, 2022
Datafold shows how a change in SQL code affects your data, both on a statistical level and down to individual rows and values before it gets merged to production. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold. By the time errors have made their way into production, it’s often too late and damage is done.
Data Engineering Podcast
JANUARY 30, 2022
In addition to that, the host curated the essays contained in the book "97 Things Every Data Engineer Should Know", using the knowledge and context gained from running the show to inform the selection process. Overview of the 97 things book How the project came about Goals of the book What are the paths into data engineering?
Data Engineering Podcast
DECEMBER 25, 2022
Datafold shows how a change in SQL code affects your data, both on a statistical level and down to individual rows and values before it gets merged to production. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold. By the time errors have made their way into production, it’s often too late and damage is done.
Knowledge Hut
OCTOBER 29, 2023
Online cloud-enabled bookstore system This system can function as an internet bookstore by utilizing SQL and C#. The books would be divided into sections to help users find their desired book without becoming overwhelmed by a database. The system aims to enhance security and provide measures against SQL injection hacking.
Data Engineering Podcast
OCTOBER 13, 2021
In this episode he discusses his experiences and how he approached the work of distilling them for his book "Fail Fast, Learn Faster" This is an entertaining and enlightening exploration of the business side of data with an industry veteran. Can you start by discussing the focus of the book and what motivated you to write it?
Data Engineering Podcast
MARCH 10, 2024
For data engineers who battle to build and scale high quality data workflows on the data lake, Starburst powers petabyte-scale SQL analytics fast, at a fraction of the cost of traditional methods, so that you can meet all your data needs ranging from AI to data applications to complete analytics. Your first 30 days are free!
Knowledge Hut
JULY 7, 2023
SQL, or Structured Query Language, is one the most widely used programming languages, which has not changed in decades. SQL is responsible for fetching the relevant data as per the requirement from the vast data store known as databases. This blog aims to cover SQL projects which can help you enhance your SQL skillset.
Monte Carlo
APRIL 8, 2025
So before you start writing SQL or labeling columns, it’s important to understand what youre working with. SQL lets you control who can see what with role-based access. You can even book a demo with just an email. Ask yourself: What kind of data are you storing? Now its time to set some ground rules.
Data Engineering Podcast
JANUARY 15, 2022
In this episode Brian McMillan shares his work on the book "Building Data Products" and how he is working to educate business users and data professionals about the combination of technical, economical, and business considerations that need to be blended for these projects to succeed. Who is your target audience?
Data Engineering Podcast
JUNE 19, 2022
Datafold shows how a change in SQL code affects your data, both on a statistical level and down to individual rows and values before it gets merged to production. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold.
Data Engineering Podcast
NOVEMBER 6, 2022
Datafold shows how a change in SQL code affects your data, both on a statistical level and down to individual rows and values before it gets merged to production. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold.
Data Engineering Podcast
NOVEMBER 20, 2022
Datafold shows how a change in SQL code affects your data, both on a statistical level and down to individual rows and values before it gets merged to production. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold.
Rockset
FEBRUARY 21, 2019
A quick survey of my laptop folders reveals account statements, receipts, technical papers, book chapters, and presentation slides—all PDFs. Which is a great reason for Rockset to support SQL queries on PDF files, in our mission to make data more usable to everyone.
Expert insights. Personalized for you.
We have resent the email to
Are you sure you want to cancel your subscriptions?
Let's personalize your content