Build a Reproducible and Maintainable Data Science Project: A Free Online Book
KDnuggets
AUGUST 29, 2022
This free online book is a fantastic resource on how to structure, manage, and maintain your real-world data science projects.
This site uses cookies to improve your experience. To help us insure we adhere to various privacy regulations, please select your country/region of residence. If you do not select a country, we will assume you are from the United States. Select your Cookie Settings or view our Privacy Policy and Terms of Use.
Cookies and similar technologies are used on this website for proper function of the website, for tracking performance analytics and for marketing purposes. We and some of our third-party providers may use cookie data for various purposes. Please review the cookie settings below and choose your preference.
Used for the proper function of the website
Used for monitoring website traffic and interactions
Cookies and similar technologies are used on this website for proper function of the website, for tracking performance analytics and for marketing purposes. We and some of our third-party providers may use cookie data for various purposes. Please review the cookie settings below and choose your preference.
KDnuggets
AUGUST 29, 2022
This free online book is a fantastic resource on how to structure, manage, and maintain your real-world data science projects.
KDnuggets
JANUARY 9, 2020
This book is thought for beginners in Machine Learning, that are looking for a practical approach to learning by building projects and studying the different Machine Learning algorithms within a specific context.
The Pragmatic Engineer
MARCH 12, 2024
Lots of time has passed, yet the book is still relevant. ” Brooks agrees with this observation, and suggests a radical solution: have as few senior programmers as possible, and build a team around each one – a bit like how a hospital surgeon leads a whole team. ‘The Mythical Man Month’ by Frederick P.
KDnuggets
MARCH 20, 2020
We have compiled a list of some of the best (and free) machine learning books that will prove helpful for everyone aspiring to build a career in the field.
Advertisement
Think your customers will pay more for data visualizations in your application? Five years ago they may have. But today, dashboards and visualizations have become table stakes. Discover which features will differentiate your application and maximize the ROI of your embedded analytics. Brought to you by Logi Analytics.
The Pragmatic Engineer
APRIL 21, 2024
A refresher on OpenAI, and on Evan Evan: how did you join OpenAI, and end up heading the Applied engineering group – which also builds ChatGPT? I do not have a PhD in Machine Learning, and was excited by the idea of building APIs and engineering teams. With this, it’s over to Evan. My questions are in italic.
Knowledge Hut
DECEMBER 27, 2023
Also, you must go through certain software engineering books to make your knowledge and skills robust for the job. In this article, we will read about some of the most prevalent and widely loved and best books to read for software engineers that can help you get a good hold of all the concepts in engineering.
KDnuggets
NOVEMBER 9, 2022
Finally a book on Attention. Learn how to build your own transformer model with Machine Learning Mastery's new book.
KDnuggets
OCTOBER 27, 2023
This week on KDnuggets: Go from learning what large language models are to building and deploying LLM apps in 7 steps • Check this list of free books for learning Python, statistics, linear algebra, machine learning and deep learning • And much, much more!
Data Engineering Podcast
DECEMBER 24, 2023
Elad Eldor has experienced these challenges first-hand, leading to his work writing the book "Kafka: : Troubleshooting in Production" In this episode he highlights the sources of complexity that contribute to Kafka's operational difficulties, and some of the main ways to identify and mitigate potential sources of trouble.
François Nguyen
FEBRUARY 28, 2021
If there is one only book to read about lean manufacturing, this is the one. This is the kind of book you can read again and again and still learn something about your current context. It is also a book you can read whatever your industry, you will always find situations covered by this book.
Knowledge Hut
JUNE 25, 2024
For those interested in studying this programming language, several best books for python data science are accessible. Top 8 Python Data Science Books for 2023 Python is one of the programming languages that is most commonly utilized in the field of data science. This book offers practical programming solutions to these problems.
Data Engineering Podcast
FEBRUARY 26, 2023
Jean-Georges Perrin was tasked with designing a new data platform implementation at PayPal and wound up building a data mesh. It's supposed to make building smarter, faster, and more flexible data infrastructures a breeze. We feel your pain. It ends up being anything but that. You can't optimize for everything all at once.
Confluent
FEBRUARY 24, 2022
A few years ago I helped build an event-driven system for gym bookings. The pitch was that we were building a better experience for both the gym members booking different […].
KDnuggets
FEBRUARY 28, 2022
The book has more math than our other books and over 85 code examples to help you understand the concepts. Unless you have a basic knowledge of calculus, you cannot understand how machine learning algorithms are developed.
Knowledge Hut
NOVEMBER 7, 2023
With that in mind, having the best CSM books to help you with Scrum preparation would go a long way in assisting you to become the expert you desire to be. Along with the best Scrum book, it is a plus point to go for CSM online training and boost your learning. Who is a CSM?
Data Engineering Podcast
JULY 24, 2022
Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. What are your goals with this book?
KDnuggets
APRIL 27, 2022
A Brief Introduction to Papers With Code; Machine Learning Books You Need To Read In 2022; Building a Scalable ETL with SQL + Python; 7 Steps to Mastering SQL for Data Science; Top Data Science Projects to Build Your Skills.
dbt Developer Hub
APRIL 19, 2023
We can then build the OBT by running dbt run. The goal of dimensional modeling is to take raw data and transform it into Fact and Dimension tables that represent the business. Using dbt_utils.star() , we select all columns except the surrogate key columns since the surrogate keys don't hold any meaning besides being useful for the joins.
Data Engineering Podcast
DECEMBER 7, 2020
Summary Building data products are complicated by the fact that there are so many different stakeholders with competing goals and priorities. If you hand a book to a new data engineer, what wisdom would you add to it? What are your thoughts on when to hire an outside consultant, vs building internal capacity?
The Pragmatic Engineer
NOVEMBER 21, 2023
He then worked at the casual games company Zynga, building their in-game advertising platform. In 1997, when Amazon was a one-floor, sub-100 person startup trying to sell books online, I experienced what every engineer dreads. Backend code I wrote and pushed to prod took down Amazon.com for several hours.
Towards Data Science
FEBRUARY 19, 2024
Image from Unsplash Building a Semantic Book Search: Scale an Embedding Pipeline with Apache Spark and AWS EMR Serverless Using OpenAI’s Clip model to support natural language search on a collection of 70k book covers In a previous post I did a little PoC to see if I could use OpenAI’s Clip model to build a semantic book search.
KDnuggets
APRIL 5, 2023
Learn about Apache Kafka architecture and its implementation using a real-world use case of a taxi booking app.
Data Engineering Podcast
MAY 13, 2021
If you want to build a warehouse that gives you both control and flexibility then you might consider building on top of the venerable PostgreSQL project. In this episode Thomas Richter and Joshua Drake share their advice on how to build a production ready data warehouse with Postgres.
The Pragmatic Engineer
OCTOBER 10, 2024
Forks gaining momentum that has the code of Wordpress, but uses a different name could also be in the books. For any and all vendors offering managed Wordpress hosting: Automattic making it clear that it will enforce its commercial trademark rights is a worrying sign. I can see a few things happen: A fork (or more).
The Pragmatic Engineer
SEPTEMBER 19, 2024
Later this year, he’s publishing a book on tech debt. This project helped onboard me to the software, its structure, its build, and our issue tracking and version control workflows. If you have suggestions of topics for Lou to cover in the upcoming book, please connect with him on LinkedIn , or via his website.
The Pragmatic Engineer
SEPTEMBER 22, 2023
Bun was mostly built by Jared Sumner , a former Stripe engineer, and recipient of the Thiel Fellowship (a grant of $100,000 for young people to drop out of school and build things, founded by venture capitalist, Peter Thiel). The innovator’s dilemma comes from the book of the same title by Clayton Christerenses. world by storm.
Cloudyard
MARCH 4, 2024
Read Time: 1 Minute, 32 Second In this blog post, we will explore how to leverage Snowpark and Streamlit to build an interactive book exploration application. With this Streamlit application, you can: Browse a comprehensive list of books: The app retrieves book data directly from a Snowflake table named “BOOKS.”
Data Engineering Podcast
OCTOBER 2, 2021
Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.
Data Engineering Podcast
APRIL 14, 2024
Dagster offers a new approach to building and running data platforms and data pipelines. Your host is Tobias Macey and today I'm interviewing Oren Eini about the work of designing and building a NoSQL database engine Interview Introduction How did you get involved in the area of data management? Your first 30 days are free!
Data Engineering Podcast
DECEMBER 14, 2020
They share the journey that they went through to build a scalable and maintainable system for web scraping, how to make it reliable and resilient to errors, and the lessons that they learned in the process. This was a great conversation about real world experiences in building a successful data-oriented business.
Snowflake
OCTOBER 9, 2023
Marketing teams are creating composable customer data platforms (CDPs) on the Data Cloud to build a 360-degree view of each customer. Customer Studio : Leverage a marketer-friendly suite of features to build audiences, coordinate campaigns, run tests and more. To learn more, book a demo with Hightouch.
Data Engineering Podcast
NOVEMBER 19, 2023
Dustin Dorsey and Cameron Cyr co-authored a practical guide to building your dbt project. In this episode they share their hard-won wisdom about how to build and scale your dbt projects. You recently wrote a book to give a crash course in best practices for dbt. While it is easy to adopt, there are many potential pitfalls.
Data Engineering Podcast
JULY 16, 2023
In his recent book "Datapreneurs" he reflects on the people and businesses that he has known and worked with and how they relied on data to deliver valuable services and drive meaningful change. What are the most interesting, unexpected, or challenging lessons that you have learned while working on the Datapreneurs book?
The Pragmatic Engineer
AUGUST 10, 2023
Learnerbly is an L&D platform hundreds of tech companies use, as it makes administering these budgets much simpler — and it’s also a lot easier for people to request books, courses, or newsletters. Small business hosts on the travel booking platform are waiting more than a month to be paid.
Edureka
APRIL 24, 2024
If you want to leverage PRINCE2 with expertise and pass the certification test with high marks, it’s essential to comprehend the core ideas, regulations and phases involved – right from navigating PRINCE2 to finding the right PRINCE2 books. It is an ideal resource for people looking for books on the PRINCE2 foundation exam.
Jesse Anderson
JUNE 7, 2022
We also had the good fortune that the universe was aligned with our desire to delve into data mesh, with her freshly published book, Data Mesh: Delivering Data-Driven Value at Scale , giving us a great base to have an in-depth discussion and not just another interview. One of the things I loved about her book is the diagrams.
Data Engineering Podcast
JULY 27, 2021
Now there’s a book that captures the foundational lessons and principles that underly everything that you hear about here. When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.
The Pragmatic Engineer
SEPTEMBER 28, 2023
Willem Spruijt is a software engineer whom I worked on the same team with at Uber in Amsterdam, building payments systems. An example of making a company-wide impact at Rise was during our Christmas hackathon, when an engineer built a Calendly-like feature for anyone to book a slot in your calendar.
François Nguyen
FEBRUARY 14, 2021
But the result is implémenting agile without a minimum of capabilities in your context is like building on sand. You can see many projects using the Agile methodology having hard life because they have to build the foundations and start the decoration. This book is a must read ! This is where the « quick and dirty » begins.
Knowledge Hut
MARCH 20, 2024
With a wide range and variety of UX design books available on the internet, professionals like you can continue to enhance your skills and broaden your knowledge base. Good books not only provide better insights but also ensure a proper, streamlined process of implementing your knowledge. What is UI/UX?
The Pragmatic Engineer
APRIL 19, 2023
I still remember being in a meeting where a Very Respected Engineer was explaining how they are building a project, and they said something along the lines of "and, of course, idempotency is non-negotiable." An example of this was how, when I was writing a book on resumes, I came across the concept of "ATSes rejecting resumes."
François Nguyen
FEBRUARY 21, 2021
You are not building anything solid without a strong tech leadership. This book could help This book is focus on organizing business and technology teams. You have this video doing the link between the spotify model and this book here. ” It is important but not enough if you want to reach technical excellence.
Christophe Blefari
MARCH 22, 2024
On my side I'll talk about Apache Superset and what you can do to build a complete application with it. Commun Corpus — A HuggingFace dataset collection including public domain texts, newspapers and books in a lot of languages. This is a visualisation of the hours spent by Erin reading books in 2023.
Expert insights. Personalized for you.
We have resent the email to
Are you sure you want to cancel your subscriptions?
Let's personalize your content