Sun.May 26, 2024

article thumbnail

Why Data Analysts And Engineers Make Great Consultants

Seattle Data Guy

Many data engineers and analysts don’t realize how valuable the knowledge they have is. They’ve spent hours upon hours learning SQL, Python, how to properly analyze data, build data warehouses, and understand the differences between eight different ETL solutions. Even what they might think is basic knowledge could be worth $10,000 to $100,000+ for a… Read more The post Why Data Analysts And Engineers Make Great Consultants appeared first on Seattle Data Guy.

article thumbnail

Data Migration Strategies For Large Scale Systems

Data Engineering Podcast

Summary Any software system that survives long enough will require some form of migration or evolution. When that system is responsible for the data layer the process becomes more challenging. Sriram Panyam has been involved in several projects that required migration of large volumes of data in high traffic environments. In this episode he shares some of the valuable lessons that he learned about how to make those projects successful.

Systems 130
article thumbnail

Data Engineering Weekly #173

Data Engineering Weekly

Luke Byrne: Questions About AI Is AI all about hype? What do humans spend their time on in a post-AGI world? There are many burning questions from our readers, too, and the author did an amazing compilation of some of the widely discussed questions around AI development. What is your burning question about AI? [link] Chris Riccomini: S3 Is Showing Its Age Building a global scale distributed system with eleven 9s of durability and four 9s of availability is no easy feat.

article thumbnail

What Is Kubernetes? Definitive Guide for Dummies

Knowledge Hut

Kubernetes is a system for managing and orchestrating containerized applications across a cluster of nodes. It was designed by Google to manage and schedule containers at scale. Kubernetes can run on-premises or in the cloud, making it a popular choice for modernizing IT infrastructure. Many major companies use Kubernetes to manage their containerized applications, including Google, and Shopify.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.