Sun.Oct 20, 2024

article thumbnail

ETL Pipelines in Python: Best Practices and Techniques

Towards Data Science

Strategies for Enhancing Generalizability, Scalability, and Maintainability in Your ETL Pipelines Continue reading on Towards Data Science »

Python 57
article thumbnail

Sustainability in Agile: How Scrum Roles Can Drive Greener Practices by James Camilleri

Scott Logic

Introduction The intersection of sustainability and software development is gaining attention, with increasing awareness of the environmental impact of IT practices. A recent blog by Pini Reznik discusses the environmental impact of Agile software development, highlighting how its emphasis on speed and adaptability often leads to resource waste, particularly in cloud computing.

article thumbnail

Data Engineering Weekly #194

Data Engineering Weekly

Notion: A brief history of Notion’s data catalog Notion writes about its journey in adopting data catalogs and describes how a vanilla data catalog solution will only be effective if it adopts a strong data platform foundation. Adopting Typescript rather than the specialized IDL languages is a good strategy, although I wonder how it works in cross-language systems like Android & iOS.

article thumbnail

Into The Multi-cloud by Dave Ogle

Scott Logic

Introducing, the Multi-cloud Today most software projects utilise some form of cloud computing to host their infrastructure, often choosing one of the big three providers, Google (GCP), Microsoft (Azure) or Amazon (AWS). What I think is less common, is to consider multiple cloud providers when looking to host resources although it is something which is mentioned from time to time.

Cloud 52
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.