Thu.Jan 02, 2025

article thumbnail

5 Simple Projects to Start Today: A Learning Roadmap for Data Engineering

Towards Data Science

Start with 5 practical projects to lay the foundation for your data engineering roadmap.

article thumbnail

10 Pandas One-Liners for Quick Data Quality Checks

KDnuggets

Want to run some quick data quality checks? Here are 10 pandas one-liners that'll come in handy.

Data 113
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

How HP is optimizing the 3D Printing supply chain using Delta Sharing

databricks

Javier Lagares is a Principal Data Engineer at HP, where he leads the development of data-driven solutions for the 3D printing business. With.

article thumbnail

Part 2: A Survey of Analytics Engineering Work at Netflix

Netflix Tech

This article is the second in a multi-part series sharing a breadth of Analytics Engineering work at Netflix, recently presented as part of our annual internal Analytics Engineering conference. Need to catch up? Check out Part 1. In this article, we highlight a few exciting analytic business applications, and in our final article well go into aspects of the technical craft.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Develop a Stand-out Data Science Portfolio with GitHub

KDnuggets

Improve your chances of getting noticed with these tips.

article thumbnail

Data Engineering — ORM and ODM with Python

Towards Data Science

Manipulate database data leveraging an object-oriented programming paradigm Continue reading on Towards Data Science

More Trending

article thumbnail

How I Built a Real-Time Weather Data Pipeline Using AWS—Entirely Serverless

Towards Data Science

Introduction Data Proposed Workflow AWS Cloud Components Collecting the Data (Lambda Function 1) Writing the Data to the Table (Lambda Function 2) Converting the data in CSV

AWS 56
article thumbnail

RxJS Operations in Angular

Edureka

Angular is a well-known front-end tool for making web apps that are both dynamic and reliable. Additionally, RxJS in Angular offers a full set of tools made to easily handle asynchronous processes and reactive programming. This combination enables developers to create efficient, responsive, and user-friendly applications that adhere to modern web standards.

article thumbnail

Three AI Trends Developers Need to Know in 2025

Confluent

Continuing issues with hallucinations, the increased independence of agentic AI systems, and the greater usage of dynamic data sources, are three AI trends to monitor in 2025.

Systems 52
article thumbnail

Understanding Cardinality of Relationships in Power BI: A Complete Guide

Edureka

The Cardinality Of Relationships in Power BI plays an important role in defining the relationships between tables as part of the data modeling process. Proper data interpretation is made feasible with performance optimization, as well as highly accurate and efficient reports. Therefore, being well-versed in cardinality has been proven to facilitate the building of reliable models and significantly improve reporting efficiency.

BI 52
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

What is Malvertising & How Do You Avoid It?

Edureka

One day, while surfing the web, you click on an ad for a Black Friday Sale. The moment you do, a security alert pops up with a warning that Your Computer may be Compromised. To fix the issue, it asks that you download an application. You would click on the link, thinking this could be a potential solution. The download page claims to have a solution to your problem, but in reality, it infects your computer with malware, which slows it down and compromises your data.

IT 52
article thumbnail

Understanding Literals in Python: A Beginner’s Guide

Edureka

In the world of programming, just like how we rely on stable components in technology—such as reliable servers, consistent APIs, or robust frameworks—Python also has its own set of “constants” known as literals. Literals in Python are the direct representations of fixed values in your code. These could be numbers, strings, or even collections like lists.

Python 52
article thumbnail

What Is NIS2? – Compliance and Policies

Edureka

Protecting vital infrastructure and digital services has never been more crucial in the increasingly connected world of today. The strategies and laws designed to lower these risks must change in parallel with the complexity of cyberthreats. The NIS2 Directive from the European Union enhances cybersecurity by introducing legal provisions that improve defenses in a number of sectors.

article thumbnail

What are Logs in Cybersecurity? And It’s Importance

Edureka

Imagine you’re in the control room of a huge digital system, where everything is constantly buzzing. Every click, transaction, error, and attempt to access something is quietly being recorded in the background. This is where log files come in. From the apps you use every day to the firewalls protecting your network, log files give you a behind-the-scenes look at what’s really going on.

Bytes 52
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Top 10 Full Stack Development Tools in 2025

Edureka

Full stack development entails working on a web application’s front-end (client side) and back-end (server side). It also necessitates a full collection of tools to manage all parts of the online application, from the user interface to server-side logic and data storage (database). We’ll go over several tools and the technologies that go along with them that are primarily used while creating web applications.

Java 52
article thumbnail

Network Security vs Cyber Security: What’s the Difference?

Edureka

In the current digital atmosphere, it is very important to understand the difference between network security vs cybersecurity for the proper protection of sensitive data. Though these are often used as identical terms to each other, they stress different aspects of security. Network security is basically concerned with the protection of the network infrastructure, which includes devices and connections, from unauthorized access and other forms of threats.

Banking 52
article thumbnail

AWS Storage Gateway: A Comprehensive Guide

Edureka

AWS Storage Gateway is a service that lets you connect your systems on-premises to the cloud. This blog will talk about it. Discover the various types of storage ports, their main features, and how they can help businesses better manage their data. Companies can use the on-premises processes they already have while also using these tools to store data in the cloud.

AWS 52
article thumbnail

What is Git Flow and How to use Git Flow

Edureka

Working on the same code with a team can get messy. For instance, features, bug fixes, and releases often overlap, causing confusion. Fortunately that’s where Git Flow helps. It’s an easy-to-follow workflow that will keep everything organized so your team can work smoothly without stepping on each other’s toes. Using Git Flow will know exactly where new features come in, bugs are fixed, and releases are prepared-all of these things while keeping the main code stable.

Coding 52
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Mastering the CALCULATE() Function in Power BI: A Comprehensive Guide

Edureka

If you are familiar with Power BI, you are already aware that the CALCULATE function in Power BI is one of the most important DAX functions in the platform. This function is very useful and popular and provides many opportunities to improve your data analysis. There are many uses for Power BI’s CALCULATE function, so in this post, I’ll walk through how it works and look at a few common uses.

BI 52