Mon.Jul 29, 2024

article thumbnail

Building Data Science Pipelines Using Pandas

KDnuggets

Learn to build the end-to-end data science pipelines from data ingestion to data visualization using Pandas pipe method.

article thumbnail

Introducing Apache Kafka® 3.8

Confluent

Apache Kafka 3.8 adds 17 new KIPs (13 for Core, 3 for Streams & 1 for Connect). Highlights include 2 new Docker images, the ability to set task assignors, and more!

Kafka 131
article thumbnail

How to Perform Memory-Efficient Operations on Large Datasets with Pandas

KDnuggets

Let's learn how to perform memory-efficient operations in pandas with large dataset.

Datasets 143
article thumbnail

Beyond Web Mercator: Projected Basemaps Revisited

ArcGIS

More small-scale projected basemaps to add to the set I built in 2023

Project 105
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Generative AI for Capital Markets

databricks

Financial Valuations & Comparative Analysis Financial institutions specialized in capital markets such as hedge funds, market makers and pension funds have long been.

103
103
article thumbnail

5 Top Machine Learning Courses You Can Take in 2024

KDnuggets

Forget about going to university and become a machine learning professional with these 5 top certifications.

More Trending

article thumbnail

Deploying dbt Projects at Scale on Google Cloud

Towards Data Science

Containerising and running dbt projects with Artifact Registry, Cloud Composer, GitHub Actions and dbt-airflow Continue reading on Towards Data Science »

article thumbnail

Enhance IT Visibility: Integrate IBM i and IBM Z Data into ServiceNow

Precisely

Key Takeaways Machine data created by your IBM Z and IBM i is critical for flagging anomalies that may exist in your business and meeting regulatory compliance requirements Automated discovery and integration tools, like a CMDB, streamline processes, reduce risks, and enhance decision-making. Precisely Ironstream ensures real-time updates and a holistic approach to IT service management, optimizing your operations by integrating IBM Z and IBM i machine data into ServiceNow.

IT 52
article thumbnail

Beyond the Data Complexity: Building Agile, Reusable Data Architectures

The Modern Data Company

There’s no denying the importance of data in today’s business world. Done right, it is the backbone of organizational success, driving innovation, unlocking new revenue models, and delivering a competitive edge. However, data leaders worldwide are grappling with a challenging question: “Why aren’t our data initiatives delivering the expected ROI?

article thumbnail

Balancing AI Power with Human Insight: A Humorous Dive into Text Analytics

Elder Research

The post Balancing AI Power with Human Insight: A Humorous Dive into Text Analytics appeared first on Elder Research.

52
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Maximize savings on your unused Fabric Capacity

Towards Data Science

Automate your Fabric capacity state with Azure Logic Apps Continue reading on Towards Data Science »

article thumbnail

Building Lyft’s Next Emblem — Glow

Lyft Engineering

Building Lyfts Next EmblemGlow By: Avneet Oberoi , Michael Vernier , Phoenix Li , MasroorAhmed Introduction Long time riders might remember the original fuzzy, pink Carstache emblem that made Lyft universally recognizable. Over the years, the emblem dropped the fuzz for pink lights in the Glowstache and later evolved with more colors as the beloved Amp , which has been in active use for over seven years!

article thumbnail

What Makes Data-in-Motion Architectures a Must-Have for the Modern Enterprise

Cloudera

Cloudera’s data-in-motion architecture is a comprehensive set of scalable, modular, re-composable capabilities that help organizations deliver smart automation and real-time data products with maximum efficiency while remaining agile to meet changing business needs. In this blog, we will examine the “why” behind streaming data and review some high-level guidelines for how organizations should build their data-in-motion architecture of the future.