Tue.Jul 30, 2024

article thumbnail

7 Steps to Master the Art of Data Storytelling

KDnuggets

Follow this 7 step recipe to mastering effective insight and information dissemination through compelling data story crafting.

Data 144
article thumbnail

New with Confluent Platform: Enhanced security with OAuth Support, Confluent Platform for Apache Flink® (LA), a new Connector, and More

Confluent

Confluent Platform 7.

121
121
article thumbnail

How to Perform Matrix Operations with NumPy

KDnuggets

Learning how to perform several of the most basic matrix operations with NumPy.

Python 123
article thumbnail

OKR-Centric Delivery Models for Engineering-Focused Enterprises

databricks

Introduction An organization adopting new technologies or on a modernization journey typically focuses on upcoming tools, their features and potential performance/cost improvements under.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

MarshMallow: The Sweetest Python Library for Data Serialization and Validation

KDnuggets

Stop debugging data mismatches and focus on your application logic when you let Marshmallow handle serialization, deserialization and validation for you.

Python 91
article thumbnail

Data Warehouse, Redefined

Towards Data Science

Rethinking data warehousing: Why redefinition is necessary even beyond Modern Data Warehouse (MDW) and Lakehouse Models Continue reading on Towards Data Science »

More Trending

article thumbnail

Mobiumata by Chris Price

Scott Logic

Mobiumata (a concatenation of Möbius strip and cellular automata) is a small interactive art piece that allows folk to play god to 1,500 LED cells wrapped into a Möbius strip. When Scott Logic needed something to act as a talking point for a conference booth, as a big fan of all things flashy, shiny and interactive, I jumped at the opportunity to create something engaging that was roughly themed around AI.

Coding 52
article thumbnail

The 6 Data Quality Dimensions with Examples

Monte Carlo

It’s clear that data quality is becoming more of a focus for more data teams. So why are there still so many questions like these: A quick search on subreddits for data engineers, data analysts, data scientists, and more can yield a plethora of users seeking data quality advice. And while the comment below may seem like the accepted way of doing data quality management… … there’s actually a much better way.

article thumbnail

If agile is the answer, what is the question? by Dave Ogle

Scott Logic

The other day a colleague asked this question on one of our internal Slack channels: “If you were writing headings in a document, and using the capitalisation style of capitalising the first letter of just the important words in a header, how would you capitalise ‘what we are trying to achieve?’” The answers were many and varied, ranging from serious answers with supporting documentation to tongue-in-cheek responses, the answer which won the day though was this: “Objective” Clever, isn’t it?

article thumbnail

How to Build RAG Applications Using Snowflake Cortex?

Hevo

GPT has become a go-to search engine for many. We often use it instead of Google to get a quick solution for any query. Given its popularity, why don’t you include a customer chatbot or a troubleshooting chatbot service in your business? Imagine having a brand-specific chatbot with expertise in answering your business related queries.

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Change Data Capture as the Backbone of RAG AI-Driven System Resilience Strategies

Striim

Ensuring system resilience is critical for maintaining a competitive edge in today’s data-driven world. As businesses rely on real-time data to fuel decision-making, it’s essential that their systems can withstand disruptions and maintain functionality. Change Data Capture (CDC) is a key player, particularly in AI-driven systems where real-time data integration and adaptive responses are crucial.

Systems 52
article thumbnail

Optimizing Data Warehouse Cost using Apache Iceberg

Hevo

Data warehouses bring phenomenal results from well-informed, data-driven decision-making for an organization. There were times when only companies with large capital, and substantial IT infrastructures invested time and effort, let alone money, in a data warehouse.