Mon.Feb 05, 2024

article thumbnail

A Data Lake, You Call It? It’s a Data Swamp

KDnuggets

How and why the data lake architecture often fails to meet its promises. And how better governance helps mitigate such challenges.

Data Lake 145
article thumbnail

Top 5 Data + AI Predictions for Financial Services in 2024

Snowflake

Generative AI tops every list of major financial services trends for 2024. And it’s no wonder — this new technology has the potential to revolutionize the industry by augmenting the value of employee work, driving organizational efficiencies, providing personalized customer experiences, and uncovering new insights from vast amounts of data. Its predictive capabilities can help leaders anticipate market trends and make more informed decisions, improving financial outcomes for customers as well as

article thumbnail

Books, Courses, and Live Events to Learn Generative AI with O’Reilly

KDnuggets

If you are new to generative AI or an expert who wants to learn more, O’Reilly offers a range of resources to kickstart your generative AI journey.

144
144
article thumbnail

Type Classes in Kotlin: A Practical Guide

Rock the JVM

By Riccardo Cardin In this article, we delve into the concept of type classes in Kotlin, a powerful tool that allows developers to abstract logic for different data types. We’ll take data validation as an example to show how type classes can be used to write generic and reusable code. Our implementation will be based on the Arrow Kt library, which will exploit Kotlin’s context receivers.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Data Maturity: The Cornerstone of AI-Enabled Innovation

KDnuggets

This article outlines strategies for overcoming data maturity challenges and accelerating AI adoption.

Data 140
article thumbnail

Navigating Slowly Changing Dimensions (SCD) and Data Reinstatement: A Comprehensive Guide

Towards Data Science

Navigating Slowly Changing Dimensions (SCD) and Data Restatement: A Comprehensive Guide Strategies for efficiently managing dimension changes and data restatement in enterprise data warehousing Imagine this, you are a data engineer working for a large retail company that utilizes the incremental load technique in data warehousing. This technique involves selectively updating or loading only the new or modified data since the last update.

Retail 71

More Trending

article thumbnail

Evaluating Retrieval in RAGs: A Gentle Introduction

Tweag

No, not this RAG. Despite their many capabilities, Large Language Models (LLMs) have a serious limitation: they’re stuck in time and their knowledge is limited to the data they have been trained on. Updating the knowledge of an LLM can take two forms: fine-tuning, which we will address in a future post, and the ever-present RAG. RAG, short for Retrieval Augmented Generation, has garnered a lot of attention in the GenAI community and for good reasons.

article thumbnail

Material Master Data 101: Challenges and Solutions

Precisely

The manufacturing processes you manage in SAP ERP systems require vast amounts of interdependent data. Material master data is one of the crucial pieces that areas throughout an organization rely on for successful operations. However, manual material data management often comes with many errors and great risks for your business. The good news? Automation can be a game-changer – solving your biggest challenges and unlocking new opportunities.

article thumbnail

Kotlin 101: Type Classes Quickly Explained

Rock the JVM

Discover type classes in Kotlin: a powerful pattern to organize your code for improved readability, maintainability, and flexibility

Coding 52
article thumbnail

A Guide to Seamless Data Fabric Implementation

Striim

Organizations are grappling with the increasing complexity and diversity of their data sources. Traditional approaches often fall short in addressing the challenges posed by disparate data silos, and there arises a need for a more cohesive and integrated solution. Enter Data Fabric — a paradigm that promises a unified, scalable, and agile approach to managing the intricacies of modern data.

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!