Wed.May 15, 2024

article thumbnail

Why You Should Replace Pandas with Polars

Confessions of a Data Guy

I’m still amazed to this day how many folks hold onto stuff they love, they just can’t let it go. I get it, sorta, I’m the same way. There are reasons why people do the things they do, even if they are hard for us to understand. It blows my mind when I see something […] The post Why You Should Replace Pandas with Polars appeared first on Confessions of a Data Guy.

IT 148
article thumbnail

The Best Strategies for Fine-Tuning Large Language Models

KDnuggets

Learn how to master the art of fine-tuning LLMs for specialized tasks.

135
135
article thumbnail

Developing Production Level Databricks Pipelines.

Confessions of a Data Guy

A question that comes up often … “How do I develop Production Level Databricks Pipelines?” Or maybe someone just has a feeling that using Notebooks all day long is expensive and ends up being an unreliable way to produce Databricks Spark + Delta Lake pipelines that run well … without error. It isn’t really that […] The post Developing Production Level Databricks Pipelines. appeared first on Confessions of a Data Guy.

Data 130
article thumbnail

The Easiest Way of Running Llama 3 Locally

KDnuggets

Download, install, and type one command in the terminal to start using Llama 3 on your laptop.

130
130
article thumbnail

Apache Airflow® 101 Essential Tips for Beginners

Apache Airflow® is the open-source standard to manage workflows as code. It is a versatile tool used in companies across the world from agile startups to tech giants to flagship enterprises across all industries. Due to its widespread adoption, Airflow knowledge is paramount to success in the field of data engineering.

article thumbnail

Snowflake Invests in Metaplane for Deep, End-to-End Observability in the Data Cloud

Snowflake

According to Infosys, 35% of AI projects will either fail or experience delays because of poor data quality. There’s a huge gap between the data quality most companies have by default and the data quality needed for successful AI. And that gap is directly affecting the performance and reliability of AI systems everywhere. As organizations expand their use of Snowflake to build and deploy new AI-powered data applications, comprehensive data observability is critical to success.

Cloud 122

More Trending

article thumbnail

5 Ways Advertising, Media and Entertainment Companies are Using Gen AI

Snowflake

The emergence of generative AI (gen AI) heralds a new, groundbreaking era for advertising, media and entertainment. According to a recent Snowflake report, Advertising, Media and Entertainment Data + AI Predictions 2024 , gen AI is going to transform the industry — from content creation to customer experience. The companies that will come out ahead during this time are those that most successfully and quickly supercharge their data strategy.

article thumbnail

Unapologetically Technical Episode 11 – Hubert Dulay

Jesse Anderson

In this episode of Unapologetically Technical, I interview Hubert Dulay, the author of Streaming Data Mesh and Developer Advocate at StarTree. We talked about his early experience with web backends like CORBA and SOAP and how those prepared him for data work. He shares his advice for those with web development skills to transition into data and what it’s like for a person leaving a company after a long tenure there.

IT 100
article thumbnail

LLaMA 3: Meta’s Most Powerful Open-Source Model Yet

KDnuggets

This open model outperforms Google’s Gemini and other cutting-edge models on various benchmarks.

79
article thumbnail

What’s new for ArcGIS Defense Mapping in ArcGIS Pro 3.3

ArcGIS

Check out what's new for ArcGIS Defense Mapping in ArcGIS Pro 3.3. Enhancements were made to the Glossary Table, product files, and tools.

62
article thumbnail

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Navigating Knowledge Graphs - Creating Cypher Queries with LLMs by Ben Heyman

Scott Logic

Introduction In this blog post I’ll show you how to create graph database queries using a large language model (LLM) and very minimal query language knowledge. By using an LLM, anyone who wants to start using a graph database can do so, without having to learn a new querying language (“Great news!” I hear you all say). Graph databases are not like your typical relational database and are represented by a knowledge graph.

article thumbnail

Transitioning to Appcraft: Evolution of Zalando’s server-driven UI framework

Zalando Engineering

At the heart of Zalando's mobile content strategy lies the Appcraft platform, fueling 13 dynamic pages within the app. This framework is instrumental in delivering top-tier content formats, including the popular Zalando Stories. In this post we explain the origins and inner workings of the platform. The TNA Dilemma The Flexible Layout Kit (formerly known as Truly Native Apps, TNA was a framework used in Zalando App to render content dynamically.

article thumbnail

Guided Navigation in Power BI

FreshBI

Dashboards that tell good stories are premium dashboards. A body could for example create a 12 month cash flow forecast and call it a day’s work. …….Or instead, one could tell the story intended to be told by the dashboard and have a dashboard that reveals the sequential milestones along the 12 month cash projection. The milestones expected to be the major contributors to the cash balance and their causes and effects on the business.

BI 52
article thumbnail

Disable Inactive Users in Snowflake

Cloudyard

Read Time: 1 Minute, 23 Second Managing user accounts is a critical task in any organization, especially for ensuring security and compliance. With the increasing number of users and automating the process of disabling or inactive user accounts for employees who have left the company is essential. This blog post will explore how you can leverage Snowflake’s capabilities, specifically through a stored procedure, to automate the disabling of user accounts for those who have not logged in ove

SQL 52
article thumbnail

Apache Airflow® Crash Course: From 0 to Running your Pipeline in the Cloud

With over 30 million monthly downloads, Apache Airflow is the tool of choice for programmatically authoring, scheduling, and monitoring data pipelines. Airflow enables you to define workflows as Python code, allowing for dynamic and scalable pipelines suitable to any use case from ETL/ELT to running ML/AI operations in production. This introductory tutorial provides a crash course for writing and deploying your first Airflow pipeline.