Fri.Dec 06, 2024

article thumbnail

Top 5 Tips for Fine-Tuning LLMs

KDnuggets

104
104
article thumbnail

Equiniti: From Zero to AI

databricks

Equiniti wanted to centralize data and insights to its operations. To this end, it utilized the Databricks Data Intelligence Platform and Mosaic AI tools to enhance customer experience and drive innovation.

Utilities 102
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

AWS S3 Tables. Technical Introduction.

Confessions of a Data Guy

Well, everyone is abuzz with the recently announced S3 Tables that came out of AWS reinvent this year. I’m going to call fools gold on this one right out of the gate. I tried them out, in real life that is, not just some marketing buzz, and it will leave most people, not all, be […] The post AWS S3 Tables. Technical Introduction. appeared first on Confessions of a Data Guy.

AWS 130
article thumbnail

NumPy for Simulating Random Processes and Monte Carlo Methods

KDnuggets

Process 74
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

The Struggle Between Data Dark Ages and LLM Accuracy

Cloudera

Artificial Intelligence promises to transform lives and business as we know it. But what does that future look like? The AI Forecast: Data and AI in the Cloud Era , sponsored by Cloudera, aims to take an objective look at the impact of AI on business, industry, and the world at large. Hosted weekly by Paul Muller, The AI Forecast speaks to experts in the space to understand the ins and outs of AI in the enterprise, the kinds of data architectures and infrastructures that support it, the guardrai

article thumbnail

5 Outside the Box Applications of Natural Language Processing

KDnuggets

Process 74

More Trending

article thumbnail

Webinar: Data Quality in a Medallion Architecture – 2024

DataKitchen

Would you like help maintaining high-quality data across every layer of your Medallion Architecture? Like an Olympic athlete training for the gold, your data needs a continuous, iterative process to maintain peak performance. We covered how Data Quality Testing, Observability, and Scorecards turn data quality into a dynamic process, helping you build accuracy, consistency, and trust at each layerBronze, Silver, and Gold.

article thumbnail

Test smarter not harder: Where should tests go in your pipeline?

dbt Developer Hub

Greetings, dbters! Its Faith & Jerrie, back again to offer tactical advice on where to put tests in your pipeline. In our first post on refining testing best practices, we developed a prioritized list of data quality concerns. We also documented first steps for debugging each concern. This post will guide you on where specific tests should go in your data pipeline.

article thumbnail

Marketing Data Warehouse: An Easy Guide and Everything You Need to Know

Hevo

We know just how hard it is to run your marketing data. The variety of campaigns running through platforms, such as Google Ads, Facebook, and HubSpot, among others, gives the kind of information that would just flood you.

article thumbnail

The Chaos of Catalogs

Data Engineering Weekly

48
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Hevo vs Airflow: The Better Tool?

Hevo

Data integration is an integral part of modern business strategy, enabling businesses to convert raw data into actionable information and make data-driven decisions. Tools like Apache Airflow are used and popular for workflow automation. However, its technical complexities and steeper learning curve can create a challenge for teams that require an efficient real-time data pipeline.

article thumbnail

Newsletter: Your December Dose of Data & AI

Data Council

Data 40
article thumbnail

Matillion vs Airflow: Which one to choose in 2025?

Hevo

Matillion is a cloud-based ETL tool known for its user-friendly, low-code interface. It’s great for teams that want to get pipelines up and running quickly without heavy coding. It also integrates seamlessly with cloud platforms like Snowflake, BigQuery, and Redshift, making it a solid choice for companies already working in the cloud.

article thumbnail

Data Council 2025: Meet the Track Hosts

Data Council

Data 40
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.