Sat. May 11, 2024 - Fri. May 17, 2024

5 Free University Courses to Learn Machine Learning

KDnuggets

Want to learn machine learning from the best resources? Check out these free machine learning courses from the top universities in the world.

Why You Should Replace Pandas with Polars

Confessions of a Data Guy

I’m still amazed to this day how many folks hold onto stuff they love; they just can’t let it go. I get it, sorta. I’m the same way. There are reasons why people do the things they do, even if they are hard for us to understand. It blows my mind when I see something […]

IT 147

Release Management For Data Platform Services And Logic

Data Engineering Podcast

Summary: Building a data platform is a substantial engineering endeavor. Once it is running, the next challenge is figuring out how to address release management for all of the different component parts. The services and systems need to be kept up to date, but so does the code that controls their behavior. In this episode your host Tobias Macey reflects on his current challenges in this area and some of the factors that contribute to the complexity of the problem.

Mind the map: a new design for the London Underground map

ArcGIS

A modern take on the London tube map with updated accessible colours, a re-classification of lines by type, and line symbols scaled by frequency

Designing 135

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

Mastering Python: 7 Strategies for Writing Clear, Organized, and Efficient Code

KDnuggets

Optimize Your Python Workflow: Proven Techniques for Crafting Production-Ready Code

Python 150

Data News — Week 24.20

Christophe Blefari

Lights on (credits). Hello you. The sun is out, the days are getting longer and Data News is still here. Next week marks 3 years of this newsletter/blog (yay 🎉). It'll be a time for looking back, reflecting and celebrating, but next week. This week, we reached 5000 members. Yes, 5000 of you read my content periodically. Just thank you ❤️ In recent days I've been working on a new side project.

Food 130

More Trending

How to Crush the Spider Benchmark with Ease on Databricks

databricks

How we reached 79.9% on the Spider dev dataset with Llama3 8B through savvy prompting and fine-tuning on Databricks.

Datasets 130

10 Free Must-Take Data Science Courses to Get Started

KDnuggets

Want to start your data science journey? Then, let these courses guide you on that trip.

Snowflake Invests in Metaplane for Deep, End-to-End Observability in the Data Cloud

Snowflake

According to Infosys, 35% of AI projects will either fail or experience delays because of poor data quality. There’s a huge gap between the data quality most companies have by default and the data quality needed for successful AI. And that gap is directly affecting the performance and reliability of AI systems everywhere. As organizations expand their use of Snowflake to build and deploy new AI-powered data applications, comprehensive data observability is critical to success.

Cloud 114

What’s New for Spatial Analyst in ArcGIS Pro 3.3

ArcGIS

Spatial Analyst in ArcGIS Pro 3.3 offers new capabilities for suitability modeling, as well as density, distance, solar, and zonal analysis.

111

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

Unlocking the Potential of Private Data Sharing using Databricks Private Exchanges

databricks

We are thrilled to announce an exciting new feature on the Databricks Marketplace that simplifies the process of setting up private exchanges for.

Data 119

The Easiest Way of Running Llama 3 Locally

KDnuggets

Download, install, and type one command in the terminal to start using Llama 3 on your laptop.

146

What Separates Hybrid Cloud and ‘True’ Hybrid Cloud?

Cloudera

Hybrid cloud plays a central role in many of today’s emerging innovations—most notably artificial intelligence (AI) and other emerging technologies that create new business value and improve operational efficiencies. But getting there requires data, and a lot of it. More than that, though, harnessing the potential of these technologies requires quality data—without it, the output from an AI implementation can end up inefficient or wholly inaccurate.

Cloud 104

Designing and testing for accessibility in GIS and mapping

ArcGIS

Review best practices for designing and testing accessible maps and apps throughout the ArcGIS system during the development process.

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

Building DBRX-class Custom LLMs with Mosaic AI Training

databricks

We recently introduced DBRX: an open, state-of-the-art, general-purpose LLM. DBRX was trained, fine-tuned, and evaluated using Mosaic AI Training, scaling training to.

Building 119

The Best Strategies for Fine-Tuning Large Language Models

KDnuggets

Learn how to master the art of fine-tuning LLMs for specialized tasks.

145

Six Clouderans Earn CRN Women of the Channel Distinction

Cloudera

Businesses today face unique challenges, whether it’s with hybrid cloud, AI, data analytics, or all of the above. Delivering solutions that can address those challenges effectively requires a robust ecosystem of partnerships. At the center of this critical ecosystem is the partner marketing team at Cloudera, who work tirelessly in pursuit of excellence for customers—and as a result, we’re proud to share that six of our very own Clouderans have been recognized by CRN as part of this year’s Women

Multiresolution Object Detection with Text SAM

ArcGIS

This blog post will walk you through the process of running multiresolution deep learning over a range of cell sizes.

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Databricks Is a Glassdoor Best-Led Company in 2024

databricks

Databricks is pleased to announce we are ranked #2 in the inaugural annual Glassdoor Award List of Best-Led Companies in 2024! At.

111

All About the AI Regulatory Landscape

KDnuggets

This post explores the evolving AI regulatory landscape and essential aspects of the EU AI Act, crucial for understanding its impact.

IT 144

Unapologetically Technical Episode 11 – Hubert Dulay

Jesse Anderson

In this episode of Unapologetically Technical, I interview Hubert Dulay, the author of Streaming Data Mesh and Developer Advocate at StarTree. We talked about his early experience with web backends like CORBA and SOAP and how those prepared him for data work. He shares his advice for those with web development skills to transition into data and what it’s like for a person leaving a company after a long tenure there.

IT 100

Text SAM: Extracting GIS Features Using Text Prompts

ArcGIS

Prompt the Segment Anything Model (SAM) with free-form text to extract features in your imagery.

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

Research Survey: Productivity benefits from Databricks Assistant

databricks

In the fast-paced landscape of data science and engineering, integrating Artificial Intelligence (AI) has become integral for enhancing productivity. We’ve seen many tools.

Pursue a Master’s in Data Science with the 3rd Best Online Program 2024

KDnuggets

100% online master’s program with flexible schedules designed for working professionals. Enrolling now for October 28th.

Scrum Master Resume: Tips, Samples, Skills Required

Knowledge Hut

A Scrum Master is responsible for facilitating the process within a team and ensuring that all team members adhere to the Scrum methodology. They are also responsible for removing any impediments to the team's progress and ensuring the team can deliver its sprint commitments. To apply for the role, you need an outstanding Scrum Master resume.

Tools for Building Community Climate Resilience

ArcGIS

Discover the latest climate resilience planning tools with 18 new ready-to-use layers. Explore the Climate Resilience Index layers in ArcGIS Living Atlas.

Building 103

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

How a Leading Venture Capital Firm is Building GenAI with Databricks

databricks

Successfully building GenAI applications means going beyond just leveraging the latest cutting-edge models. It requires the development of compound AI systems that integrate.

LSTMs Rise Again: Extended-LSTM Models Challenge the Transformer Superiority

KDnuggets

Can LSTMs become the de facto standard for Language Modeling tasks once again?

138

Release Train Engineer vs Scrum Master - Critical ART Roles

Knowledge Hut

You may have heard about release train engineers. Many people opt for this professional course and ask how a Release Train Engineer compares with a Scrum Master. If you wonder what these two roles are and how they differ from each other, we can help you. A scaling framework that can help your organization organize its goals and other determinants is called the Scaled Agile Framework.

5 Ways Advertising, Media and Entertainment Companies are Using Gen AI

Snowflake

The emergence of generative AI (gen AI) heralds a new, groundbreaking era for advertising, media and entertainment. According to a recent Snowflake report, Advertising, Media and Entertainment Data + AI Predictions 2024, gen AI is going to transform the industry, from content creation to customer experience. The companies that will come out ahead during this time are those that most successfully and quickly supercharge their data strategy.

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.