Sat.May 11, 2024 - Fri.May 17, 2024

article thumbnail

Why You Should Replace Pandas with Polars

Confessions of a Data Guy

I’m still amazed to this day how many folks hold onto stuff they love, they just can’t let it go. I get it, sorta, I’m the same way. There are reasons why people do the things they do, even if they are hard for us to understand. It blows my mind when I see something […] The post Why You Should Replace Pandas with Polars appeared first on Confessions of a Data Guy.

IT 147
article thumbnail

Release Management For Data Platform Services And Logic

Data Engineering Podcast

Summary Building a data platform is a substrantial engineering endeavor. Once it is running, the next challenge is figuring out how to address release management for all of the different component parts. The services and systems need to be kept up to date, but so does the code that controls their behavior. In this episode your host Tobias Macey reflects on his current challenges in this area and some of the factors that contribute to the complexity of the problem.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data News — Week 24.20

Christophe Blefari

Lights on ( credits ) Hello you. The sun is out, the days are getting longer and Data News is still here. Next week marks 3 years of this newsletter/blog (yay 🎉 ). It'll be a time for looking back, reflecting and celebrating, but next week. This week, we reached 5000 members. Yes, 5000 of you read my content periodically. Just thank you ❤️ In the recent days I've been working on a new side project.

Food 130
article thumbnail

5 Free University Courses to Learn Machine Learning

KDnuggets

Want to learn machine learning from the best of resources? Check out these free machine learning courses from the top universities of the world.

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Developing Production Level Databricks Pipelines.

Confessions of a Data Guy

A question that comes up often … “How do I develop Production Level Databricks Pipelines?” Or maybe someone just has a feeling that using Notebooks all day long is expensive and ends up being an unreliable way to produce Databricks Spark + Delta Lake pipelines that run well … without error. It isn’t really that […] The post Developing Production Level Databricks Pipelines. appeared first on Confessions of a Data Guy.

Data 130
article thumbnail

Unapologetically Technical Episode 11 – Hubert Dulay

Jesse Anderson

In this episode of Unapologetically Technical, I interview Hubert Dulay, the author of Streaming Data Mesh and Developer Advocate at StarTree. We talked about his early experience with web backends like CORBA and SOAP and how those prepared him for data work. He shares his advice for those with web development skills to transition into data and what it’s like for a person leaving a company after a long tenure there.

IT 100

More Trending

article thumbnail

Snowflake Invests in Metaplane for Deep, End-to-End Observability in the Data Cloud

Snowflake

According to Infosys, 35% of AI projects will either fail or experience delays because of poor data quality. There’s a huge gap between the data quality most companies have by default and the data quality needed for successful AI. And that gap is directly affecting the performance and reliability of AI systems everywhere. As organizations expand their use of Snowflake to build and deploy new AI-powered data applications, comprehensive data observability is critical to success.

Cloud 110
article thumbnail

What Separates Hybrid Cloud and ‘True’ Hybrid Cloud?

Cloudera

Hybrid cloud plays a central role in many of today’s emerging innovations—most notably artificial intelligence (AI) and other emerging technologies that create new business value and improve operational efficiencies. But getting there requires data, and a lot of it. More than that, though, harnessing the potential of these technologies requires quality data—without it, the output from an AI implementation can end up inefficient or wholly inaccurate.

Cloud 98
article thumbnail

Scrum Master Resume: Tips, Samples, Skills Required

Knowledge Hut

A Scrum master is responsible for facilitating the process within a team and ensuring that all team members adhere to the Scrum methodology. He is also responsible for removing any impediments to the team's progress and ensuring the team can deliver their sprint commitments. To be able to apply for the role, you need to have an outstanding Scrum Master resume.

article thumbnail

How to Crush the Spider Benchmark with Ease on Databricks

databricks

How we reached 79.9% on the Spider dev dataset with Llama3 8B through savvy prompting and fine-tuning on Databricks.

Datasets 127
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Towards Sustainable Data Engineering Patterns

Towards Data Science

Engineers, scientists, and analysts have the potential to greatly reduce carbon emissions by introducing sustainable, efficient, and… Continue reading on Towards Data Science »

article thumbnail

Six Clouderans Earn CRN Women of the Channel Distinction

Cloudera

Businesses today face unique challenges, whether it’s with hybrid cloud, AI, data analytics, or all of the above. Delivering solutions that can address those challenges effectively requires a robust ecosystem of partnerships. At the center of this critical ecosystem is the partner marketing team at Cloudera, who work tirelessly in pursuit of excellence for customers—and as a result, we’re proud to share that six of our very own Clouderans have been recognized by CRN as part of this year’s Women

article thumbnail

Release Train Engineer vs Scrum Master - Critical ART Roles

Knowledge Hut

We may have heard about release train engineers- many people opt for this professional course and ask about the Release Train Engineer vs. Scrum Master comparison. If you wonder what these two are and how they differ from each other, we can help you. A scaling framework that can help your organization organize the goals and other determinants is called a Scaled Agile Framework.

article thumbnail

Building DBRX-class Custom LLMs with Mosaic AI Training

databricks

We recently introduced DBRX : an open, state-of-the-art, general-purpose LLM. DBRX was trained, fine-tuned, and evaluated using Mosaic AI Training, scaling training to.

Building 107
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

5 Ways Advertising, Media and Entertainment Companies are Using Gen AI

Snowflake

The emergence of generative AI (gen AI) heralds a new, groundbreaking era for advertising, media and entertainment. According to a recent Snowflake report, Advertising, Media and Entertainment Data + AI Predictions 2024 , gen AI is going to transform the industry — from content creation to customer experience. The companies that will come out ahead during this time are those that most successfully and quickly supercharge their data strategy.

article thumbnail

What’s New for Spatial Analyst in ArcGIS Pro 3.3

ArcGIS

Spatial Analyst in ArcGIS Pro 3.3 offers new capabilities for suitability modeling, as well as density, distance, solar, and zonal analysis.

108
108
article thumbnail

How To Install and Setup React Native on Mac

Knowledge Hut

With the rapid growth of online websites, businesses, and the general ecosystem, it is crucial that website UIs load quickly on smartphones to encourage smartphone-based internet consumption. Facebook developed React Native from a need to generate UI elements efficiently, which formed the basis for creating the open-source web framework. Its native cross-platform capabilities allow usage for a wide range of platforms for application development, including Android, Web, Windows, UWP, tvOS, macOS,

Java 98
article thumbnail

How a Leading Venture Capital Firm is Building GenAI with Databricks

databricks

Successfully building GenAI applications means going beyond just leveraging the latest cutting-edge models. It requires the development of compound AI systems that integrate.

article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

article thumbnail

All About the AI Regulatory Landscape

KDnuggets

This post explores the evolving AI regulatory landscape and essential aspects of the EU Act law, crucial for understanding its impact.

IT 114
article thumbnail

Designing and testing for accessibility in GIS and mapping

ArcGIS

Review best practices for designing and testing for accessibility maps and apps throughout the ArcGIS system during the development process.

article thumbnail

Snowflake Advanced Certifications: Level Up to SnowPro Advanced and Show Off Your Snowflake Expertise

Snowflake

Did you know that Snowflake has five advanced role-based certifications to help you stand out in the data community as a Snowflake expert? The Snowflake Advanced Certification Series (Architect, Data Engineer, Data Scientist, Administrator, Data Analyst) offers role-based certifications designed for Snowflake practitioners with one to two years of experience (depending on the program).

article thumbnail

Unlocking the Potential of Private Data Sharing using Databricks Private Exchanges

databricks

We are thrilled to announce an exciting new feature on the Databricks Marketplace that simplifies the process of setting up private exchanges for.

Data 109
article thumbnail

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

article thumbnail

HBase Deprecation at Pinterest

Pinterest Engineering

Alberto Ordonez Pereira | Senior Staff Software Engineer; Lianghong Xu | Senior Manager, Engineering; This blog marks the first of a three-part series describing our journey at Pinterest transition from managing multiple online storage services supported by HBase to a brand new serving architecture with a new datastore and a unified storage service.

NoSQL 86
article thumbnail

Contributing to Apache Kafka®: How to Write a KIP

Confluent

Learn how to contribute to open source Apache Kafka by writing Kafka Improvement Proposals (KIPs) that solve problems and add features! Read on for real examples.

Kafka 88
article thumbnail

Designing and testing for accessibility in GIS and mapping

ArcGIS

Review best practices for designing and testing for accessibility maps and apps throughout the ArcGIS system during the development process.

article thumbnail

Research Survey: Productivity benefits from Databricks Assistant

databricks

In the fast-paced landscape of data science and engineering, integrating Artificial Intelligence (AI) has become integral for enhancing productivity. We’ve seen many tools.

article thumbnail

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

article thumbnail

Pursue a Master’s in Data Science with the 3rd Best Online Program 2024

KDnuggets

100% online master’s program with flexible schedules designed for working professionals. Enrolling now for October 28th.

article thumbnail

Best Practices for Technical Columns in Database Design

Towards Data Science

When architecting a transactional database or a data warehouse, it’s important not to forget about various types of technical columns… Continue reading on Towards Data Science »

article thumbnail

Tools for Building Community Climate Resilience

ArcGIS

Discover the latest climate resilience planning tools with 18 new ready-to-use layers. Explore the Climate Resilience Index layers in ArcGIS Living Atlas.

article thumbnail

Semiconductors on the Data Intelligence Platform

databricks

In the semiconductor industry, research and development tasks, manufacturing processes, and enterprise planning systems produce an array of data artifacts that can be fused to create an intelligent semiconductor enterprise. Through intelligent data use, an intelligent semiconductor enterprise accelerates time to market, increases manufacturing yield, and enhances product reliability.

article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.