Sat.Jul 27, 2024 - Fri.Aug 02, 2024

article thumbnail

How To Run A Data Team As A New Head Of Data

Seattle Data Guy

What would you do if you became the head or director of data for a 1,000-person company? Yesterday, you were plugging along as an analyst, and now, suddenly, you have all these new responsibilities. Figuring out where to start is part of the job. You’d probably feel a strong temptation to freak out. Who wouldn’t?… Read more The post How To Run A Data Team As A New Head Of Data appeared first on Seattle Data Guy.

Data 130
article thumbnail

Data+AI Summit 2024 - Retrospective - Apache Spark

Waitingforcode

Welcome to the second blog post dedicated to the previous Data+AI Summit. This time I'm going to share with you a summary of Apache Spark talks.

Data 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

5 Tips for Improving SQL Query Performance

KDnuggets

If you work in data, you’ll write SQL queries all the time. So how do you write efficient SQL queries that are optimized for performance? This tutorial will help you with just that.

SQL 136
article thumbnail

Introducing Apache Kafka® 3.8

Confluent

Apache Kafka 3.8 adds 17 new KIPs (13 for Core, 3 for Streams & 1 for Connect). Highlights include 2 new Docker images, the ability to set task assignors, and more!

Kafka 130
article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.

article thumbnail

Ingest data from SQL Server, Salesforce, and Workday with LakeFlow Connect

databricks

We’re excited to announce the Public Preview of LakeFlow Connect for SQL Server, Salesforce, and Workday. These ingestion connectors enable simple and efficient.

SQL 132
article thumbnail

Snowflake Invests in Contextual AI to Make It Easier for Enterprises to Deploy RAG Applications in the AI Data Cloud

Snowflake

Retrieval Augmented Generation (RAG) allows enterprises to ground responses from Large Language Models in their specific organization’s data. This helps ensure that AI-powered applications provide responses that are not only accurate, relevant, and consistent, but also aligned with business needs. At Snowflake, we make it simple for our customers to implement RAG, while also enabling the strict governance and privacy controls that businesses require.

Cloud 116

More Trending

article thumbnail

Data Engineering Weekly #182

Data Engineering Weekly

Meta: Introducing Llama 3.1: Our most capable models to date Probability one of the hottest announcements this week is Llama 3.1 release - the first-ever open-sourced frontier AI model competitive with leading foundation models across a range of tasks, including GPT-4, GPT-4o, and Claude 3.5 Sonnet. The Llama3 herd of models is an insightful paper that helps one deeply understand the foundational model.

article thumbnail

Announcing General Availability of Lakehouse Federation

databricks

Today, we are excited to announce that Lakehouse Federation in Unity Catalog is now Generally Available (GA) across AWS, Azure, and GCP! Lakehouse.

AWS 123
article thumbnail

Daft: Distributed Dataframes with Python.

Confessions of a Data Guy

The post Daft: Distributed Dataframes with Python. appeared first on Confessions of a Data Guy.

Python 100
article thumbnail

7 Steps to Master the Art of Data Storytelling

KDnuggets

Follow this 7 step recipe to mastering effective insight and information dissemination through compelling data story crafting.

Data 132
article thumbnail

Launching LLM-Based Products: From Concept to Cash in 90 Days

Speaker: Christophe Louvion, Chief Product & Technology Officer of NRC Health and Tony Karrer, CTO at Aggregage

Christophe Louvion, Chief Product & Technology Officer of NRC Health, is here to take us through how he guided his company's recent experience of getting from concept to launch and sales of products within 90 days. In this exclusive webinar, Christophe will cover key aspects of his journey, including: LLM Development & Quick Wins 🤖 Understand how LLMs differ from traditional software, identifying opportunities for rapid development and deployment.

article thumbnail

An Overview of Cloudera’s AI Survey: The State of Enterprise AI and Modern Data Architecture

Cloudera

Enterprise IT leaders across industries are tasked with preparing their organizations for the technologies of the future – which is no simple task. With the use of AI exploding, Cloudera, in partnership with Researchscape, surveyed 600 IT leaders who work at companies with over 1,000 employees in the U.S., EMEA and APAC regions. The survey, ‘ The State of Enterprise AI and Modern Data Architecture ’ uncovered the challenges and barriers that exist with AI adoption, current enterprise AI deployme

article thumbnail

Accelerate Feature Engineering With Photon

databricks

Training a high-quality machine learning model requires careful data and feature preparation. To fully utilize raw data stored as tables in Databricks, running.

article thumbnail

Snowflake is Dying??!! Data Breach!!

Confessions of a Data Guy

The post Snowflake is Dying??!! Data Breach!! appeared first on Confessions of a Data Guy.

Data 100
article thumbnail

Organize, Search, and Back Up Files with Python’s Pathlib

KDnuggets

This tutorial will teach you how to simplifying your file management tasks, from organization to backup, using Python’s pathlib module.

article thumbnail

How To Speak The Language Of Financial Success In Product Management

Speaker: Jamie Bernard

Success in product management goes beyond delivering great features - it’s about achieving measurable financial outcomes that resonate across the organization. By connecting your product’s journey with the company’s financial success, you’ll ensure that every feature, release, and innovation contributes to the bottom line, driving both customer satisfaction and business growth.

article thumbnail

How to make a “peeled edge” area of interest effect in ArcGIS Pro

ArcGIS

Catch eyes and imaginations with this fun technique that draws attention to your area of interest with a bit of style!

99
article thumbnail

Responsible AI with the Databricks Data Intelligence Platform

databricks

The transformative potential of artificial intelligence (AI) is undeniable. From productivity efficiency, to cost savings, and improved decision-making across all industries, AI is.

Data 111
article thumbnail

CI/CD for Data Engineers.

Confessions of a Data Guy

The post CI/CD for Data Engineers. appeared first on Confessions of a Data Guy.

article thumbnail

How to Perform Memory-Efficient Operations on Large Datasets with Pandas

KDnuggets

Let's learn how to perform memory-efficient operations in pandas with large dataset.

Datasets 128
article thumbnail

What Is Entity Resolution? How It Works & Why It Matters

Entity Resolution Sometimes referred to as data matching or fuzzy matching, entity resolution, is critical for data quality, analytics, graph visualization and AI. Learn what entity resolution is, why it matters, how it works and its benefits. Advanced entity resolution using AI is crucial because it efficiently and easily solves many of today’s data quality and analytics problems.

article thumbnail

How BT Group Built a Smart Event Mesh with Confluent

Confluent

BT Group's Smart Event Mesh - centralized event streaming with decentralized customer experience, automation, and a foundation—all built on Confluent.

82
article thumbnail

OKR-Centric Delivery Models for Engineering-Focused Enterprises

databricks

Introduction An organization adopting new technologies or on a modernization journey typically focuses on upcoming tools, their features and potential performance/cost improvements under.

article thumbnail

Introducing the Trusted Data for AI Advisory Council

Monte Carlo

Last month, Monte Carlo released its State of Reliable AI Survey. In that survey, we found that 2 out of 3 data leaders doubted the AI-readiness of their data. Which begs the question—what does ‘AI-ready data’ actually mean? Gartner defines AI-ready data as “the ability to prove the fitness of data for AI models and use cases, which requires rethinking data management.

Data 59
article thumbnail

6 ChatGPT Prompts to Enhance your Productivity at Work

KDnuggets

Unlock your potential with these crafted 6 ChatGPT prompts designed to boost your productivity and streamline your operation workflows.

article thumbnail

Provide Real Value in Your Applications with Data and Analytics

The complexity of financial data, the need for real-time insight, and the demand for user-friendly visualizations can seem daunting when it comes to analytics - but there is an easier way. With Logi Symphony, we aim to turn these challenges into opportunities. Our platform empowers you to seamlessly integrate advanced data analytics, generative AI, data visualization, and pixel-perfect reporting into your applications, transforming raw data into actionable insights.

article thumbnail

Flink AI: Real-Time ML and GenAI Enrichment of Streaming Data with Flink SQL on Confluent Cloud

Confluent

Learn how to use Flink SQL on Confluent Cloud to invoke ML and GenAI endpoints to enrich streaming data

SQL 80
article thumbnail

Lakehouse Monitoring GA: Profiling, Diagnosing, and Enforcing Data Quality with Intelligence

databricks

At Data and AI Summit, we announced the general availability of Databricks Lakehouse Monitoring. Our unified approach to monitoring data and AI.

Data 103
article thumbnail

Beyond Web Mercator: Projected Basemaps Revisited

ArcGIS

More small-scale projected basemaps to add to the set I built in 2023

Project 92
article thumbnail

MarshMallow: The Sweetest Python Library for Data Serialization and Validation

KDnuggets

Stop debugging data mismatches and focus on your application logic when you let Marshmallow handle serialization, deserialization and validation for you.

Python 82
article thumbnail

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. There's no question that it is challenging to figure out where to focus and how to advance when it’s a new field that is evolving everyday. 💡 This new webinar featuring Maher Hanafi, VP of Engineering at Betterworks, will explore a practical framework to transform Generative AI pr

article thumbnail

Deploying dbt Projects at Scale on Google Cloud

Towards Data Science

Containerising and running dbt projects with Artifact Registry, Cloud Composer, GitHub Actions and dbt-airflow Continue reading on Towards Data Science »

article thumbnail

Generative AI for Capital Markets

databricks

Financial Valuations & Comparative Analysis Financial institutions specialized in capital markets such as hedge funds, market makers and pension funds have long been.

84
article thumbnail

New with Confluent Platform: Enhanced security with OAuth Support, Confluent Platform for Apache Flink® (LA), a new Connector, and More

Confluent

Confluent Platform 7.

116
116
article thumbnail

5 Free Online Courses to Learn Data Engineering Fundamentals

KDnuggets

Kickstart a new career in one of the most popular tech careers where you can earn a 6 figure salary.

article thumbnail

The AI Superhero Approach to Product Management

Speaker: Conrado Morlan

In this engaging and witty talk, industry expert Conrado Morlan will explore how artificial intelligence can transform the daily tasks of product managers into streamlined, efficient processes. Using the lens of a superhero narrative, he’ll uncover how AI can be the ultimate sidekick, aiding in data management and reporting, enhancing productivity, and boosting innovation.