Sat.Aug 10, 2024 - Fri.Aug 16, 2024

Data Engineering Interview Series #1: Data Structures and Algorithms

Start Data Engineering

1. Introduction 2. Data structures and algorithms to know 2.1. List 2.2. Dictionary 2.3. Queue 2.4. Stack 2.5. Set 2.6. Counter (from collections module) 2.7. Heap 2.8. Graph search 2.8.1. Depth First Search (DFS) 2.8.2. Breadth First Search (BFS) 2.9. Binary Search 3. Common DSA questions asked during DE interviews 3.1. Intervals 3.

Algorithm 200
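
As a rough illustration of two entries in the outline above (breadth-first search and binary search), here is a minimal sketch. It is not taken from the article: the function names and the sample dependency graph are hypothetical, and the binary search leans on the standard library's bisect module rather than a hand-written loop.

```python
from bisect import bisect_left
from collections import deque


def bfs_order(graph, start):
    """Return nodes in the order breadth-first search visits them."""
    visited = {start}
    queue = deque([start])  # FIFO queue drives the level-by-level traversal
    order = []
    while queue:
        node = queue.popleft()
        order.append(node)
        for neighbor in graph.get(node, []):
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(neighbor)
    return order


def binary_search(sorted_items, target):
    """Return the index of target in sorted_items, or -1 if it is absent."""
    i = bisect_left(sorted_items, target)
    if i < len(sorted_items) and sorted_items[i] == target:
        return i
    return -1


# Hypothetical task dependency graph (adjacency lists), for illustration only
graph = {
    "extract": ["clean", "validate"],
    "clean": ["load"],
    "validate": ["load"],
    "load": [],
}

print(bfs_order(graph, "extract"))        # ['extract', 'clean', 'validate', 'load']
print(binary_search([1, 3, 5, 7, 9], 7))  # 3
```

Swapping the deque's popleft() for pop() would turn the same loop into an iterative depth-first traversal, which is one reason the two searches are often studied together.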

Speakers for Amsterdam / Netherlands Tech Events

The Pragmatic Engineer

I (Gergely) sometimes get reachouts to do talks at events in Amsterdam (where I am based), the Netherlands, or somewhere in Europe. Unfortunately, I rarely do talks – I do one conference per year. However, I asked around in the community about tech professionals who do paid talks that software engineers find interesting, engaging, and educational.

Beginner’s Guide to Careers in AI and Machine Learning

KDnuggets

The growing complexity of AI and ML is creating a larger and more diverse set of jobs that require AI & ML expertise. We’ll give you a rundown of these jobs, the technical skills they need, and the tools they employ.

Long Context RAG Performance of LLMs

databricks

Retrieval Augmented Generation (RAG) is the most widely adopted generative AI use case among our customers. RAG enhances the accuracy of LLMs by.

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.

Unapologetically Technical Episode 13 – Jeff Chou

Jesse Anderson

Unapologetically Technical’s newest episode is now live! In this episode of Unapologetically Technical, I interview Jeff Chou, CEO and co-founder of Sync Computing. Jeff, who holds a PhD from UC Berkeley and a postdoc from MIT, shares his unique journey from academia to startup life, and how his experience with simulations shaped the vision for Sync Computing.

More Trending

Databricks SQL Serverless is now available on Google Cloud Platform

databricks

Databricks SQL Serverless is now Generally Available on Google Cloud Platform (GCP)! SQL Serverless is available in 7 GCP regions and 40+ regions across AWS, Azure and GCP.

A Melange of Maps

ArcGIS

Different thematic map types are better at supporting some questions than others. Here are a range of alternative approaches.

Designing 121

Navigating the Future with Cloudera’s Updated Interface

Cloudera

Data practitioners are consistently asked to deliver more with less, and although most executives recognize the value of innovating with data, the reality is that most data teams spend the majority of their time responding to support tickets for data access, performance and troubleshooting, and other mundane activities. At the heart of this backlog of requests is this: data is hard to work with, and it’s made even harder when users need to work to get or find what they need.

How (Not) To Use Python’s Walrus Operator

KDnuggets

The walrus operator, introduced in Python 3.8, enables assignment within expressions, but requires careful use to maintain readability. This tutorial will teach you how.

Python 119
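
To make "assignment within expressions" concrete, here is a minimal sketch of two places where the walrus operator (`:=`) tends to read well. It is not taken from the tutorial: the log lines, the pattern, and the normalize helper are hypothetical.

```python
import re

# Hypothetical log lines, for illustration only
lines = ["job=load status=ok", "heartbeat", "job=clean status=failed"]
pattern = re.compile(r"status=(\w+)")

# Bind the match object and test it in a single expression,
# instead of assigning on one line and checking on the next.
statuses = []
for line in lines:
    if (match := pattern.search(line)) is not None:
        statuses.append(match.group(1))


def normalize(value):
    """Hypothetical helper: trim whitespace and lowercase."""
    return value.strip().lower()


# Reuse a computed value inside a comprehension without calling
# normalize() twice (once for the filter, once for the result).
raw = [" OK ", "", "  ", "Failed"]
cleaned = [v for item in raw if (v := normalize(item))]

print(statuses)  # ['ok', 'failed']
print(cleaned)   # ['ok', 'failed']
```

The readability risk comes from stacking several `:=` assignments into one long condition. In that situation a plain assignment statement on its own line is usually easier to follow.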

Launching LLM-Based Products: From Concept to Cash in 90 Days

Speaker: Christophe Louvion, Chief Product & Technology Officer of NRC Health and Tony Karrer, CTO at Aggregage

Christophe Louvion, Chief Product & Technology Officer of NRC Health, is here to take us through how he guided his company's recent experience of getting from concept to launch and sales of products within 90 days. In this exclusive webinar, Christophe will cover key aspects of his journey, including: LLM Development & Quick Wins 🤖 Understand how LLMs differ from traditional software, identifying opportunities for rapid development and deployment.

An Introduction to Time Series Forecasting with Generative AI

databricks

Time series forecasting has been a cornerstone of enterprise resource planning for decades. Predictions.

Retail 120

How to add 2D features to a 3D scene

ArcGIS

With the growing popularity of 3D GIS, users are shifting from 2D to 3D. What is the proper method to move pre-existing 2D data onto a 3D scene?

Data 103

Current 2024: What’s on Tap in Data Streaming

Confluent

Current 2024 brings 100+ sessions, keynotes, lightning talks, and more from industry leaders. Check out the agenda, highlights, networking events, and more event info.

Data 73

Tools Every AI Engineer Should Know: A Practical Guide

KDnuggets

Explore essential tools and skills for AI engineers: Python, R, big data frameworks, and cloud services essential for building and optimizing AI systems.

How To Speak The Language Of Financial Success In Product Management

Speaker: Jamie Bernard

Success in product management goes beyond delivering great features - it’s about achieving measurable financial outcomes that resonate across the organization. By connecting your product’s journey with the company’s financial success, you’ll ensure that every feature, release, and innovation contributes to the bottom line, driving both customer satisfaction and business growth.

Building a robust data stewardship tool in life sciences

databricks

This blog was written in collaboration with Gordon Strodel, Director, Data Strategy & Analytics Capability, in addition to Abhinav Batra, Associate Principal, Enterprise.

Building 113

Calculate the travel time or distance between paired origins and destinations

ArcGIS

Use the ArcGIS Network Analyst route solver and out-of-the-box tools to calculate travel time and distance between origin-destination pairs.

Data Engineering Weekly #184

Data Engineering Weekly

Try Fully Managed Apache Airflow for FREE. Run Airflow without the hassle and management complexity. Take Astro (the fully managed Airflow solution) for a test drive today and unlock a suite of features designed to simplify, optimize, and scale your data pipelines. For a limited time, new sign-ups will receive a complimentary Airflow Fundamentals Certification exam (normally $150).

Top 5 Free Resources for Learning Advanced SQL Techniques

KDnuggets

Today, we’re looking at five quality resources that will teach you advanced SQL and do it for free.

SQL 141

What Is Entity Resolution? How It Works & Why It Matters

Sometimes referred to as data matching or fuzzy matching, entity resolution is critical for data quality, analytics, graph visualization and AI. Learn what entity resolution is, why it matters, how it works and its benefits. Advanced entity resolution using AI is crucial because it efficiently and easily solves many of today’s data quality and analytics problems.

Supernovas, Black Holes and Streaming Data

databricks

The blog explores data streams from NASA satellites using Apache Kafka and Databricks. It demonstrates ingestion and transformation with Delta Live Tables in SQL and AI/BI-powered analysis of supernova events.

BI 110

Make a vintage basemap in ArcGIS Pro with some Living Atlas shenanigans

ArcGIS

How to combine Living Atlas layers into a plausibly 1890s-style ArcGIS Pro basemap. And thoughts on time travel.

Migrating Kafka to the Cloud: How Skai Went From 90K to 1.8K Topics

Confluent

Learn how Skai, an omnichannel advertising platform, revamped its architecture for effectively migrating Kafka to the cloud with Confluent.

Kafka 64

5 Tools for Automating Data Cleaning Processes

KDnuggets

Struggling with time-consuming data cleaning tasks? Discover five tools that can automate and simplify the process.

Process 131

Provide Real Value in Your Applications with Data and Analytics

The complexity of financial data, the need for real-time insight, and the demand for user-friendly visualizations can seem daunting when it comes to analytics - but there is an easier way. With Logi Symphony, we aim to turn these challenges into opportunities. Our platform empowers you to seamlessly integrate advanced data analytics, generative AI, data visualization, and pixel-perfect reporting into your applications, transforming raw data into actionable insights.

Databricks University Alliance Crosses 1,000 University Threshold

databricks

Databricks is thrilled to share that our University Alliance has welcomed its one-thousandth member school! This milestone is a testament to our mission to.

IT 117

Fivetran vs Stitch: Key Comparisons

Hevo

Data integration is central to making informed business decisions for any organization in this data-driven world. ETL tools are central to this since they enable organizations to manage their data from different sources effectively and integrate it efficiently.

4 Strategies for Media Publishers to Optimize Content with Gen AI

Snowflake

In today's fast-paced world of media publishing, keeping up with technological advancements and changing consumer preferences is no easy task. Tight budgets, fierce competition and evolving audience behaviors add to the pressure, creating what's often termed the "content crash" — a saturation of content that makes it hard for publishers to stand out.

Media 52

How to Deal with Missing Data Using Interpolation Techniques in Pandas

KDnuggets

Stop data from dropping out - learn how to handle missing data like a pro using interpolation techniques in Pandas.

Data 117

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. There's no question that it is challenging to figure out where to focus and how to advance when it’s a new field that is evolving every day. 💡 This new webinar featuring Maher Hanafi, VP of Engineering at Betterworks, will explore a practical framework to transform Generative AI pr

Beyond the Leaderboard: Unpacking Function Calling Evaluation

databricks

1. Introduction The research and engineering community at large have been continuously iterating upon Large Language Models (LLMs) in order to make them.

Top-10 Open Source Data Orchestration Tools

Hevo

This blog explores the world of open source data orchestration tools, highlighting their importance in managing and automating complex data workflows. From Apache Airflow to Google Cloud Composer, we’ll walk you through ten powerful tools to streamline your data processes, enhance efficiency, and scale with your growing needs.

5 Easy Data Cleaning Techniques That Turn Garbage Into Gold

Monte Carlo

I’m sure you’ve heard the saying “garbage in, garbage out” when it comes to data. But what if we could actually turn that garbage into something useful? In this article, we’ll look into data cleaning techniques to clean up messy data using some SQL magic. Table of Contents What Data Cleaning Techniques Involve Handling Missing Data Techniques to Handle Missing Data Removing Duplicates Correcting Inconsistencies Techniques to Correct Inconsistencies Standardizing For

10 Python Statistical Functions

KDnuggets

This guide will go over 10 essential statistical functions in Python using commonly-used libraries.

Python 122
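
The teaser does not say which libraries or functions the guide covers. As a small sketch of the general idea, the example below uses only Python's built-in statistics module, which is an assumption made here so the code runs without third-party packages. The sample values are hypothetical.

```python
import statistics

# Hypothetical sample of request latencies in milliseconds
latencies_ms = [120, 135, 150, 150, 900]

mean = statistics.mean(latencies_ms)      # arithmetic average, pulled up by the outlier
median = statistics.median(latencies_ms)  # middle value, robust to the outlier
mode = statistics.mode(latencies_ms)      # most frequent value
stdev = statistics.stdev(latencies_ms)    # sample standard deviation

print(mean, median, mode)  # 291 150 150
print(round(stdev, 1))
```

Comparing the mean with the median is a quick way to spot skew: here a single 900 ms value moves the mean well above the median.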

The AI Superhero Approach to Product Management

Speaker: Conrado Morlan

In this engaging and witty talk, industry expert Conrado Morlan will explore how artificial intelligence can transform the daily tasks of product managers into streamlined, efficient processes. Using the lens of a superhero narrative, he’ll uncover how AI can be the ultimate sidekick, aiding in data management and reporting, enhancing productivity, and boosting innovation.