Sat.Aug 10, 2024 - Fri.Aug 16, 2024

Data Engineering Interview Series #1: Data Structures and Algorithms

Start Data Engineering

1. Introduction 2. Data structures and algorithms to know 2.1. List 2.2. Dictionary 2.3. Queue 2.4. Stack 2.5. Set 2.6. Counter (from collections module) 2.7. Heap 2.8. Graph search 2.8.1. Depth First Search (DFS) 2.8.2. Breadth First Search (BFS) 2.9. Binary Search 3. Common DSA questions asked during DE interviews 3.1. Intervals 3.

Algorithm 201

Speakers for Amsterdam / Netherlands Tech Events

The Pragmatic Engineer

I (Gergely) sometimes get reachouts to do talks at events in Amsterdam (where I am based), the Netherlands, or somewhere in Europe. Unfortunately, I rarely do talks – I do one conference per year. However, I asked around in the community about tech professionals who do paid talks that software engineers find interesting, engaging, and educational.

Beginner’s Guide to Careers in AI and Machine Learning

KDnuggets

The complexity of AI and ML is driving a growing number and diversity of jobs that require AI & ML expertise. We’ll give you a rundown of these jobs, covering the technical skills they need and the tools they employ.

Long Context RAG Performance of LLMs

databricks

Retrieval Augmented Generation (RAG) is the most widely adopted generative AI use case among our customers. RAG enhances the accuracy of LLMs by.

142

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Unapologetically Technical Episode 13 – Jeff Chou

Jesse Anderson

Unapologetically Technical’s newest episode is now live! In this episode of Unapologetically Technical, I interview Jeff Chou, CEO and co-founder of Sync Computing. Jeff, who holds a PhD from UC Berkeley and a postdoc from MIT, shares his unique journey from academia to startup life, and how his experience with simulations shaped the vision for Sync Computing.

More Trending

Databricks SQL Serverless is now available on Google Cloud Platform

databricks

Databricks SQL Serverless is now Generally Available on Google Cloud Platform (GCP)! SQL Serverless is available in 7 GCP regions and 40+ regions across AWS, Azure and GCP.

A Melange of Maps

ArcGIS

Different thematic map types are better at supporting some questions than others. Here are a range of alternative approaches.

Designing 135

How Meta animates AI-generated images at scale

Engineering at Meta

We launched Meta AI with the goal of giving people new ways to be more productive and unlock their creativity with generative AI (GenAI). But GenAI also comes with challenges of scale. As we deploy new GenAI technologies at Meta, we also focus on delivering these services to people as quickly and efficiently as possible. Meta AI’s animate feature, which lets people generate a short animation of a generated image, carried unique challenges in this regard.

Media 86

How (Not) To Use Python’s Walrus Operator

KDnuggets

The walrus operator, introduced in Python 3.8, enables assignment within expressions but requires careful use to maintain readability. This tutorial shows how to use it well.

Python 99
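
As a rough illustration of what assignment within an expression looks like, here is a minimal sketch. The helper names `first_number` and `read_chunks` are hypothetical and are not taken from the linked tutorial; they only show two places where `:=` tends to stay readable.

```python
import itertools
import re
from typing import Iterable, List, Optional


def first_number(text: str) -> Optional[int]:
    """Return the first run of digits in text as an int, or None."""
    # Bind the match object and test it in the same expression.
    if (match := re.search(r"\d+", text)) is not None:
        return int(match.group())
    return None


def read_chunks(values: Iterable[int], size: int) -> List[List[int]]:
    """Split an iterable into lists of at most `size` items."""
    iterator = iter(values)
    chunks = []
    # The loop condition both fetches the next chunk and checks it is non-empty.
    while chunk := list(itertools.islice(iterator, size)):
        chunks.append(chunk)
    return chunks
```

Both uses replace a separate "assign, then test" step. Nesting `:=` inside longer expressions is where readability usually starts to suffer.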

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

An Introduction to Time Series Forecasting with Generative AI

databricks

Time series forecasting has been a cornerstone of enterprise resource planning for decades. Predictions.

Retail 120

Make a vintage basemap in ArcGIS Pro with some Living Atlas shenanigans

ArcGIS

How to combine Living Atlas layers into a plausibly 1890s-style ArcGIS Pro basemap. And thoughts on time travel.

127

DoorDash Empowers Engineers with Kafka Self-Serve

DoorDash Engineering

DoorDash is supporting an increasingly diverse array of infrastructure use cases as the company matures. To maintain our development velocity and meet growing demands, we are transitioning toward making our stateful storage offerings more self-serve. This journey began with Kafka, one of our most critical and widely used infrastructure components. Kafka is a distributed event streaming platform that DoorDash uses to handle billions of real-time events.

Kafka 82

Tools Every AI Engineer Should Know: A Practical Guide

KDnuggets

Explore essential tools and skills for AI engineers: Python, R, big data frameworks, and cloud services for building and optimizing AI systems.

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

Building a robust data stewardship tool in life sciences

databricks

This blog was written in collaboration with Gordon Strodel, Director, Data Strategy & Analytics Capability, in addition to Abhinav Batra, Associate Principal, Enterprise.

Building 113

How to add 2D features to a 3D scene

ArcGIS

With the growing popularity of 3D GIS, users are shifting from 2D to 3D. What is the proper method to move pre-existing 2D data onto a 3D scene?

Data 108

Navigating the Future with Cloudera’s Updated Interface

Cloudera

Data practitioners are consistently asked to deliver more with less, and although most executives recognize the value of innovating with data, the reality is that most data teams spend the majority of their time responding to support tickets for data access, performance and troubleshooting, and other mundane activities. At the heart of this backlog of requests is this: data is hard to work with, and it’s made even harder when users need to work to get or find what they need.

Top 5 Free Resources for Learning Advanced SQL Techniques

KDnuggets

Today, we’re looking for five quality resources that will teach you advanced SQL and do it for free.

SQL 125

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

Databricks University Alliance Crosses 1,000 University Threshold

databricks

Databricks is thrilled to share that our University Alliance has welcomed its one-thousandth member school! This milestone is a testament to our mission to.

IT 117

Calculate the travel time or distance between paired origins and destinations

ArcGIS

Use the ArcGIS Network Analyst route solver and out-of-the-box tools to calculate travel time and distance between origin-destination pairs.

Current 2024: What’s on Tap in Data Streaming

Confluent

Current 2024 brings 100+ sessions, keynotes, lightning talks, and more from industry leaders. Check out the agenda, highlights, networking events, and more event info.

Data 78

5 Tools for Automating Data Cleaning Processes

KDnuggets

Struggling with time-consuming data cleaning tasks? Discover five tools that can automate and simplify the process.

Process 117

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

Supernovas, Black Holes and Streaming Data

databricks

The blog explores data streams from NASA satellites using Apache Kafka and Databricks. It demonstrates ingestion and transformation with Delta Live Tables in SQL and AI/BI-powered analysis of supernova events.

BI 109

AI-Powered Digital Transformation: Get Your Data and AI Ready

Precisely

Key Takeaways: Leverage AI to achieve digital transformation goals: enhanced efficiency, decision-making, customer experiences, and more. Address common challenges in managing SAP master data by using AI tools to automate SAP processes and ensure data quality. Create an AI-driven data and process improvement loop to continuously enhance your business operations.

Migrating Kafka to the Cloud: How Skai Went From 90K to 1.8K Topics

Confluent

Learn how Skai, an omnichannel advertising platform, revamped its architecture for effectively migrating Kafka to the cloud with Confluent.

Kafka 69

How to Deal with Missing Data Using Interpolation Techniques in Pandas

KDnuggets

Stop data from dropping out - learn how to handle missing data like a pro using interpolation techniques in Pandas.

Data 96
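
To give a sense of what interpolation does to gaps, here is a minimal sketch using `Series.interpolate` from pandas. The sample values are made up for illustration and are not from the linked article; linear interpolation is only one of the methods pandas offers.

```python
import numpy as np
import pandas as pd

# Hypothetical sensor readings with three missing values.
readings = pd.Series([1.0, np.nan, 3.0, np.nan, np.nan, 6.0])

# Linear interpolation fills each gap along the straight line
# between the nearest known neighbours on either side.
filled = readings.interpolate(method="linear")

print(filled.tolist())
```

Because the known points here lie on a straight line, the filled values fall on that same line. Real data rarely behaves this neatly, so it is worth checking whether a linear assumption is reasonable before filling.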

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

Beyond the Leaderboard: Unpacking Function Calling Evaluation

databricks

Introduction: The research and engineering community at large has been continuously iterating upon Large Language Models (LLMs) in order to make them.

Data Engineering Weekly #184

Data Engineering Weekly

Try Fully Managed Apache Airflow for FREE: Run Airflow without the hassle and management complexity. Take Astro (the fully managed Airflow solution) for a test drive today and unlock a suite of features designed to simplify, optimize, and scale your data pipelines. For a limited time, new sign-ups will receive a complimentary Airflow Fundamentals Certification exam (normally $150).

Data Science Prerequisites: First Steps Towards Your DS Journey

Knowledge Hut

Data Science is one of the fastest-growing, trending tech career tracks. With such a huge demand for the role, a lot of professionals and graduates are trying to step into this field to meet the demand and build lucrative careers. But with so many options around, it can be overwhelming to take the perfect first step into the field of data science.

Using NumPy to Perform Date and Time Calculations

KDnuggets

NumPy allows you to easily create arrays of dates, perform arithmetic on dates and times, and convert between different time units with just a few lines of code.

Coding 75
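
The three operations named in the teaser (date arrays, date arithmetic, unit conversion) can be sketched with NumPy's `datetime64` and `timedelta64` types. The dates below are chosen to match this digest's week and are illustrative only; the variable names are assumptions, not taken from the linked article.

```python
import numpy as np

# Array of consecutive days; the end date is exclusive, as with np.arange on integers.
days = np.arange("2024-08-10", "2024-08-17", dtype="datetime64[D]")

# Subtracting two dates yields a timedelta64 in the same unit (days).
span = days[-1] - days[0]

# Convert the span from days to hours.
span_hours = span.astype("timedelta64[h]")

# Shift every date forward by one week in a single vectorized step.
next_week = days + np.timedelta64(7, "D")

print(len(days), span, span_hours, next_week[0])
```

Keeping dates in `datetime64` arrays means the arithmetic is vectorized, which is the main reason to prefer them over lists of `datetime` objects for large inputs.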

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.