Sat.Apr 27, 2024 - Fri.May 03, 2024

article thumbnail

Is the “AI developer”a threat to jobs – or a marketing stunt?

The Pragmatic Engineer

This article was published on 14 March 2024 in The Pragmatic Engineer, for subscribers. I'm sharing this piece in public more than a month later, as it provides important context and analysis for the AI dev tools space. Subscribe to The Pragmatic Engineer to stay up-to-date on what is happening with software engineering, Big Tech, and startups.

article thumbnail

Why did Golang lose to Rust for Data Engineering?

Confessions of a Data Guy

A few years ago I wasn’t sure, who was going to win, Golang seemed to be popular, and still is for that matter. When I first wrote a little Golang (~2+ years ago) I was just trying to see what the hype was all about. The funny thing is, at the time, and today, it […] The post Why did Golang lose to Rust for Data Engineering? appeared first on Confessions of a Data Guy.

article thumbnail

5 Simple Steps to Automate Data Cleaning with Python

KDnuggets

Automate your data cleaning process with a practical 5-step pipeline in Python, ideal for beginners.

Python 154
article thumbnail

Build Your Second Brain One Piece At A Time

Data Engineering Podcast

Summary Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and unwieldy. In order to simplify the integration of AI capabilities into developer workflows Tsavo Knott helped create Pieces, a powerful collection of tools that complements the tools that developers already use.

Building 147
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Introducing Confluent Cloud Freight Clusters

Confluent

Confluent Cloud Freight clusters are now available in Early Access. In this blog, learn how Freight clusters can save you up to 90% at GBps+ scale.

Cloud 145
article thumbnail

Databricks named a Leader in the 2024 Forrester Wave for Data Lakehouses

databricks

We are proud to announce that Forrester has recognized Databricks as a Leader with the highest scores in both current offering and strategy.

Data 134

More Trending

article thumbnail

How to build a data team

Christophe Blefari

My personal collection of the best resources to bootstrap a data team and get inspired from what others are doing.

Building 130
article thumbnail

Reaction to Data Engineering Survey for 2024

Confessions of a Data Guy

The post Reaction to Data Engineering Survey for 2024 appeared first on Confessions of a Data Guy.

article thumbnail

Executive Overview: The Rise of Open Foundational Models

databricks

Moving generative AI applications from the proof of concept stage into production requires control, reliability and data governance. Organizations are turning to open.

article thumbnail

Containerize Python Apps with Docker in 5 Easy Steps

KDnuggets

Get up and running with Docker with this tutorial on containerizing Python applications.

Python 148
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Terms You Should Know If You’re Planning To Use Change Data Capture

Seattle Data Guy

If you’ve worked in data long enough, then you’ve likely come across the term change data capture. Often called CDC, change data capture involves tracking and recording changes in a database as they happen, and then transmitting these changes to designated targets. This can be crucial because some pipelines, in particular batch pipelines, don’t capture… Read more The post Terms You Should Know If You’re Planning To Use Change Data Capture appeared first on Seattle D

Database 130
article thumbnail

ArcGIS Pro 3.3 Requires WebView2 Runtime (and you probably already have it)

ArcGIS

ArcGIS Pro 3.3 requires WebView2 Runtime as an installation prerequisite. Here's how to make sure you have it.

IT 127
article thumbnail

Databricks Assistant Tips & Tricks for Data Engineers

databricks

The generative AI revolution is transforming the way that teams work, and Databricks Assistant leverages the best of these advancements. It allows you.

article thumbnail

5 MLOps Courses from Google to Level Up Your ML Workflow

KDnuggets

Want to build and deploy robust machine learning systems to production? Start learning MLOps today with these courses from Google.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Moving Beyond MTEB and BEIR: Snowflake AI Research Joins Forces with the University of Waterloo to Evolve RAG and Retrieval Benchmarks

Snowflake

To accurately answer business questions using LLMs, companies must augment models with their data. Retrieval Augmented Generation (RAG) is a popular solution to this problem, as it integrates the organization’s factual, real-time data into the prompt for the LLM. While the adoption of RAG has increased, an open question remains: How do enterprises know how effective their system is?

Cloud 120
article thumbnail

A Notebook is all I want or Don't

Data Engineering Weekly

The tweet received strong reactions on LinkedIn and Twitter. To clarify, I quoted it as a Notebook-style development, but it is not exactly a Notebook. There is a lot of context missing in that tweet, so I decided to write a blog about it. People have reservations about using tools like Jupytor Notebook for the production pipeline for a good reason.

article thumbnail

Calibrating the Mosaic Evaluation Gauntlet

databricks

A good benchmark is one that clearly shows which models are better and which are worse. The Databricks Mosaic Research team is dedicated.

128
128
article thumbnail

Avoid These 5 Common Mistakes Every Novice in AI Makes

KDnuggets

Top five mistakes made by AI beginners and practical tips to avoid them, along with an engaging "50-Day Challenge" that you cannot afford to miss.

147
147
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

If:Else Logic and Complexity – Hiding the Pea.

Confessions of a Data Guy

I was recently confronted with an interesting conundrum when writing a complex data pipeline. It was an interesting problem that arose from my quest to reduce complexity in part of the design, which found itself creeping into another part, re-enforcing the classic idea of whether you can really make the complexity pea go away, or […] The post If:Else Logic and Complexity – Hiding the Pea. appeared first on Confessions of a Data Guy.

article thumbnail

The right words in the right place

Tweag

tl;dr You may not believe it, but Nix documentation is getting better. Nixpkgs and NixOS still need more time. Table of contents Overview Motivation Statistics Retrospective Thoughts on future work Acknowledgements Overview This is a retrospective of my and many other people’s work on documentation in the Nix ecosystem between October 2022 and March 2024.

article thumbnail

The Modern Data Stack: How The Evolution of Data Architecture Led to The Data Intelligence Platform

databricks

Modern data stacks provide the necessary flexibility and efficiency for analytics and AI. Learn how the Databricks Data Intelligence Platform makes use of them.

article thumbnail

The Ultimate AI Strategy Playbook

KDnuggets

Many businesses rush to adopt AI but fail due to poor strategy. This post serves as your go-to playbook for success.

143
143
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Meet the 2024 Snowflake Startup Challenge Finalists

Snowflake

The 2024 Snowflake Startup Challenge began with over 900 applications from startups Powered by Snowflake in more than 100 countries. Our judges narrowed that long list of contenders down to 10, and after much deliberation, they’ve now pared it down to the final three. We are pleased to announce that BigGeo, Scientific Financial Systems and SignalFlare.ai by Extropy360 will advance to the Snowflake Startup Challenge finale and compete for the opportunity to receive a share of up to $1 million in

Media 105
article thumbnail

Reading and Processing JSON with Rust vs Python.

Confessions of a Data Guy

Have you ever wondered about being explicit in your code vs being vague? I think about this a lot as I’m writing code on a daily basis. I’ve found I like being explicit and verbose when writing code, rather than being vague in what I’m doing most of the time. When it comes to debugging […] The post Reading and Processing JSON with Rust vs Python. appeared first on Confessions of a Data Guy.

Python 100
article thumbnail

Revolutionizing Data in Sports: The Game-Changing Impact of Databricks Marketplace and Delta Sharing

databricks

Unlock the power of advanced sports analytics with Databricks Marketplace and Delta Sharing. Discover how these platforms are transforming the sports industry by enabling seamless data access, collaboration, and real-time insights. Leverage a diverse array of data assets to optimize performance, enhance fan engagement, and gain a competitive edge. Explore the future of sports analytics, powered by Databricks.

Data 111
article thumbnail

Data Science Degrees vs. Courses: The Value Verdict

KDnuggets

Exploring the merits of data science degrees vs courses, this analysis contrasts their depth, prestige, and practicality in job market preparation

article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Agile Coach vs Scrum Master: The Difference Stated

Knowledge Hut

Agile methodology is a simple, flexible, and iterative product development model with the distinct advantages of accommodating new requirement changes and incorporating the feedback of the previous iterations over the traditional waterfall development model. Agile methodology is the most popular and dynamic software product development and project maintenance model.

article thumbnail

Google Fires Python. What Next?

Confessions of a Data Guy

What is going on? Is the world coming to an end? I thought Python was going to live forever. Well, apparently not at Google. Recently Google announced it was laying off its entire North American-based Python team that was supporting Google’s special needs with Python, in favor of cheaper offshore workers. Apparently, some of these […] The post Google Fires Python.

Python 100
article thumbnail

Intelligently Balance Cost Optimization & Reliability on Databricks

databricks

The Databricks Data Intelligence Platform offers unparalleled flexibility, allowing users to access nearly instant, horizontally scalable compute resources. This ease of creation can.

article thumbnail

Free Python Resources That Can Help You Become a Pro

KDnuggets

This is a collection of free courses, books, projects, repositories, cheat sheets, and online compilers on Python to help you get started and gain experience.

Python 140
article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.