December, 2024

article thumbnail

Top 11 GenAI Powered Data Engineering Tools to Follow in 2025

Analytics Vidhya

What will data engineering look like in 2025? How will generative AI shape the tools and processes Data Engineers rely on today? As the field evolves, Data Engineers are stepping into a future where innovation and efficiency take center stage. GenAI is already transforming how data is managed, analyzed, and utilized, paving the way for […] The post Top 11 GenAI Powered Data Engineering Tools to Follow in 2025 appeared first on Analytics Vidhya.

article thumbnail

Powering AI innovation by acccelerating the next wave of nuclear

Engineering at Meta

Meta releases a Request for Proposals (RFP) to identify nuclear energy developers to support AI innovation and clean and renewable energy goals.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

7 Projects to Master Data Engineering

KDnuggets

Learn to build, run, and manage data engineering pipelines both locally and in the cloud using popular tools.

article thumbnail

Agents of Change: Navigating 2025 with AI and Data Innovation

Data Engineering Weekly

As we approach the new year, it's time to gaze into the crystal ball and ponder the future. In this post, we delve into predictions for 2025, focusing on the transformative role of AI agents, workforce dynamics, and data platforms. Join Ananth Packkildurai, Ashwin Ashish, and Rajesh as they unravel the future and guide us through the fascinating changes ahead.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Unapologetically Technical Episode 15 – Frances Perry

Jesse Anderson

We’re down to the last month of the year, but we’re never leaving without a new episode of the Unapologetically Technical! In this episode, I interview Frances Perry, the Head of Engineering at MotherDuck. Frances Perry is an engineering manager who spent many years as a heads-down coder creating various distributed systems used in Google and Google Cloud.

article thumbnail

Queues in Apache Kafka®: Enhancing Message Processing and Scalability

Confluent

Queue support in Apache Kafka 4.0, enabled by share groups, lets you accommodate traditional queue-type workloads through cooperative consumption.

Kafka 131

More Trending

article thumbnail

Alternatives to Azure Document Intelligence Studio: Exploring Powerful Document Analysis Tools

Seattle Data Guy

Document Intelligence Studio is a data extraction tool that can pull unstructured data from diverse documents, including invoices, contracts, bank statements, pay stubs, and health insurance cards. The cloud-based tool from Microsoft Azure comes with several prebuilt models designed to extract data from popular document types. However, you can also use labeled datasets to train… Read more The post Alternatives to Azure Document Intelligence Studio: Exploring Powerful Document Analysis Tool

Insurance 130
article thumbnail

2024’s Biggest Moments in AI

KDnuggets

2024 has been yet another groundbreaking year for AI, with major breakthroughs, industry shifts, and ethical challenges shaping its future. Let's uncover together the key moments that defined AI this year about to finalize.

IT 127
article thumbnail

Guide to connecting to Excel files in ArcGIS Pro

ArcGIS

This blog provides step-by-step guidance to determine and use a silent install when configuring a driver to use Excel files in ArcGIS Pro. Learn More.

article thumbnail

Secure External Access to Unity Catalog Assets via Open APIs

databricks

We're excited to announce the Public Preview of credential vending for Unity Catalogs open APIs, allowing external clients to securely access Unity Catalog.

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Integrating Microservices with Confluent Cloud Using Micronaut® Framework

Confluent

Real-time data streaming and messaging are essential for building scalable, resilient, event-driven microservices. Explore integrating the Micronaut framework with Confluent Cloud.

Cloud 115
article thumbnail

Introducing Configurable Metaflow

Netflix Tech

David J. Berg * , David Casler ^, Romain Cledat * , Qian Huang * , Rui Lin * , Nissan Pow * , Nurcan Sonmez * , Shashank Srikanth * , Chaoying Wang * , Regina Wang * , Darin Yu * *: Model Development Team, Machine Learning Platform ^: Content Demand ModelingTeam A month ago at QConSF, we showcased how Netflix utilizes Metaflow to power a diverse set of ML and AI use cases , managing thousands of unique Metaflow flows.

article thumbnail

The Basics of SFTP: Authentication, Encryption, and File Management

Seattle Data Guy

If you’re looking to pass hundreds of GBs of data quickly, you’re likely not going to use a REST API. That’s why every day, companies share data sets of users, patient claims, financial transactions, and more via SFTP. If youve been in the industry for a while, youve probably come across automated SFTP jobs that… Read more The post The Basics of SFTP: Authentication, Encryption, and File Management appeared first on Seattle Data Guy.

article thumbnail

10 GitHub Repositories to Master Reinforcement Learning

KDnuggets

Learn reinforcement learning using free resources, including books, frameworks, courses, tutorials, example code, and projects.

Coding 134
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Build Better Custom Geoprocessing tools (now with Enable Undo) in ArcGIS Pro!

ArcGIS

Learn how to build a custom geoprocessing tool and about some new features, like Enable Undo for Script and Model tools, in ArcGIS Pro 3.

Building 115
article thumbnail

Strategic Priorities for Data and AI Leaders in 2025

databricks

AI remains at the forefront of every business leaders plans for 2025. Overall, 70% of businesses continue to believe AI is critical to.

Data 111
article thumbnail

Generative AI Meets Data Streaming (Part I) – Data as the Engine: Building the AI Fundamentals

Confluent

Discover how data fuels Generative AI and why streaming data is key to success. Learn the fundamentals to unlock AIs true potential for your business.

Building 111
article thumbnail

Data Contracts were a LIE!

Confessions of a Data Guy

Today we talk about what is really going on with Data Contracts, they came in like a rocket a few years ago, but then died on the vine. What’s the deal? The post Data Contracts were a LIE! appeared first on Confessions of a Data Guy.

Data 100
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Data News — Small break until January

Christophe Blefari

Hey, it's been a few weeks since something has been published here—I hope you haven’t forgotten about me 😊 In the last weeks I've been all over the place and worked on a lot of topics except this newsletter, I've decided to take a break from the newsletter to catchup the rhythm in January! The Forward Data Conference was a huge success and I want to thanks again all the attendees, speakers, sponsors and my co-organisers.

Data 100
article thumbnail

How to Implement Image Captioning with Vision Transformer (ViT) and Hugging Face Transformers

KDnuggets

A beginners guide to getting started with image captioning models with HuggingFace.

123
123
article thumbnail

Doing more with Density tools: Understanding spatial patterns of data in ArcGIS Pro

ArcGIS

Explore Density tools in ArcGIS Pro for spatial data analysis to reveal hidden patterns and effective visualization to aid in informed decision-making.

article thumbnail

Streamline AI Agent Evaluation with New Synthetic Data Capabilities

databricks

Our customers continue to shift from monolithic prompts with general-purpose models to specialized agent systems to achieve the quality needed to drive ROI.

Systems 113
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

LLMs vs Advent of Code, AI is winning by Colin Eberhardt

Scott Logic

Advent of Code (AoC) is an annual, christmas-themed, coding competition that has been running for the past years and is something that I participate in at times. This year, while ~~subjecting myself to~~ learning Rust, I decided to see how OpenAIs latest model faired at the challenge. I quickly knocked together a script, and to my astonishment, found that o1-mini gave correct answers to all but one part of the first six days.

Coding 98
article thumbnail

Value-Focused Data Leaders to Watch in 2025

Snowflake

As organizations mature in their execution of data and AI initiatives, a burning question remains: How do we measure the effectiveness of our teams and our impact on the business? This isnt the perennial Whats my data worth? dilemma often asked rhetorically and answered theoretically. Todays challenge is concrete: to define and track the metrics used to justify continued investment in data and AI innovation.

article thumbnail

AWS S3 Tables. Technical Introduction.

Confessions of a Data Guy

Well, everyone is abuzz with the recently announced S3 Tables that came out of AWS reinvent this year. I’m going to call fools gold on this one right out of the gate. I tried them out, in real life that is, not just some marketing buzz, and it will leave most people, not all, be […] The post AWS S3 Tables. Technical Introduction. appeared first on Confessions of a Data Guy.

AWS 130
article thumbnail

10 Python Libraries Every Developer Should Know

KDnuggets

In this article, we’ll go over Python libraries for tasks like logging, unit testing, data handling, and more — each with features that can simplify your application development.

Python 125
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

What’s New for Spatial Analytics across ArcGIS (Q4 2024)

ArcGIS

Spatial Analytics and Data Science capabilities across ArcGIS have been enhanced this fall with new tools and optimized experiences.

article thumbnail

Introducing Git Support for Queries in Databricks

databricks

Were excited to announce the Public Preview of Query Git integration as part of the new SQL Editor. Git support for queries.

SQL 109
article thumbnail

Preparing Your Data Infrastructure for 2025: Lessons from the Past, Strategies for the Future

Seattle Data Guy

When I broke into the data world, everyone wanted to hire data scientists that would let their companies become more data driven. There were statistics about the exabytes of data that we were creating and the value it would provide. However, a few years into my career, the data world started to make a pivot… Read more The post Preparing Your Data Infrastructure for 2025: Lessons from the Past, Strategies for the Future appeared first on Seattle Data Guy.

Data 130
article thumbnail

Key Takeaways from AWS re:Invent 2024

Cloudera

AWS re:Invent is one of my favorite trade shows. It is one of the biggest technology conferences of the year and is an opportunity to have hundreds of conversations with customers and prospects, listen to their priorities and challenges, hopes, and give them a Cloudera tote bag or a pair of orange sunglasses. What follows is a collection of just a few things I learned and observed during my week in Las Vegas.

AWS 75
article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.