Mon.Jun 17, 2024

article thumbnail

5 Free Templates for Data Science Projects on Jupyter Notebook

KDnuggets

Boost your data science project with these templates.

article thumbnail

Delta Lake table as a changelog

Waitingforcode

One of the big challenges in streaming Delta Lake is the inability to handle in-place changes, like updates, deletes, or merges. There is good news, though. With a little bit of effort on your data provider's side, you can process a Delta Lake table as you would process Apache Kafka topics, hence without in-place changes.

Kafka 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

A Tour of Python NLP Libraries

KDnuggets

Exploring the available text Python packages for your data workflow.

Python 136
article thumbnail

Streamlit in Snowflake: Improved Customization, Performance and AI Capabilities

Snowflake

Snowflake’s mission is to mobilize the entire world’s data, and there are millions of data scientists and developers who don’t have access to full-stack engineering teams. It’s been our endeavor to bring the power of the AI Data Cloud to every individual developer, data scientist and machine learning engineer, so that they can build and share world-class data apps — all by themselves.

Python 110
article thumbnail

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

What Data Scientists Should Know About OpenUSD

KDnuggets

Let's dive into what data scientists should know about OpenUSD and how it can enhance their workflows.

Data 129
article thumbnail

Boost your Productivity with Tool Parameter Overrides in ArcGIS Pro 3.3

ArcGIS

Productivity Update! Learn how to override default parameter values for geoprocessing tools in ArcGIS Pro 3.3. Override Geoprocessing Tool Defaults in ArcGIS Pro 3.

109
109

More Trending

article thumbnail

How to Turn a REST API Into a Data Stream with Kafka and Flink

Confluent

Improve REST API response data w/Kafka and Flink SQL in Confluent Cloud; Automatic connector retriability combats REST flakiness; Demo w/OpenSky data.

Kafka 105
article thumbnail

I Took the Google Data Analytics Certification Where 2,148,697 Have Already Enrolled

KDnuggets

These were my thoughts on the certification.

article thumbnail

What’s new for CAD and BIM in ArcGIS Pro 3.3

ArcGIS

Discover what's new in ArcGIS Pro 3.3 for CAD and BIM workflows, allowing you to directly read datasets from Autodesk Revit, Civil 3D, and Industry Foundation Classes.

article thumbnail

6 Startups Redefining 3D Workflows with OpenUSD and Generative AI

KDnuggets

Driving the next era of advanced 3D solutions with OpenUSD

85
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

PySpark Explained: The explode and collect_list Functions

Towards Data Science

Two useful functions to nest and un-nest data sets in PySpark Continue reading on Towards Data Science »

article thumbnail

Protected: What’s new for CAD and BIM in ArcGIS Pro 3.3

ArcGIS

Discover what's new in ArcGIS Pro 3.3 for CAD and BIM workflows, allowing you to directly read datasets from Autodesk Revit, Civil 3D, and Industry Foundation Classes.

article thumbnail

AI in Financial Fraud Detection and Prevention

RandomTrees

AI technology is revolutionizing and changing the way that fraud detection and prevention are being practiced, especially in the finance industry. AI-driven fraud solutions are increasingly being adopted by financial institutions globally to fight against fast-growing cybercrimes. This article looks into AI’s different uses in financial fraud detection, with a focus on techniques involving anomaly detection, machine learning algorithms, and real-time data analysis that help safeguard the credibi

Banking 52
article thumbnail

“The Future of AI is Real-Time Data” Manifesto

Striim

To the data scientists pushing the boundaries of what’s possible, the AI experts and enthusiasts who see beyond the horizon, and the techies building tomorrow’s solutions today — this manifesto is for you. The key to unlocking AI’s full potential lies in real time data. Traditional methods no longer suffice in a world that demands instant insights and immediate action.

article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri