Sat.Feb 10, 2024 - Fri.Feb 16, 2024

article thumbnail

Data Warehousing Essentials: A Guide To Data Warehousing

Seattle Data Guy

Photo by Tiger Lily Data warehouses and data lakes play a crucial role for many businesses. It gives businesses access to the data from all of their various systems. As well as often integrating data so that end-users can answer business critical questions. But if we take a step back and only focus on the… Read more The post Data Warehousing Essentials: A Guide To Data Warehousing appeared first on Seattle Data Guy.

Data Lake 162
article thumbnail

Large Language Models Explained in 3 Levels of Difficulty

KDnuggets

Simple explanations, no matter what your level is right now.

156
156
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Data Sharing Across Business And Platform Boundaries

Data Engineering Podcast

Summary Sharing data is a simple concept, but complicated to implement well. There are numerous business rules and regulatory concerns that need to be applied. There are also numerous technical considerations to be made, particularly if the producer and consumer of the data aren't using the same platforms. In this episode Andrew Jefferson explains the complexities of building a robust system for data sharing, the techno-social considerations, and how the Bobsled platform that he is building

Data Lake 147
article thumbnail

Access Over 181,000 USGS Historical Topographic Maps

ArcGIS

We recently updated our online USGS historical topographic map collection with over 1,745 new maps for a new total of over 181,000 maps.

article thumbnail

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

Data News — Week 24.07

Christophe Blefari

Italy Sora ( credits ) Hey you, time for the Data News. Because I did not send the news last week you will get articles from the 2 last weeks. Last few days have been heavily packed with AI News as well. Disclaimer, the 2 events below will be in French. Before jumping to the news there are a few events I want to write about. Next Wednesday I will participate to a Data Night Talk a open discussion about AI & data engineering with other content creators.

Food 130
article thumbnail

7 Steps to Mastering Exploratory Data Analysis

KDnuggets

A Step-by-Step Approach to Unearthing Trends, Outliers, and Insights in your Data.

More Trending

article thumbnail

Why Your Team Needs To Implement Data Quality For Your AI Strategy

Seattle Data Guy

Companies that range from start-ups to enterprises are looking to implement AI and ML into their data strategy. With that it’s important not to forget about data quality. Regardless of how fancy or sophisticated a company’s AI model might be, poor data quality will break it. It will make the outputs of these models useless… Read more The post Why Your Team Needs To Implement Data Quality For Your AI Strategy appeared first on Seattle Data Guy.

Data 130
article thumbnail

Databricks adds new migration Brickbuilder Solutions to help customers succeed with AI

databricks

For the past two years, Databricks has collaborated with leading consulting partners to build innovative solutions for industry, migration, and data and AI.

article thumbnail

Master The Art Of Command Line With This GitHub Repository

KDnuggets

Whether you are a beginner or an experienced user, this guide is perfect for familiarizing yourself with basic and advanced command line tools.

article thumbnail

Snowflake’s Data Classification Lets You Identify and Tag Sensitive Data Directly in Snowsight

Snowflake

At Snowflake, we believe in empowering our customers to harness the full potential of their data while maintaining robust compliance standards and safeguarding data privacy. We recognize the critical importance of quickly identifying and safeguarding sensitive data objects, and we consistently strive to provide solutions that help achieve these goals — from advancements such as classification and tag-based policies to the intuitive Data Governance UI.

Data 126
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

New with Confluent Platform: Seamless Migration Off ZooKeeper, Arm64 Support, and More

Confluent

Confluent Platform 7.6 brings upgrading for existing clusters from ZooKeeper to KRaft, compaction support for Tiered Storage, OAuth (early access), improvements to the Oracle CDC premium connector, and more.

article thumbnail

One map to rule them all

ArcGIS

All we have to decide is what to map with the time that is given us.

120
120
article thumbnail

Jupyter Notebook Magic Methods Cheat Sheet

KDnuggets

KDnuggets' latest original cheat sheet covers Jupyter Notebook magic methods. Check it out now and become a notebook magician.

IT 149
article thumbnail

Four Questions to Consider When Navigating the Rapid Evolution of Generative AI

Snowflake

A strategic approach to data and talent strategies will distinguish leaders in a transformed business landscape. What might that look like? Generative AI’s (gen AI) capabilities seemed startlingly novel a year ago, when ChatGPT’s release led to an explosion of public usage and, simultaneously, intense debate about its potential societal and business impacts.

article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Meta loves Python

Engineering at Meta

By now you’re already aware that Python 3.12 has been released. But did you know that several of its new features were developed by Meta ? Meta engineer Pascal Hartig ( @passy ) is joined on the Meta Tech Podcast by Itamar Oren and Carl Meyer, two software engineers at Meta, to discuss their teams’ contributions to the latest Python release, including new hooks that allow for custom JITs like Cinder , Immortal Objects , improvements to the type system, faster comprehensions, and more.

Python 102
article thumbnail

Introducing SafeTest: A Novel Approach to Front End Testing

Netflix Tech

by Moshe Kolodny In this post, we’re excited to introduce SafeTest, a revolutionary library that offers a fresh perspective on End-To-End (E2E) tests for web-based User Interface (UI) applications. The Challenges of Traditional UI Testing Traditionally, UI tests have been conducted through either unit testing or integration testing (also referred to as End-To-End (E2E) testing).

Coding 101
article thumbnail

2024 Tech Trends: AI Breakthroughs & Development Insights from O’Reilly’s Free Report

KDnuggets

Want to prepare your tech career for 2024 and onwards? Have a look at O’Reilly’s FREE technology trends report.

article thumbnail

New Snowflake Features Released in January 2024

Snowflake

Snowflake kicked off 2024 with exciting releases, including Snowpark Model Registry, Streamlit in Snowflake for Azure, and new enhancements around security features in Snowflake Horizon. Read on to learn more about everything we announced in January. Snowpark Updates Model management with the Snowpark Model Registry – public preview Snowpark Model Registry is an integrated solution to register, manage and use models and their metadata natively in Snowflake.

article thumbnail

The Ultimate Guide to Apache Airflow DAGS

With Airflow being the open-source standard for workflow orchestration, knowing how to write Airflow DAGs has become an essential skill for every data engineer. This eBook provides a comprehensive overview of DAG writing features with plenty of example code. You’ll learn how to: Understand the building blocks DAGs, combine them in complex pipelines, and schedule your DAG to run exactly when you want it to Write DAGs that adapt to your data at runtime and set up alerts and notifications Scale you

article thumbnail

Strategic Thinking For Business Planning

Knowledge Hut

An organization without strategic planning is like a ship without the captain. Developing strategic thinking into a plan and executing is inevitable for the businesses to compete in the global marketing environment. Strategic thinking, planning, and execution encompass all the concerns pertaining to ambitious goals, prevailing conditions, limitations, challenges, competence, changing customers’ preferences etc.

BI 98
article thumbnail

Microservices vs. Monolithic Approaches in Data

Towards Data Science

The Microservice vs.

article thumbnail

What Is Data Lineage, And Why Does It Matter?

KDnuggets

If you’ve ever had conversations with data professionals, you’ve probably heard “data lineage” pop up quite a few times. So what is data lineage all about, and why is it important?

IT 148
article thumbnail

5 Reasons to Connect with Snowflake at Mobile World Congress Barcelona 2024

Snowflake

The telecom industry is evolving rapidly, influenced by tech innovation and trends like generative AI and 5G, and the pressures of cost efficiency and global competition. One way telecom companies stay ahead of the curve is by attending the world’s largest and most influential telecom and connectivity event: Mobile World Congress (MWC) 2024, held February 26 – 29 in Barcelona, Spain.

Python 110
article thumbnail

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Essential Hacks To Become A CBAP Certified Professional

Knowledge Hut

Certified Business Analysis Professional ( CBAP ®) offered by the International Institute of Business Analysis (IIBA®) is the most prestigious professional certification that a business analyst can do. In order to tackle and conquer the exam you need meticulous planning, diligent and honest preparation and confidence in facing the exam. This article intends to provide guidance on passing the examination from the time of filling up the application form.

article thumbnail

5 Bad Habits Killing Your Potential as a Data Engineer

Towards Data Science

And easy solutions that can immediately turn them around Continue reading on Towards Data Science »

article thumbnail

Learn Data Science on a Budget

KDnuggets

This blog will go through platforms and courses you can take that will get you from 0-100 on your data science knowledge.

article thumbnail

Accelerating Success with Databricks: A Deep Dive into antuit.ai's Decision and Customer Impact

databricks

In the dynamic realm of AI-driven forecasting, businesses navigate a landscape where strategic choices shape their trajectory. One such pivotal decision was made.

Retail 97
article thumbnail

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

article thumbnail

How to Encourage Knowledge Sharing in the Workplace?

Knowledge Hut

The conventional practices of working have changed in this 21st century. People cannot just stay as a specialist for a single area in the workplace anymore. These days, many people attend Agile Management certification courses online to learn the art of agility or how to become more adaptive at workplaces, which is a brilliant idea! They must have at least a preliminary knowledge of the job functions of other people in their company.

article thumbnail

Apache Beam: Data Processing, Data Pipelines, Dataflow and Flex Templates

Towards Data Science

In this first article, we’re exploring Apache Beam, from a simple pipeline to a more complicated one, using GCP Dataflow.

article thumbnail

Synthetic Data for Machine Learning

KDnuggets

You don't always have high-quality labeled datasets for supervised machine learning. Learn about why you should augment your real data with synthetic data as well as the ways to generate it.

article thumbnail

Email customer support automation using Databricks LLM platform

databricks

About UK Power Networks UK Power Networks is the largest electricity distributor in the UK. It maintains electricity cables and lines in London.

IT 96
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.