Wed.May 29, 2024

article thumbnail

5 Python Best Practices for Data Science

KDnuggets

Level up your Python skills for data science with these by following these best practices.

article thumbnail

What’s New in ArcGIS Roads and Highways and ArcGIS Pipeline Referencing (May 2024)

ArcGIS

The latest release of ArcGIS Roads and Highways and ArcGIS Pipeline Referencing includes a variety of new and enhanced features.

article thumbnail

From Data to Destinations: How Skyscanner Optimizes Traveler Experiences with Databricks Unity Catalog

databricks

This blog is authored by Michael Ewins, Director of Engineering at Skyscanner At Skyscanner , we're more than just a flight search engine.

article thumbnail

What’s New from the Geodatabase Team in ArcGIS Pro 3.3

ArcGIS

Here's everything new in ArcGIS Pro 3.3 from the Geodatabase Team.

Data 135
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Top 15 R Libraries for Data Science in 2024

Knowledge Hut

While many people opt for Python for data science tasks today, R remains a staple in the data scientist's toolkit. With its clean code, ability to chain functions and the pipe operator, R can often make simple tasks like exploratory analysis or visualization super easy to do. It also stands its ground well when it comes to complex tasks like forecasting or modelling.

article thumbnail

Solving the Dual-Write Problem: Effective Strategies for Atomic Updates Across Systems

Confluent

The dual-write problem can arise in any distributed system. Fortunately, it has solutions in event sourcing & the transactional outbox & listen-to-yourself patterns.

Systems 94

More Trending

article thumbnail

Evaluating Large Language Models with Giskard in MLflow

databricks

Over the last few years, Large Language Models (LLMs) have been reshaping the field of natural language, thanks to their transformer-based architectures and.

article thumbnail

Retail Media’s Business Case for Data Clean Rooms Part 2: Commercial Models

Snowflake

In Part 1 of “Retail Media’s Business Case for Data Clean Rooms,” we discussed how to (1) assess your data assets and (2) define your data structures and permissions. Once you have a plan on paper, you can begin sizing the data clean room opportunity for your business. Step 3: Commercial Models to Unlock Revenue at Scale Modeling the business value comes down to two things: (1) What data are you making accessible; and (2) How many partners are you willing (and able) to engage?

Retail 91
article thumbnail

Generative AI on Architecture Diagram Creation

RandomTrees

In the contemporary digital landscape , this process involves creating a comprehensive solution that processes images from a specified webpage, converts the visual content into interpretable text or code using advanced large language models, and then transforms the generated text or code into an editable diagram by leveraging a diagram creation platform.

article thumbnail

How Confluent Champion Zhibo Takes on Difficult Sales Conversations

Confluent

Learn how Confluent Champion Zhibo helps APAC customers uncover hidden data problems in his role as an account executive in Japan.

Data 52
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

20 Best Datasets for Data Visualization

Knowledge Hut

The choice of datasets is crucial for creating impactful visualizations. Demographic data, such as census data and population growth, help uncover patterns and trends in population dynamics. Economic data, including GDP and employment rates, identify economic patterns and business opportunities. Environmental data, like climate change and pollution levels, contribute to scientific research and policy formulation and so on.

article thumbnail

The Ultimate Guide to Snowflake Data Cloud Summit 2024

Monte Carlo

Can you believe Snowflake Summit is almost here? Time really flies when you’re living in the GenAI hype cycle. If you’ll be at Snowflake Summit in San Francisco June 3-6 and you haven’t planned your daily schedule yet, never fear. We bookmarked the can’t miss moments for you. Read on to learn the speaking sessions we’re most excited about, the giveaways on the conference floor that are actually pretty cool, and the after-parties you don’t want to miss.

Cloud 69