Tue. Aug 20, 2024

Building a Recommendation System with Hugging Face Transformers

KDnuggets

Learn how to build a recommendation system with advanced technology.

Systems 123

Unlock Real-Time Cross-Platform Collaboration with Delta Sharing Tableau Connector

databricks

Special thanks to Kevin Glover, Martin Ko, Kuber Sharma and the team at Tableau for their valuable insights and contributions to this blog.

116

How to Conduct Time Series Analysis in R

KDnuggets

This article explains the basics of time series analysis. Learn to prepare your data and visualize trends in R.

Data 118

What is a “Good” Data or Software Engineer?

Confessions of a Data Guy

Recently, for some unknown reason, I was perusing the new Stackoverflow … called Reddit, for Data Engineering … and I ran across an interesting question … more or less it was related to “what makes a good Software Engineer … in a Data Engineering context.” This isn’t the first time this idea has come up […]

Apache Airflow® 101 Essential Tips for Beginners

Apache Airflow® is the open-source standard to manage workflows as code. It is a versatile tool used in companies across the world from agile startups to tech giants to flagship enterprises across all industries. Due to its widespread adoption, Airflow knowledge is paramount to success in the field of data engineering.

5 Tips for Effective Data Visualization

KDnuggets

Looking to make your data visuals stand out? Check out these five tips for effective data visualization.

Data 115

More Trending

AI Challenges and How Cloudera Can Help

Cloudera

By now, every organization, regardless of industry, has at least explored the use of AI, if not already embraced it. In today’s market, the AI imperative is firmly here, and failing to act quickly could mean getting left behind. But even as adoption soars, struggles remain, and scalability continues to be a major issue. Organizations are quick to adopt AI, but getting it established across the organization brings a unique set of challenges that come into play.

How Composable CDPs Empower Healthcare with Secure Data Insights

Snowflake

Healthcare and life sciences professionals face unique challenges when they want to use customer data. Much of this data is sensitive and highly regulated by laws like HIPAA. Data also tends to be fragmented between disparate systems, serving different stakeholders, such as patients, providers, business teams, insurance companies and more. These challenges are worth grappling with.

Automating Report Distribution with Snowpark

Cloudyard

Imagine a scenario where a business needs to automatically generate and send customer invoices at the end of each month. The invoices are generated from transaction data stored in Snowflake, and each customer receives an email with the invoice attached as a CSV file. This use case requires a solution that can not only generate the invoices but also handle email distribution with attachments.
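
The two halves of that use case, producing a per-customer CSV and attaching it to an email, can be sketched with the Python standard library alone. This is a minimal illustration and not the Snowpark approach from the linked article: it assumes the transaction rows have already been fetched from Snowflake as dictionaries, and the column names (`customer_id`, `item`, `amount`) and both function names are hypothetical.

```python
import csv
import io
from collections import defaultdict
from email.message import EmailMessage


def invoices_by_customer(transactions):
    """Group transaction rows by customer and render one CSV string per customer.

    `transactions` is assumed to be an iterable of dicts with the hypothetical
    keys customer_id, item, and amount (for example, rows already fetched
    from a warehouse query).
    """
    grouped = defaultdict(list)
    for row in transactions:
        grouped[row["customer_id"]].append(row)

    invoices = {}
    for customer_id, rows in grouped.items():
        buffer = io.StringIO()
        writer = csv.DictWriter(buffer, fieldnames=["customer_id", "item", "amount"])
        writer.writeheader()
        writer.writerows(rows)
        invoices[customer_id] = buffer.getvalue()
    return invoices


def build_invoice_email(sender, recipient, customer_id, csv_text):
    """Create an email message with the invoice attached as a CSV file."""
    message = EmailMessage()
    message["From"] = sender
    message["To"] = recipient
    message["Subject"] = f"Monthly invoice for {customer_id}"
    message.set_content("Your invoice for this month is attached.")
    # Attach the CSV as bytes so the content type is explicitly text/csv.
    message.add_attachment(
        csv_text.encode("utf-8"),
        maintype="text",
        subtype="csv",
        filename=f"invoice_{customer_id}.csv",
    )
    return message
```

The returned message could then be handed to `smtplib.SMTP.send_message`; the mail server, credentials, and monthly scheduling are left out here because they depend on the environment.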

Luigi vs Airflow: Which is the Better Tool?

Hevo

When it comes to orchestrating workflows and managing data pipelines, Luigi and Airflow are two of the most popular tools in the industry. Both have their own unique strengths and use cases, but choosing between them can be challenging.

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

Magic in the Data: Data Curation for AI/BI Genie

databricks

An MBA intern’s summer experience: curating a custom Databricks AI/BI Genie Space to answer critical data questions and speed up enterprise workflows by 10X.

BI 52

dbt vs Airflow: A Comprehensive Guide

Hevo

Data has become the foundation of any successful business. The ability to efficiently extract, transform, and load data for analysis is crucial for making informed data-driven decisions. Therefore, the tools you choose for managing your business data are also extremely important. This blog will discuss two such tools: dbt and Airflow.

Podcast: Open Source DataOps Tools on Roaring Elephant (Part 2)

DataKitchen

DataOps, the promising future that nobody seems to be able to make reality. But not for lack of trying: meet Chris Bergh, "Head Chef" at DataKitchen, joining us again to tell us how the field evolved over the last few years. To get in.

52

Driving Retail Transformation: How Striim Powers Seamless Cloud Migration and Data Modernization

Striim

In today’s fast-paced retail environment, digital transformation is essential to stay competitive. One powerful way to achieve this transformation is by modernizing data architecture and migrating to the cloud. There are countless ways to leverage Striim but this is one of the most exciting, as the platform offers large retailers the tools they need to seamlessly transition from legacy systems to a more agile, cloud-based infrastructure.

Retail 52

Apache Airflow® Crash Course: From 0 to Running your Pipeline in the Cloud

With over 30 million monthly downloads, Apache Airflow is the tool of choice for programmatically authoring, scheduling, and monitoring data pipelines. Airflow enables you to define workflows as Python code, allowing for dynamic and scalable pipelines suitable to any use case from ETL/ELT to running ML/AI operations in production. This introductory tutorial provides a crash course for writing and deploying your first Airflow pipeline.

AI Agents for Customer Support

DareData

Learn how AI Agents can transform your customer support processes. In today’s world, the way businesses interact with customers is evolving rapidly. AI agents, powered by advanced technologies like large language models (LLMs) and natural language processing (NLP), are revolutionizing customer support. In the past, deploying chatbots often led to customer dissatisfaction.

AI in Government – Balancing productivity gains with accountability by Graham Odds

Scott Logic

To begin with a quote regularly and erroneously attributed to Henry Ford, “If I had asked my customers what they wanted, they would have said a faster horse.” A lot of the hype surrounding the latest developments in Generative AI (GenAI) focuses on its potential to carry out particular tasks much faster than humans. It’s an understandable human impulse to look for ways to speed up time-consuming (boring?

How to Extract Snowflake Data Observability Metrics Using SQL

Hevo

Ensuring the quality and reliability of data is crucial in today’s data-driven world, as it is essential for making informed decisions and improving operational efficiency. This is where data observability comes into play. It is the practice of understanding, diagnosing, and managing data health throughout the data lifecycle.

SQL 40

What Is a Project Plan in Prince2?

Edureka

You might want to become a planning manager and contribute to the success of your business. Prince2 project planning offers people like you the skills and knowledge needed to plan, execute, and implement projects properly and efficiently. Topics such as project start-up, scope definition, budgeting, scheduling, risk management, and stakeholder communication are covered in the courses.

Project 40

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

Redesigning Pinterest’s Ad Serving Systems with Zero Downtime (part 2)

Pinterest Engineering

Ning Zhang; Sr. Technical Program Manager | Ang Xu; Principal Machine Learning Engineer | Claire Liu; Staff Software Engineer | Haichen Liu; Staff Software Engineer | Yiran Zhao; Staff Software Engineer | Haoyu He; Sr. Software Engineer | Sergei Radutnuy; Sr. Machine Learning Engineer | Di An; Sr. Software Engineer | Danyal Raza; Sr. Software Engineer | Xuan Chen; Sr.

Systems 62

How To Use PRINCE2 Methodologies in Project Management

Edureka

We shall explore several Prince2 techniques in this post. These methodical techniques help teams effectively begin, organize, complete, oversee, and conclude projects. Scrum, Waterfall, and Agile techniques are a few examples. To ensure the success of Prince2 Certification, each Prince2 management methodology offers unique roles, procedures, and deliverables.

Project 40

The 6 Pillars of AWS Well-Architected Framework

Edureka

In this article, we will briefly examine AWS’s well-architected framework, try to understand its six principles, and explain why they are essential. We aim to offer you all a simplistic guideline with comprehensible and concise information. Table of Content: What is AWS Well-Architected Framework? | The 6 Pillars of the AWS Well-Architected Framework | Operational Excellence | Security | Reliability | Performance Efficiency | Cost Optimization | Sustainability | Why the AWS Well-Architected Framework is Im

AWS 40

PRINCE2 Risk Management Approach: Types, Process, Strategy

Edureka

Prince2 is intended to supply a thorough understanding of risk identification, assessment, and mitigation in a given sector. Techniques for risk analysis, risk assessment frameworks, and risk communication methods are some of the topics covered in these courses. To enhance their risk management abilities, students also study business continuity planning, crisis management, and regulatory compliance.

Process 40

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.