Sat.Feb 10, 2024 - Fri.Feb 16, 2024

article thumbnail

Data Warehousing Essentials: A Guide To Data Warehousing

Seattle Data Guy

Photo by Tiger Lily Data warehouses and data lakes play a crucial role for many businesses. It gives businesses access to the data from all of their various systems. As well as often integrating data so that end-users can answer business critical questions. But if we take a step back and only focus on the… Read more The post Data Warehousing Essentials: A Guide To Data Warehousing appeared first on Seattle Data Guy.

Data Lake 162
article thumbnail

Data Sharing Across Business And Platform Boundaries

Data Engineering Podcast

Summary Sharing data is a simple concept, but complicated to implement well. There are numerous business rules and regulatory concerns that need to be applied. There are also numerous technical considerations to be made, particularly if the producer and consumer of the data aren't using the same platforms. In this episode Andrew Jefferson explains the complexities of building a robust system for data sharing, the techno-social considerations, and how the Bobsled platform that he is building

Data Lake 147
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

What's new on the cloud for data engineers - part 12 (10.2023-02.2024)

Waitingforcode

It's time for another part of "What's new on the cloud for data engineers" Let's see what happened in the last 5 months.

article thumbnail

Semantic Layers are the Missing Piece for AI-Enabled Analytics

KDnuggets

Integrating a semantic layer with Language Learning Models (LLMs) presents a clean solution to this, particularly in the realm of AI chatbots. This combination empowers businesses to generate fast responses and reports based on their data. Leveraging AI and semantic layers is advancing business intelligence, making it easier than ever for people to interact with data.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Why Your Team Needs To Implement Data Quality For Your AI Strategy

Seattle Data Guy

Companies that range from start-ups to enterprises are looking to implement AI and ML into their data strategy. With that it’s important not to forget about data quality. Regardless of how fancy or sophisticated a company’s AI model might be, poor data quality will break it. It will make the outputs of these models useless… Read more The post Why Your Team Needs To Implement Data Quality For Your AI Strategy appeared first on Seattle Data Guy.

Data 130
article thumbnail

New with Confluent Platform: Seamless Migration Off ZooKeeper, Arm64 Support, and More

Confluent

Confluent Platform 7.6 brings upgrading for existing clusters from ZooKeeper to KRaft, compaction support for Tiered Storage, OAuth (early access), improvements to the Oracle CDC premium connector, and more.

More Trending

article thumbnail

Generative AI Playground: Text-to-Image Stable Diffusion with Stability AI, Stable Diffusion XL, and CompVis on the Latest Intel® GPU

KDnuggets

Stable Diffusion models are revolutionizing digital artistry, transforming mere text into stunning, lifelike images. Explore further here.

140
140
article thumbnail

Databricks adds new migration Brickbuilder Solutions to help customers succeed with AI

databricks

For the past two years, Databricks has collaborated with leading consulting partners to build innovative solutions for industry, migration, and data and AI.

article thumbnail

Snowflake’s Data Classification Lets You Identify and Tag Sensitive Data Directly in Snowsight

Snowflake

At Snowflake, we believe in empowering our customers to harness the full potential of their data while maintaining robust compliance standards and safeguarding data privacy. We recognize the critical importance of quickly identifying and safeguarding sensitive data objects, and we consistently strive to provide solutions that help achieve these goals — from advancements such as classification and tag-based policies to the intuitive Data Governance UI.

Data 101
article thumbnail

Introducing SafeTest: A Novel Approach to Front End Testing

Netflix Tech

by Moshe Kolodny In this post, we’re excited to introduce SafeTest, a revolutionary library that offers a fresh perspective on End-To-End (E2E) tests for web-based User Interface (UI) applications. The Challenges of Traditional UI Testing Traditionally, UI tests have been conducted through either unit testing or integration testing (also referred to as End-To-End (E2E) testing).

Coding 100
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

What Is Data Lineage, And Why Does It Matter?

KDnuggets

If you’ve ever had conversations with data professionals, you’ve probably heard “data lineage” pop up quite a few times. So what is data lineage all about, and why is it important?

IT 117
article thumbnail

Strategic Thinking For Business Planning

Knowledge Hut

An organization without strategic planning is like a ship without the captain. Developing strategic thinking into a plan and executing is inevitable for the businesses to compete in the global marketing environment. Strategic thinking, planning, and execution encompass all the concerns pertaining to ambitious goals, prevailing conditions, limitations, challenges, competence, changing customers’ preferences etc.

BI 98
article thumbnail

Meta loves Python

Engineering at Meta

By now you’re already aware that Python 3.12 has been released. But did you know that several of its new features were developed by Meta ? Meta engineer Pascal Hartig ( @passy ) is joined on the Meta Tech Podcast by Itamar Oren and Carl Meyer, two software engineers at Meta, to discuss their teams’ contributions to the latest Python release, including new hooks that allow for custom JITs like Cinder , Immortal Objects , improvements to the type system, faster comprehensions, and more.

Python 98
article thumbnail

New Snowflake Features Released in January 2024

Snowflake

Snowflake kicked off 2024 with exciting releases, including Snowpark Model Registry, Streamlit in Snowflake for Azure, and new enhancements around security features in Snowflake Horizon. Read on to learn more about everything we announced in January. Snowpark Updates Model management with the Snowpark Model Registry – public preview Snowpark Model Registry is an integrated solution to register, manage and use models and their metadata natively in Snowflake.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Master The Art Of Command Line With This GitHub Repository

KDnuggets

Whether you are a beginner or an experienced user, this guide is perfect for familiarizing yourself with basic and advanced command line tools.

article thumbnail

Essential Hacks To Become A CBAP Certified Professional

Knowledge Hut

Certified Business Analysis Professional ( CBAP ®) offered by the International Institute of Business Analysis (IIBA®) is the most prestigious professional certification that a business analyst can do. In order to tackle and conquer the exam you need meticulous planning, diligent and honest preparation and confidence in facing the exam. This article intends to provide guidance on passing the examination from the time of filling up the application form.

article thumbnail

Back to the Financial Regulatory Future

Cloudera

It’s hard to believe it’s been 15 years since the global financial crisis of 2007/2008. While this might be a blast from the past we’d rather leave in the proverbial rear-view mirror, in March of 2023 we were back to the future with the collapse of Silicon Valley Bank (SVB), the largest US bank to fail since 2008. While there are clear reasons SVB collapsed, which can be reviewed here , my purpose in this post isn’t to rehash the past but to present some of the regulatory and compliance c

article thumbnail

Four Questions to Consider When Navigating the Rapid Evolution of Generative AI

Snowflake

A strategic approach to data and talent strategies will distinguish leaders in a transformed business landscape. What might that look like? Generative AI’s (gen AI) capabilities seemed startlingly novel a year ago, when ChatGPT’s release led to an explosion of public usage and, simultaneously, intense debate about its potential societal and business impacts.

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Synthetic Data for Machine Learning

KDnuggets

You don't always have high-quality labeled datasets for supervised machine learning. Learn about why you should augment your real data with synthetic data as well as the ways to generate it.

article thumbnail

How to Encourage Knowledge Sharing in the Workplace?

Knowledge Hut

The conventional practices of working have changed in this 21st century. People cannot just stay as a specialist for a single area in the workplace anymore. These days, many people attend Agile Management certification courses online to learn the art of agility or how to become more adaptive at workplaces, which is a brilliant idea! They must have at least a preliminary knowledge of the job functions of other people in their company.

article thumbnail

Experiment Faster and with Less Effort

DoorDash Engineering

Business Policy Experiments Using Fractional Factorial Designs At DoorDash, we constantly strive to improve our experimentation processes by addressing four key dimensions, including velocity to increase how many experiments we can conduct, toil to minimize our launch and analysis efforts, rigor to ensure a sound experimental design and robustly efficient analyses, and efficiency to reduce costs associated with our experimentation efforts.

article thumbnail

Apache Beam: Data Processing, Data Pipelines, Dataflow and Flex Templates

Towards Data Science

In this first article, we’re exploring Apache Beam, from a simple pipeline to a more complicated one, using GCP Dataflow.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Free Data Engineering Course for Beginners

KDnuggets

Interested in data engineering but don't know where to start? Get up to speed in data engineering fundamentals with this free course.

article thumbnail

Core Business Components in Business Analysis

Knowledge Hut

In today’s competitive world, the role of a business analyst has become one of the key elements for any business entity. In order to understand the role of a business analyst it is imperative to, first of all, understand what a business is and the complexities involved in operating a business entity. The definition of a business analyst as described in the BABOK® can be decoded as follows.

article thumbnail

One map to rule them all

ArcGIS

All we have to decide is what to map with the time that is given us.

117
117
article thumbnail

Just In:  New Symbols on the Robinhood 24 Hour Market

Robinhood

Robinhood is the only US retail brokerage to offer 24/5 trading of single name stocks At Robinhood, we believe the future of investing is 24/7. That’s why we launched the Robinhood 24 Hour Market last year, to give our customers unprecedented flexibility and access to the markets. Today, we’re excited to announce that we’ve expanded the total number of symbols available overnight in the US to 922.

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

3 Research-Driven Advanced Prompting Techniques for LLM Efficiency and Speed Optimization

KDnuggets

This article has explored three promising prompting techniques that have been developed to reduce the occurrence of hallucinations in large language models.

115
115
article thumbnail

Get Your Business Analysis Experience Professionally Recognised With CBAP

Knowledge Hut

Business analysis is the discipline of recognizing change in an organization and driving that change. It’s about delivering as much value as possible to the stakeholders and ensuring that the organization profits from changes. As a business analyst, you must have played a key role in developing your organization and meeting its business needs from time to time.

article thumbnail

Introducing Confluent’s Migration Accelerator: Accelerate Your Journey to a Complete Data Streaming Platform

Confluent

Confluent Migration Accelerator, a new program in partnership with the Confluent partner ecosystem to jump-start organizations' data streaming journeys.

article thumbnail

A Major Step Forward For Generative AI and Vector Database Observability

Monte Carlo

Organizations are racing to deploy generative AI applications to unlock new sources of value and stave off potential disruptors as this transformative technology takes hold. LLMs have quickly become plug-and-play APIs while smaller specialized models are becoming rapidly commoditized as well. To differentiate and expand the usefulness of these models, organizations must augment them with first-party data – typically via a process called RAG (retrieval augmented generation).

article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.