Sat.Oct 12, 2024 - Fri.Oct 18, 2024

article thumbnail

Roadmap for AI Engineers

KDnuggets

Learn about 10 easy steps to becoming an AI engineer in 2024.

article thumbnail

Did Automattic commit open source theft?

The Pragmatic Engineer

The below was originally published in The Pragmatic Engineer. To get timely analysis on the tech industry like this, on a weekly basis: sign up to The Pragmatic Engineer Newsletter. If you are into podcasts, check out The Pragmatic Engineer Podcast. Imagine Apple decided Spotify was a big enough business threat that it had to take unfair measures to limit Spotify’s growth on the App Store.

article thumbnail

Meta’s open AI hardware vision

Engineering at Meta

At the Open Compute Project (OCP) Global Summit 2024, we’re showcasing our latest open AI hardware designs with the OCP community. These innovations include a new AI platform, cutting-edge open rack designs, and advanced network fabrics and components. By sharing our designs, we hope to inspire collaboration and foster innovation. If you’re passionate about building the future of AI, we invite you to engage with us and OCP to help shape the next generation of open hardware for AI.

article thumbnail

Introducing the New SQL Editor

databricks

Over the last few years, we've seen tremendous growth and adoption of Databricks SQL , our intelligent data warehouse purpose-built on the Data.

SQL 131
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

10 GitHub Repositories for Advanced Machine Learning Projects

KDnuggets

Where can you find projects dealing with advanced ML topics? GitHub is a perfect source with its many repositories. I’ve selected ten to talk about in this article.

article thumbnail

Interesting startup idea: benchmarking cloud platform pricing

The Pragmatic Engineer

Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one section from this week’s from last week’s The Pulse issue. To get full issues twice a week, subscribe here.

Cloud 278

More Trending

article thumbnail

The Rise of the Declarative Data Stack

Simon Späti

Data stacks have come a long way, evolving from monolithic, one-fits-all systems like Oracle/SAP to today’s modular open data stacks. This begs the question, what’s next? Or why is the current not meeting our needs? As we see more analytics engineering and software best practices, embracing codeful, Git-based, and more CLI-based workflows, the future looks more code-first.

SQL 130
article thumbnail

A Data Scientist GenAI Survival Guide

KDnuggets

This guide emphasizes the growing significance of GenAI but also highlights the crucial role that data scientists play in harnessing this technology to solve real-world problems.

Data 150
article thumbnail

Attribute serverless costs to departments and users with budget policies

databricks

We are excited to announce the Public Preview of Databricks serverless budget policies. Administrators can use budget policies to ensure that the correct.

128
128
article thumbnail

How to use nested data types effectively in SQL

Start Data Engineering

1. Introduction 2. Code & Data 3. Using nested data types effectively 3.1. Use STRUCT for one-to-one & hierarchical relationships 3.2. Use ARRAY[STRUCT] for one-to-many relationships 3.3. Using nested data types in data processing 3.3.1. STRUCT enables more straightforward data schema and data access 3.3.2. Nested data types can be sorted 3.3.3.

SQL 130
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

The Enterprise Case for DuckDB: 5 Key Categories and Why Use It

Simon Späti

DuckDB has a significant share and is frequently featured in the latest data engineering news. However, it’s still in its early adopter phase and has yet to be adopted by larger enterprises. Sure, all data creators and startups have used and potentially grown to love DuckDB, but is it also suitable for enterprises? What about scaling out and sharing it with others in the organization?

IT 130
article thumbnail

What Can AI Do for Data Science?

KDnuggets

Check out these 10 use cases for AI to shine.

article thumbnail

Building Confidence in Your Genie Space with Benchmarks and Ask for Review

databricks

AI/BI Genie is a conversational experience for business teams to self-serve insights from their data through natural language. Genie leverages generative AI tailored.

BI 120
article thumbnail

OCP Summit 2024: The open future of networking hardware for AI

Engineering at Meta

At Open Compute Project Summit (OCP) 2024, we’re sharing details about our next-generation network fabric for our AI training clusters. We’ve expanded our network hardware portfolio and are contributing two new disaggregated network fabrics and a new NIC to OCP. We look forward to continued collaboration with OCP to open designs for racks, servers, storage boxes, and motherboards to benefit companies of all sizes across the industry.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

What is the WordPress drama about?

Confessions of a Data Guy

I figured a few of us might need the WordPress drama explained like we are 5. So, here you go. WordPress is the GOAT of internet website builders WordPress was founded by Matt Mullenweg With much of the internet running on WordPress … hosting WordPress is of course … lucrative and a big business. The […] The post What is the WordPress drama about?

Data 113
article thumbnail

Practical Solutions for AI workloads in the Enterprise

KDnuggets

This is a comprehensive resource for developers at all levels, whether they are just starting in AI or are looking to refine their expertise further.

145
145
article thumbnail

Announcing Azure Cobalt 100 VMs: Powering the Future of Azure Databricks

databricks

At Databricks, we are constantly innovating and optimizing our platform to ensure that our customers can maximize the value of their data and.

Data 114
article thumbnail

Container Runtime: GPU Training & Inference with Snowflake Notebooks

Snowflake

Predictive machine learning continues to be a cornerstone of data-driven decision-making. However, as organizations accumulate more data in a wide variety of forms, and as modeling techniques continue to advance, the tasks of a data scientist and ML engineer are becoming increasingly complex. Oftentimes, more effort is spent on managing infrastructure, jumping through package management hurdles, and dealing with scalability issues than on actual model development.

Food 98
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

How to make the PEFECT Pull Request (PR)

Confessions of a Data Guy

Is there anything worse than the PR process (Pull Request) at most companies? Probably not. It’s the dreaded 600-pound gorilla in the room that no one wants to talk about. Everyone hates it, everyone has to do it. But, it doesn’t have to be like that. There are a few tried and true ways to […] The post How to make the PEFECT Pull Request (PR) appeared first on Confessions of a Data Guy.

Process 100
article thumbnail

5 Innovative Statistical Methods for Small Data Sets

KDnuggets

Various statistical methods you might never have known previously but useful for your workflow.

Data 143
article thumbnail

Smart Airport: Elevating Airport Safety and Security to New Heights with GIS

ArcGIS

Geographic Information Systems (GIS), are revolutionizing airport safety and security, and driving the transformation into smart airports.

Systems 86
article thumbnail

Govern an Open Lakehouse with Snowflake Open Catalog

Snowflake

To enhance security and ease operational burden, many organizations with data lakes or lakehouses want flexibility to securely integrate their tools of choice on a single copy of data. An open standard for storage format and catalog API has helped, but there’s still a need for open standards for the catalog, including a consistent way to apply security access controls to data.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Investigation of a Workbench UI Latency Issue

Netflix Tech

By: Hechao Li and Marcelo Mayworm With special thanks to our stunning colleagues Amer Ather , Itay Dafna , Luca Pozzi , Matheus Leão , and Ye Ji. Overview At Netflix, the Analytics and Developer Experience organization, part of the Data Platform, offers a product called Workbench. Workbench is a remote development workspace based on Titus that allows data practitioners to work with big data and machine learning use cases at scale.

article thumbnail

How to Learn Python the Lazy Way

KDnuggets

The title says everything. It is a guide for lazy people who want to learn Python and earn dollars.

Python 141
article thumbnail

Preparing the Consumer Fetch: Kafka Producer and Consumer Internals, Part 3

Confluent

Third installment of the Producer/Consumer Internals series that covers preparing the consumer fetch: how consumers interact with brokers, coordinate partitions, and send requests.

Kafka 80
article thumbnail

Snowflake Migration Success Stories: Core Digital Media and NAVEX

Snowflake

Many of our customers — from Marriott to AT&T — start their journey with the Snowflake AI Data Cloud by migrating their data warehousing workloads to the platform. For organizations who are considering moving from a legacy data warehouse to Snowflake, are looking to learn more about how the AI Data Cloud can support legacy Hadoop use cases, or are struggling with a cloud data warehouse that just isn’t scaling anymore, it often helps to see how others have done it.

article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Hill-climbing towards good plannings with TimeFold

Picnic Engineering

Each day, Picnic delivers groceries to tens of thousands of customers. To do this as efficiently as possible, operations follow a plan created the day before the actual deliveries. This so-called Master Planning Process (MPP) represents the planning of the Picnic supply chain path for every ordered article from the Fulfilment Centre to the customer’s door.

article thumbnail

7 Free Data Science Platform for Beginners

KDnuggets

Explore free platforms for learning, building portfolios, accessing code editors, engaging with communities, and hosting projects.

article thumbnail

The Legend Awakens: Introducing Robinhood Legend, Futures Trading, and Index Options

Robinhood

Robinhood is rolling out a suite of new advanced trading tools built from the ground up for active traders Today, to kick off HOOD Summit –our first-ever customer-focused conference geared towards active traders–we announced Robinhood Legend, a powerful, sleek desktop trading platform built specifically for active traders. We’re also launching futures trading and index options on mobile.

article thumbnail

Streamline RAG with New Document Preprocessing Features

Snowflake

As organizations increasingly seek to enhance decision-making and drive operational efficiencies by making knowledge in documents accessible via conversational applications, a RAG-based application framework has quickly become the most efficient and scalable approach. As RAG-based application development continues to grow, the solutions to process and manage the documents that power these applications need to evolve with scalability and efficiency in mind.

SQL 78
article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.