Sat.Oct 12, 2024 - Fri.Oct 18, 2024

article thumbnail

Roadmap for AI Engineers

KDnuggets

Learn about 10 easy steps to becoming an AI engineer in 2024.

article thumbnail

Did Automattic commit open source theft?

The Pragmatic Engineer

The below was originally published in The Pragmatic Engineer. To get timely analysis on the tech industry like this, on a weekly basis: sign up to The Pragmatic Engineer Newsletter. If you are into podcasts, check out The Pragmatic Engineer Podcast. Imagine Apple decided Spotify was a big enough business threat that it had to take unfair measures to limit Spotify’s growth on the App Store.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

25 SQL tips to level up your data engineering skills

Start Data Engineering

Introduction Setup SQL tips 1. Handy functions for common data processing scenarios 1.1. Need to filter on WINDOW function without CTE/Subquery use QUALIFY 1.2. Need the first/last row in a partition, use DISTINCT ON 1.3. STRUCT data types are sorted based on their keys from left to right 1.4. Get the first/last element with ROW_NUMBER() + QUALIFY 1.5.

SQL 130
article thumbnail

The Rise of the Declarative Data Stack

Simon Späti

Data stacks have come a long way, evolving from monolithic, one-fits-all systems like Oracle/SAP to today’s modular open data stacks. This begs the question, what’s next? Or why is the current not meeting our needs? As we see more analytics engineering and software best practices, embracing codeful, Git-based, and more CLI-based workflows, the future looks more code-first.

SQL 130
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

10 GitHub Repositories for Advanced Machine Learning Projects

KDnuggets

Where can you find projects dealing with advanced ML topics? GitHub is a perfect source with its many repositories. I’ve selected ten to talk about in this article.

article thumbnail

Interesting startup idea: benchmarking cloud platform pricing

The Pragmatic Engineer

Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one section from this week’s from last week’s The Pulse issue. To get full issues twice a week, subscribe here.

Cloud 273

More Trending

article thumbnail

The Enterprise Case for DuckDB: 5 Key Categories and Why Use It

Simon Späti

DuckDB has a significant share and is frequently featured in the latest data engineering news. However, it’s still in its early adopter phase and has yet to be adopted by larger enterprises. Sure, all data creators and startups have used and potentially grown to love DuckDB, but is it also suitable for enterprises? What about scaling out and sharing it with others in the organization?

IT 130
article thumbnail

What is the WordPress drama about?

Confessions of a Data Guy

I figured a few of us might need the WordPress drama explained like we are 5. So, here you go. WordPress is the GOAT of internet website builders WordPress was founded by Matt Mullenweg With much of the internet running on WordPress … hosting WordPress is of course … lucrative and a big business. The […] The post What is the WordPress drama about?

Data 113
article thumbnail

A Data Scientist GenAI Survival Guide

KDnuggets

This guide emphasizes the growing significance of GenAI but also highlights the crucial role that data scientists play in harnessing this technology to solve real-world problems.

Data 127
article thumbnail

Meta’s open AI hardware vision

Engineering at Meta

At the Open Compute Project (OCP) Global Summit 2024, we’re showcasing our latest open AI hardware designs with the OCP community. These innovations include a new AI platform, cutting-edge open rack designs, and advanced network fabrics and components. By sharing our designs, we hope to inspire collaboration and foster innovation. If you’re passionate about building the future of AI, we invite you to engage with us and OCP to help shape the next generation of open hardware for AI.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Introducing the New SQL Editor

databricks

Over the last few years, we've seen tremendous growth and adoption of Databricks SQL , our intelligent data warehouse purpose-built on the Data.

SQL 124
article thumbnail

How to make the PEFECT Pull Request (PR)

Confessions of a Data Guy

Is there anything worse than the PR process (Pull Request) at most companies? Probably not. It’s the dreaded 600-pound gorilla in the room that no one wants to talk about. Everyone hates it, everyone has to do it. But, it doesn’t have to be like that. There are a few tried and true ways to […] The post How to make the PEFECT Pull Request (PR) appeared first on Confessions of a Data Guy.

Process 100
article thumbnail

7 Free Data Science Platform for Beginners

KDnuggets

Explore free platforms for learning, building portfolios, accessing code editors, engaging with communities, and hosting projects.

article thumbnail

OCP Summit 2024: The open future of networking hardware for AI

Engineering at Meta

At Open Compute Project Summit (OCP) 2024, we’re sharing details about our next-generation network fabric for our AI training clusters. We’ve expanded our network hardware portfolio and are contributing two new disaggregated network fabrics and a new NIC to OCP. We look forward to continued collaboration with OCP to open designs for racks, servers, storage boxes, and motherboards to benefit companies of all sizes across the industry.

article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Attribute serverless costs to departments and users with budget policies

databricks

We are excited to announce the Public Preview of Databricks serverless budget policies. Administrators can use budget policies to ensure that the correct.

113
113
article thumbnail

Container Runtime: GPU Training & Inference with Snowflake Notebooks

Snowflake

Predictive machine learning continues to be a cornerstone of data-driven decision-making. However, as organizations accumulate more data in a wide variety of forms, and as modeling techniques continue to advance, the tasks of a data scientist and ML engineer are becoming increasingly complex. Oftentimes, more effort is spent on managing infrastructure, jumping through package management hurdles, and dealing with scalability issues than on actual model development.

Food 94
article thumbnail

Practical Solutions for AI workloads in the Enterprise

KDnuggets

This is a comprehensive resource for developers at all levels, whether they are just starting in AI or are looking to refine their expertise further.

116
116
article thumbnail

Investigation of a Workbench UI Latency Issue

Netflix Tech

By: Hechao Li and Marcelo Mayworm With special thanks to our stunning colleagues Amer Ather , Itay Dafna , Luca Pozzi , Matheus Leão , and Ye Ji. Overview At Netflix, the Analytics and Developer Experience organization, part of the Data Platform, offers a product called Workbench. Workbench is a remote development workspace based on Titus that allows data practitioners to work with big data and machine learning use cases at scale.

article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

article thumbnail

Building Confidence in Your Genie Space with Benchmarks and Ask for Review

databricks

AI/BI Genie is a conversational experience for business teams to self-serve insights from their data through natural language. Genie leverages generative AI tailored.

BI 98
article thumbnail

Govern an Open Lakehouse with Snowflake Open Catalog

Snowflake

To enhance security and ease operational burden, many organizations with data lakes or lakehouses want flexibility to securely integrate their tools of choice on a single copy of data. An open standard for storage format and catalog API has helped, but there’s still a need for open standards for the catalog, including a consistent way to apply security access controls to data.

article thumbnail

The Key to LLMs: A Mathematical Understanding of Word Embeddings

KDnuggets

Word embeddings is a numerical representation of text, allowing a computer to process words efficiently by converting words into numerical vectors that can be processed with machine learning algorithm.

Algorithm 106
article thumbnail

Preparing the Consumer Fetch: Kafka Producer and Consumer Internals, Part 3

Confluent

Third installment of the Producer/Consumer Internals series that covers preparing the consumer fetch: how consumers interact with brokers, coordinate partitions, and send requests.

Kafka 80
article thumbnail

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

article thumbnail

Announcing Azure Cobalt 100 VMs: Powering the Future of Azure Databricks

databricks

At Databricks, we are constantly innovating and optimizing our platform to ensure that our customers can maximize the value of their data and.

Data 96
article thumbnail

Snowflake Migration Success Stories: Core Digital Media and NAVEX

Snowflake

Many of our customers — from Marriott to AT&T — start their journey with the Snowflake AI Data Cloud by migrating their data warehousing workloads to the platform. For organizations who are considering moving from a legacy data warehouse to Snowflake, are looking to learn more about how the AI Data Cloud can support legacy Hadoop use cases, or are struggling with a cloud data warehouse that just isn’t scaling anymore, it often helps to see how others have done it.

article thumbnail

How to Create Custom Educational Podcasts with NotebookLM

KDnuggets

Creating custom educational podcasts with NotebookLM is straightforward. Simply upload your content, customize the audio output to highlight key topics, and export your final podcast for sharing with your audience.

Education 106
article thumbnail

The Legend Awakens: Introducing Robinhood Legend, Futures Trading, and Index Options

Robinhood

Robinhood is rolling out a suite of new advanced trading tools built from the ground up for active traders Today, to kick off HOOD Summit –our first-ever customer-focused conference geared towards active traders–we announced Robinhood Legend, a powerful, sleek desktop trading platform built specifically for active traders. We’re also launching futures trading and index options on mobile.

article thumbnail

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

article thumbnail

Smart Airport: Elevating Airport Safety and Security to New Heights with GIS

ArcGIS

Geographic Information Systems (GIS), are revolutionizing airport safety and security, and driving the transformation into smart airports.

Systems 85
article thumbnail

Inspiring Success Stories from our 2024 Data Integrity Award Winners

Precisely

This year, our annual Data Integrity Summit, Trust ’24, was better than ever – and a big part of what made the event so exciting was our first-ever Data Integrity Awards ! We were thrilled to shine the spotlight on customers driving AI innovation, business results, and societal impact. As you plan for 2025, get inspired with their success stories today.

article thumbnail

How to Learn Python the Lazy Way

KDnuggets

The title says everything. It is a guide for lazy people who want to learn Python and earn dollars.

Python 126
article thumbnail

Hill-climbing towards good plannings with TimeFold

Picnic Engineering

Each day, Picnic delivers groceries to tens of thousands of customers. To do this as efficiently as possible, operations follow a plan created the day before the actual deliveries. This so-called Master Planning Process (MPP) represents the planning of the Picnic supply chain path for every ordered article from the Fulfilment Centre to the customer’s door.

article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.