Thu.Sep 19, 2024

article thumbnail

Paying down tech debt: further learnings

The Pragmatic Engineer

This is a follow-up to the article Paying down tech debt , written by industry veteran Lou Franco. Lou has been in the software business for over 30 years as an engineer, EM, and executive. He’s also worked at four startups and the companies that later acquired them; most recently Atlassian as a Principal Engineer on the Trello iOS app. Later this year, he’s publishing a book on tech debt.

article thumbnail

Data Modeling in the Brave New Lakehouse World

Confessions of a Data Guy

It is a Brave New World out there these days. The new tools and features come out faster than your mom on Sunday morning getting you ready for church. The same goes for the context and advice being produced on a myriad of platforms, the ole’ Like and Subscribe, and all that bit. It does […] The post Data Modeling in the Brave New Lakehouse World appeared first on Confessions of a Data Guy.

Data 113
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Fine-tuning Llama 3.1 with Long Sequences

databricks

Mosaic AI Model Training now supports fine-tuning up to 131K context length for Llama 3.1 models. More efficient training at long sequence lengths is made possible by several optimizations highlighted in this post.

133
133
article thumbnail

Partial Functions in Python: A Guide for Developers

KDnuggets

In Python, functions often require multiple arguments, and you may find yourself repeatedly passing the same values for certain parameters. This is where partial functions can help. Python’s built-in functools module allows you to create partial functions.

Python 121
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Establish your Generative AI expertise with the latest Databricks certification

databricks

The value of Generative AI, the deepened investment Databricks has made in the space, and how customers have benefited from the certification.

article thumbnail

How to Visualize Data with ggplot2 in R

KDnuggets

ggplot2 is a tool in R for making charts. You can create charts with dots, bars, or lines. You can also add layers to show more details. This article will help you learn how to use ggplot2 to create visualizations. Getting started with ggplot2 Before using ggplot2, you need to install it and.

Data 108

More Trending

article thumbnail

10 GitHub Repositories for Deep Learning Enthusiasts

KDnuggets

Image generated with FLUX.1 [dev] and edited with Canva Pro The 10 GitHub Repository Education Series has been a hit among readers, so here is another list to help you master the basics of deep learning. This collection will guide you through understanding popular deep learning frameworks and various model architectures. In short, you.

article thumbnail

Snowflake Acquires Night Shift Development, Inc. to Accelerate Growth in US Public Sector

Snowflake

Data is increasingly becoming critical for the public sector — from guiding decisions in higher education to enhancing citizen services and streamlining government operations. Government agencies are overwhelmed with data, whether it be structured, like incident logs, or unstructured, like satellite images. Harnessing the vast amount of data can become a burden for any organization, yet the insights have the potential to significantly improve quality of life and strengthen national security.

article thumbnail

AI Success – Powered by Data Governance and Quality

Precisely

Key Takeaways: Data integrity is essential for AI success and reliability – helping you prevent harmful biases and inaccuracies in AI models. Robust data governance for AI ensures data privacy, compliance, and ethical AI use. Proactive data quality measures are critical, especially in AI applications. Using AI systems to analyze and improve data quality both benefits and contributes to the generation of high-quality data.

article thumbnail

Performing Multidimensional Analysis with TEMPO Nitrogen Dioxide Data

ArcGIS

Guide to TEMPO NO2 multidimensional analysis in ArcGIS Pro 3.3. Track air quality changes and gain insights for public health and environmental monitoring.

Data 61
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Small Data, Big Impact: Insights from MotherDuck’s Jacob Matson

Striim

What makes MotherDuck and DuckDB a game-changer for data analytics? Join us as we sit down with Jacob Matson, a renowned expert in SQL Server, dbt, and Excel, who recently became a developer advocate at MotherDuck. During this episode, Jacob shares his compelling journey to MotherDuck, driven by his frequent use of DuckDB for solving data challenges.

BI 52
article thumbnail

What is PgMP Certification? Requirements, Fees & Format

Knowledge Hut

Are you working on many big projects with lots of smaller tasks that all depend on each other? If you become a PgMP-certified pro, you can make things run smoothly. You can learn more about this by taking a PgMP certification course. Getting PgMP certified doesn't just help your career—it also teaches you important things about how organizations work.

article thumbnail

Feature Caching for Recommender Systems w/ Cachelib

Pinterest Engineering

Li Tang; Sr. Software Engineer | Saurabh Vishwas Joshi; Sr. Staff Software Engineer | Zhiyuan Zhang; Sr. Manager, Engineering | At Pinterest, we operate a large-scale online machine learning inference system, where feature caching plays a critical role to achieve optimal efficiency. In this blog post, we will discuss our decision to adopt Cachelib project by Meta Open Source (“Cachelib”) and how we have built a high-throughput, flexible feature cache by leveraging and expanding upon the capabili

Systems 56
article thumbnail

Key Takeaways from Snowflake Industry Day 2024

Snowflake

Building on the momentum from Snowflake Summit , where Snowflake announced the rollout of dozens of new features, this year’s Industry Day showcased the numerous ways these capabilities can be put to use, particularly in an AI- and ML-driven world. In his keynote address , Snowflake CEO Sridhar Ramaswamy explained how the AI Data Cloud aligns with customers’ AI and data strategies, highlighting the platform’s unique position to achieve enterprise AI goals.

article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.