Thu.Sep 19, 2024

article thumbnail

Paying down tech debt: further learnings

The Pragmatic Engineer

This is a follow-up to the article Paying down tech debt , written by industry veteran Lou Franco. Lou has been in the software business for over 30 years as an engineer, EM, and executive. He’s also worked at four startups and the companies that later acquired them; most recently Atlassian as a Principal Engineer on the Trello iOS app. Later this year, he’s publishing a book on tech debt.

article thumbnail

Data Modeling in the Brave New Lakehouse World

Confessions of a Data Guy

It is a Brave New World out there these days. The new tools and features come out faster than your mom on Sunday morning getting you ready for church. The same goes for the context and advice being produced on a myriad of platforms, the ole’ Like and Subscribe, and all that bit. It does […] The post Data Modeling in the Brave New Lakehouse World appeared first on Confessions of a Data Guy.

Data 113
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Partial Functions in Python: A Guide for Developers

KDnuggets

In Python, functions often require multiple arguments, and you may find yourself repeatedly passing the same values for certain parameters. This is where partial functions can help. Python’s built-in functools module allows you to create partial functions.

Python 119
article thumbnail

Snowflake Acquires Night Shift Development, Inc. to Accelerate Growth in US Public Sector

Snowflake

Data is increasingly becoming critical for the public sector — from guiding decisions in higher education to enhancing citizen services and streamlining government operations. Government agencies are overwhelmed with data, whether it be structured, like incident logs, or unstructured, like satellite images. Harnessing the vast amount of data can become a burden for any organization, yet the insights have the potential to significantly improve quality of life and strengthen national security.

article thumbnail

Building Your BI Strategy: How to Choose a Solution That Scales and Delivers

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

article thumbnail

How to Visualize Data with ggplot2 in R

KDnuggets

ggplot2 is a tool in R for making charts. You can create charts with dots, bars, or lines. You can also add layers to show more details. This article will help you learn how to use ggplot2 to create visualizations. Getting started with ggplot2 Before using ggplot2, you need to install it and.

Data 108
article thumbnail

Fine-tuning Llama 3.1 with Long Sequences

databricks

Mosaic AI Model Training now supports fine-tuning up to 131K context length for Llama 3.1 models. More efficient training at long sequence lengths is made possible by several optimizations highlighted in this post.

76

More Trending

article thumbnail

Establish your Generative AI expertise with the latest Databricks certification

databricks

The value of Generative AI, the deepened investment Databricks has made in the space, and how customers have benefited from the certification.

article thumbnail

Small Data, Big Impact: Insights from MotherDuck’s Jacob Matson

Striim

What makes MotherDuck and DuckDB a game-changer for data analytics? Join us as we sit down with Jacob Matson, a renowned expert in SQL Server, dbt, and Excel, who recently became a developer advocate at MotherDuck. During this episode, Jacob shares his compelling journey to MotherDuck, driven by his frequent use of DuckDB for solving data challenges.

BI 52
article thumbnail

AI Success – Powered by Data Governance and Quality

Precisely

Key Takeaways: Data integrity is essential for AI success and reliability – helping you prevent harmful biases and inaccuracies in AI models. Robust data governance for AI ensures data privacy, compliance, and ethical AI use. Proactive data quality measures are critical, especially in AI applications. Using AI systems to analyze and improve data quality both benefits and contributes to the generation of high-quality data.

article thumbnail

What is PgMP Certification? Requirements, Fees & Format

Knowledge Hut

Are you working on many big projects with lots of smaller tasks that all depend on each other? If you become a PgMP-certified pro, you can make things run smoothly. You can learn more about this by taking a PgMP certification course. Getting PgMP certified doesn't just help your career—it also teaches you important things about how organizations work.

article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.

article thumbnail

Performing Multidimensional Analysis with TEMPO Nitrogen Dioxide Data

ArcGIS

Guide to TEMPO NO2 multidimensional analysis in ArcGIS Pro 3.3. Track air quality changes and gain insights for public health and environmental monitoring.

52
article thumbnail

Introducing Databricks Assistant Quick Fix

databricks

Today, we're excited to introduce Databricks Assistant Quick Fix , a powerful new feature designed to automatically correct common, single-line errors such as.

Designing 105
article thumbnail

Feature Caching for Recommender Systems w/ Cachelib

Pinterest Engineering

Li Tang; Sr. Software Engineer | Saurabh Vishwas Joshi; Sr. Staff Software Engineer | Zhiyuan Zhang; Sr. Manager, Engineering | At Pinterest, we operate a large-scale online machine learning inference system, where feature caching plays a critical role to achieve optimal efficiency. In this blog post, we will discuss our decision to adopt Cachelib project by Meta Open Source (“Cachelib”) and how we have built a high-throughput, flexible feature cache by leveraging and expanding upon the capabili

Systems 43
article thumbnail

Key Takeaways from Snowflake Industry Day 2024

Snowflake

Building on the momentum from Snowflake Summit , where Snowflake announced the rollout of dozens of new features, this year’s Industry Day showcased the numerous ways these capabilities can be put to use, particularly in an AI- and ML-driven world. In his keynote address , Snowflake CEO Sridhar Ramaswamy explained how the AI Data Cloud aligns with customers’ AI and data strategies, highlighting the platform’s unique position to achieve enterprise AI goals.

article thumbnail

Launching LLM-Based Products: From Concept to Cash in 90 Days

Speaker: Christophe Louvion, Chief Product & Technology Officer of NRC Health and Tony Karrer, CTO at Aggregage

Christophe Louvion, Chief Product & Technology Officer of NRC Health, is here to take us through how he guided his company's recent experience of getting from concept to launch and sales of products within 90 days. In this exclusive webinar, Christophe will cover key aspects of his journey, including: LLM Development & Quick Wins 🤖 Understand how LLMs differ from traditional software, identifying opportunities for rapid development and deployment.