article thumbnail

Data Pruning MNIST: How I Hit 99% Accuracy Using Half the Data

Towards Data Science

Building more efficient AI TLDR : Data-centric AI can create more efficient and accurate models. The standard algorithm was too slow for my CPU given all thetests. MNIST handwritten digit database. I experimented with data pruning on MNIST to classify handwritten digits. Image byauthor. Setting up a proper test harness was key.

article thumbnail

Preventing Fraud at Robinhood using Graph Intelligence

Robinhood

Part 2: Types of graph intelligence for combating fraud To gain intelligence for combating fraud via graph, there are two graph algorithms. -> Type 1: Vertex-centric intelligence Vertex-centric graph intelligence helps us quantify the likelihood that the user is a bad actor.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data News — Week 23.14

Christophe Blefari

At the same time Maxime Beauchemin wrote a post about Entity-Centric data modeling. Rare footage of a foundation model ( credits ) Fast News ⚡️ Twitter's recommendation algorithm — It was an Elon tweet. But the algorithm as a whole contains a lot of features, filters and network algorithms.

article thumbnail

Data News — Week 13.14

Christophe Blefari

At the same time Maxime Beauchemin wrote a post about Entity-Centric data modeling. Rare footage of a foundation model ( credits ) Fast News ⚡️ Twitter's recommendation algorithm — It was an Elon tweet. But the algorithm as a whole contains a lot of features, filters and network algorithms.

article thumbnail

The Race For Data Quality in a Medallion Architecture

DataKitchen

Bronze layers can also be the raw database tables. In that case, a practical approach is to set up periodic polling of the Silver layer database to run data quality tests and check for anomalies at scheduled intervals. Bronze layers should be immutable. Alternatively, suppose you do not control the ingestion code.

article thumbnail

10 Lessons from 10 Years of Innovation and Engineering at Picnic

Picnic Engineering

A decade ago, Picnic set out to reinvent grocery shopping with a tech-first, customer-centric approach. For instance, we built self-service tools for all our engineers that allow them to handle tasks like environment setup, database management, or feature deployment effectively.

article thumbnail

The Rise of the Data Engineer

Maxime Beauchemin

Storage and compute is cheaper than ever, and with the advent of distributed databases that scale out linearly, the scarcer resource is engineering time. The use of natural, human readable keys and dimension attributes in fact tables is becoming more common, reducing the need for costly joins that can be heavy on distributed databases.