Love

Why Users Love Cleanlab

With a relentless focus on solving the hardest problem in GenAI – reliability, discover why users across industries love Cleanlab.

JUMP TO section

Customer

Google

Berkeley Research Group

BBVA

Gavagai

e2f, inc.

Uber

DeepLearning.AI

Delivery Hero

Rubrix

Scale.ai

VAST-OSINT

Recognai

Explosion AI

Oracle

OpenMined

Red Hat Experience Engineering

LifeOmic

Renumics

iRobot

Statistics Canada

Ludwig-Maximilians-Universität München

Tencent

Imagem

The Stakeholder Company (TSC)

Gojek

Shands Hospittal (UFHealth)

Harrison.ai

AIMotion Institute

Kaggle (Google - Isolated Sign Language Recognition Entry)

Ping An Insurance

Calmcode.io

BinIt

Engineer at Appen

Petco

TikTok

Google Senior Software Engineer uses Cleanlab to find data errors at scale.

“Cleanlab is well-designed, scalable and theoretically grounded: it accurately finds data errors, even on well-known and established datasets. After using it for a successful pilot project at Google, Cleanlab is now one of my go-to libraries for dataset cleanup.”

Patrick Violette, Senior Software Engineer at Google

Senior Data Scientist at Berkeley Research Group uses Cleanlab for data processing and labeling.

“[Cleanlab] really opened my eyes to the whole idea of confident learning. I’m currently checking out the Cleanlab package and I really appreciate the clean api. A lot of what we do involves questionable labels so we’re looking into making Cleanlab a standard processing step whenever we get labels.”

Karl Schliep, Senior Managing Consultant and Data Scientist at Berkeley Research Group

BBVA, one of the largest financial institutions in the world, used Cleanlab in an update of one of the functionalities offered by the BBVA app: the categorization of financial transactions.

“CleanLab helped us reduce the uncertainty of noise in the tags. This process enabled us to train the model, update the training set, and optimize its performance. The goal was to reduce the number of labeled transactions and make the model more efficient, requiring less time and dedication. This allows data scientists to focus on tasks that generate greater value for customers and organizations.”

David Muelas Recuenco, Expert Data Scientist at BBVA

Head of Data Science at Gavagai.io uses Cleanlab Studio for automated quality assurance of their in-house NLP annotation pipeline.

“[Cleanlab] allows me to upload a dataset and obtain a ranked list of all the potential label issues in the data in just a few clicks. The label issues can then be assessed and fixed right away in the GUI… Cleanlab Studio is a very effective solution to calm my nerves when it comes to label noise.”

Fredrik Olsson, PhD. Head of Data Science at Gavagai

Head of Data Science at e2f finds a 5 percent model improvement using Cleanlab, calling it “breakthrough work.”

“We have used Cleanlab to clean SRL benchmark dataset. The result was impressive. […] There was a significant improvment in the F1 score when training with the corrected data. 0.5 or 5% marginal improvements in both dev and test folds.”

Sam Shamsan, Head of Data Science at e2f, inc.

Uber AI Engineer uses Cleanlab for a data-centric AI challenge.

“Recently took part in a new kind of ML competition based on Andrew Ng’s idea of shifting focus from model-centric to data-centric AI. Found cleanlab, a useful package in supporting this data-centric movement. It is based on the field of confident learning and helps to detect and learn in the presence of noisy real world labels. Some of the most common datasets like ImageNet, CIFAR and MIST have errors too.”

Sanjeev Suresh, Machine Learning Engineer at UberAI

Founder of DeepLearning.AI recognizes Cleanlab founders.

“Curtis Northcutt and Anish Athalye at MIT and Jonas Mueller at Amazon trained a model to identify erroneous labels in popular datasets such as ImageNet, Amazon Reviews, and IMDB. Accuracy on a test set that’s rife with errors is not a true measure of a model’s ability, and bad labels in the test set have a disproportionate impact on bigger models.

It’s time for our community to shift from model-centric to data-centric Al development. Many state-of-the-art models work well enough that tinkering with their architecture yields little gain in many problems, and the most direct path to improved performance is to systematically improve the data your algorithm learns from.”

Andrew Ng, Founder of DeepLearning.AI

Delivery Hero Senior Data Scientist uses Cleanlab: “It works like a charm.”

“Noise in the labels … hearing this probably triggers cold chills to any scientist willing to train a model for production purposes. Today, I’ve discovered an amazing library that implements Confident Learning: Cleanlab! It identifies label errors in datasets, based on the principles of pruning noisy data, counting with probabilistic thresholds to estimate noise, and ranking examples to train with confidence.

Cleanlab makes it so simple to use, it’s well maintained, and with production standards. This thing works like a charm and can benefit your training in the context of high noise in your labels! One of the rare work/libraries that I wonder why it took me so long to discover…”

Hagop Dippel (Boghazdeklian), Senior Data Scientist at Delivery Hero.

Rubrix software library depends on Cleanlab to find label errors in text data.

“As shown recently by Curtis G. Northcutt et al. label errors are pervasive even in the most-cited test sets used to benchmark the progress of the field of machine learning. They introduce a new principled framework to “identify label errors, characterize label noise, and learn with noisy labels” called confident learning. It is open-sourced as the cleanlab Python package that supports finding, quantifying, and learning with label errors in data sets Rubrix provides built-in support for cleanlab and makes it a breeze to find potential label errors in your dataset.”

Find Errors with Cleanlab Tutorial on the Rubrix website.

Scale AI ML Engineer calls Cleanlab “theortically justified.”

“I think CleanLab is a very interesting idea and is one of the few methods I’ve seen that is both theoretically justified and work in practice.”

Will LeVine, Senior Machine Learning Researcher at Scale AI

VAST-OSINT CTO uses Cleanlab to build toxic-language detection model.

“A couple weeks ago, I found out about a Python library called “cleanlab” that can help identify mislabeled training data. […] Tonight, I took the sentence-labeled training data and threw it at cleanlab to see how well confident learning could identify the incorrect labels. These results look amazing to me. […] I like this. I really need to dig into this. If nothing else, this can help identify training data to TOSS if you don’t want to automate correction.”

David Knickerbocker, CTO at VAST-OSINT

Recognai Co-founder calls cleanlab is a ‘simple trick’ for ML with noisy labels.

“Improving your training data is more important than using the latest “state-of-the-art” mode. Here’s a very simple trick: use cleanlab, a Python package for machine learning with noisy labels and finding mislabeled data.”

Daniel Vila Suero, Co-founder of Recognai

Explosion AI ML Engineer endorses Cleanlab to fix datasets.

“More people should do check their labels more frequently. Anybody is free to try out any trick that they like, but if you’re looking for a simple place to start, check out the cleanlab project. It’s made by the same authors of the labelerrors-paper and is meant to help you find bad labels. I’ve used it a bunch of times and I can confirm that it’s able to return relevant examples to double-check. […] The disclaimer on the Google Emotions paper checks a lot of boxes, but imagine that in the future they’d add ‘we checked out labels with cleanlab before releasing it’. For a dataset that’s meant to become a public benchmark, it’d sure be a step worth adding.”

Vincent Warmerdam, Machine Learning Engineer at Explosion AI

Oracle PM shares model improvements with Cleanlab.

“One of the challeges in AML is that labels are not clean and prone to human error. […] There has been extensive reasearch into learning with Noisy labels. One of the most usable approaches has been described in this paper and implemented in [the cleanlab] open source python package. […] First and foremost cleanlab can be used to identify the noisy labels.”

Govind Nair, Senior Product Manager at Oracle

OpenMined Engineer calls Cleanlab “something profound in supervised AI.”

“This feels like something profound in supervised AI! Perhaps those last few percentage points of many benchmarks are only a cleanlab away from SOTA results. Awesome work!”

Madhava Jay, Head of Engineering at OpenMined

Red Hat Principal Software Engineer uses Cleanlab to improve disaster model accuracy from 79% to 85%.

“Most of the datasets are noisy and incorrectly labelled. Sharing an example notebook to show case how to detect and clean noisy labels in the disaster tweets text dataset can improve the accuracy(0.79 -> 0.85). This is done after detecting and removing the noisy labels using cleanlab.”

Manikandan Sivanesan, Senior Principal Software Engineer at Red Hat Experience Engineering

ML Engineer at LifeOmic speaks to the usefulness of Cleanlab.

“Thanks for sharing about the cleanlab package @Addison! I prototyped it late last week on the outputs of one of our classification models, and its outputs were of course not perfect, but definitely useful. It can help us identify: potentially duplicate classes, potentially mislabeled examples, and noisy classes that should potentially have their examples moved out into different already existing classes e.g. legacy superset classes. One thing I really like about that package is how model-agnostic it is. All that’s needed is a sequence of labels.”

Evan Peterson, Machine Learning Engineer at LifeOmic, posted in the MLOps Slack Community

Renumics uses Cleanlab and CleanVision for Data-centric AI playbook.

“Use the Cleanlab library to compute outlier scores based on model output (embeddings, probabilities) and inspect outlier candidates. Use the Cleanvision library to extrapact typical image issues (brightness, blurr, aspect ratio, SNR and duplicates) and identify critical segments through manual inspection.”

Curated list of open source tooling for data-centric AI on unstructured data posted by Renumics on GitHub.

Principal Data Scientist at iRobot recognizes Cleanlab founders.

“Knuckle bump to [Curtis Northcutt] Anish Atalye, and Jonas Mueller for the paper diving into this. https://arxiv.org/pdf/2103.14749.”

Brandon Rohrer, Principal Data Scientist at iRobot.

Canadian government begins using Cleanlab for census data.

“Our data is particularly “dirty” involving many mislabeled data points and in my research of how to alleviate this issue I came across the Clean Lab ai. I’m very interested in your work and how it may apply to our project.”

Andrew Stelmack, Senior Methodologist at Statistics Canada.

Ludwig-Maximilians-Universität München student uses Cleanlab for PhD.

“Cleanlab is immensely helpful for my work. Thank you for that. Love the story behind the company and your work! Keep going with this great tool!”

Dietrich Trautmann while completing final year of his PhD.

Tencent uses Confident Learning for MTCL.

“We propose a novel mean-teacher-assisted confident learning framework to robustly exploit the noisy labeled data for the challenging hepatic vessel segmentation task. Specifically, with the adapted confident learning assisted by a third party, i.e., the weight-averaged teacher model, the noisy labels in the additional low-quality dataset can be transformed from ‘encumbrance’ to ‘treasure’ via progressive pixel-wise soft-correction, thus providing productive guidance.”

Noisy Labels are Treasure: Mean-Teacher-Assisted Confident Learning for Hepatic Vessel Segmentation, 2021.

Computer Vision Engineer at Imagem uses CleanVision to improve the quality of a Radar imagery dataset.

“CleanVision helped me improve the quality of my image data and, as a result, the accuracy of my model. This tool has proven to be invaluable to me, it is helping me to improve the data quality of computer vision projects, allowing us to effectively address a variety of common issues in our imagery dataset.”

Translated quote from blog written by Joāo Ataíde, Computer Vision Engineer at Imagem.

The Stakeholder Company (Singapore) saves 8x time with Cleanlab.

“We used Cleanlab to quickly validate one of our classifier models’ predictions for a dataset. This is typically a very time-consuming task since we would have to check thousands of examples by hand. However, since Cleanlab helped us identify the data points that were most likely to have label errors, we only had to inspect an eighth of our dataset to see that our model was problematic. We later realized that this was due to a post-processing error in the dataset — something that would otherwise have taken a much longer time to notice.”

Seah Bei Ying, Data Analyst at The Stakeholder Company

Gojek Data Scientist uses Cleanlab to remove low-quality labels.

“I used an open-sourced library, cleanlab, to remove low-quality labels on an image dataset. The [ResNet] model trained on the dataset without low-quality data gained 4 percentage points of accuracy compared to the baseline model (trained on all data).”

Travis Tang, Data Scientist at Gojek in Towards AI, 2022

Shands Hospital at the University of Florida uses Cleanlab for real-time AI monitoring of ICU patients.

“Our approach is based on the Cleanlab implementation of active learning for data annotation. Our datasets include over 18 million depth image frames and 22 million patient face image frames extracted from videos. It is not practical to annotate the entirety of these massive datasets. Active learning is an important machine learning technique that involves an iterative process to choose most informative data samples to be labeled. Another important aspect is the annotator quality, which can significantly impact the training effectiveness of the machine learning model.”

From AI-Enhanced Intensive Care Unit: Revolutionizing Patient Care with Pervasive Sensing, 2024

Head of AI Engineering at Harrison.ai uses Cleanlab to detect noisy data.

“I demonstrate the use of cleanlab, a confident learning implementation, to easily find noise in the data. Confident learning provides a solid foundation for analyzing a dataset of noisy or OOD samples — a technique that’s quite effective for multi-class approaches, with the evolving support for multi-label classification.”

Suneeta Mall, Head of AI Engineering at Harrison.ai in Are Label Errors Imperative? Is Confident Learning Useful? in Towards Data Science, 2022

AI Researcher at AIMotion Institute (Bavaria) uses Cleanlab to format datafor upload.

“My Cleanlab Studio experience was very positive. Very surprised how fast and easy it was to get results. Most work was transforming metadata into CSV file. You have really great product here, formatting the data for upload is really the only work needed to analyze/improve any data. You can take somebody who has no Computer Science background and they can have big impact where they previously could not play with and improve data directly. Customer support experience was also great, all of my questions/issues were quickly resolved by Cleanlab engineering team.”

Lukas Lodes, AI Researcher at AIMotion Institute in Bavaria

Cleanlab used to win 4th place (out of 1165 teams) and $100K in Kaggle competition.

“CleanLab was used to remove approximately 5,000 scenes that were considered noise. I did some experiments, including some that weren’t included in the final submission. The model trained on the cleaned dataset increased LB in the stand-alone model, but not much when ensembling. The following two points can be made from this evaluation: CleanLab is effective (+0.003), and large effect of ensembling FixedLen and VariableLen (+0.01).”

Kaggle competitor OHKAWA3 in a discussion on Google - Isolated Sign Language Recognition (Kaggie competition entry.)

Sage AI builds MLOps Pipelines to automatically relabel data using Cleanlab and DVC.

“Cleaning Data Labels — A Problem for Today and Tomorrow […] Since we don’t have an a priori gold label set to evaluate our confidence scores, we’ll use Cleanlab’s processing of the predicted probabilities from cross validation segments to arrive at a reasonable approximation for a set of gold labels. Thus, we can reduce the downstream burden of human evaluation by finding and relabeling the worst performers automatically. […] Our pipeline and Cleanlab’s algorithm detected between 1,354 and 1,993 label issues (depending on the classifier used) which were then relabeled, or moved to an unknown category for further inspection.”

Published by Todd Cook, Data Scientist and Engineer, for Sage AI in Build an MLOps Pipeline to Automatically Relabel Data using Cleanlab and DVC, 2023

Ping An Insurance (China) uses Cleanlab to find noise in their data, remove bad data, and train their e-commerce product classifier.

“If the classifier is trained with these noisy images directly, its performance could be degraded. In view of this, we attempted to find label errors in the image dataset with an open source tool cleanlab”, a framework powered by the theory of confident learning. Specifically, we trained multiple ResNet50 image classifiers to compute the predicted product category probabilities for all the training samples in a |cross-validation manner. Then the cleanlab tool could utilize the matrix of predicted probabilities to find noisy samples, ordered by likelihood of being an error. We removed the top 10% noisy samples from the training set.”

Published in A Multimodal Late Fusion Model for E-Commerce Product Classification, 2020

Kili Technology posts blog about Cleanlab and methods to deal with label errors.

“Recently last year MIT researchers created Cleanlab as a tool to find label errors in image, text, and audio datasets. The scholars tested their tool with the 10 most popular image, text, and audio datasets, including Cifar-10, ImageNet, IMDB, and AudioSet. In one of their publications, they show how these datasets, which are widely used to benchmark new and improved Machine Learning algorithms, have errors. […] These new labeling error detection algorithms are recent, and their effectiveness has to be tested in environments external to academia.”

Adrien Pujol, Product Marketing Manager at Salesforce in My State-Of-The-Art Machine Learning Model does not reach its accuracy promise: What can I do?

Machine Learning Engineer at BinIt.AI uses recommends Cleanlab.

“The simplest approach is to stop training a model early, before it’s memorised the training set, and then use this model to run inference back over its own training set. The frames with the largest disagreement between the original labels and the model’s predictions are likely to include incorrect annotations. Send the top X of these to your labelling tool, and correct where appropriate. See cleanlab to get started.[…] 90% of the gains can be achieved through a pretty simple set-up built around open-source tools like cleanlab, CVAT, Voxel Fiftyone and maybe the odd Streamlit app.”

George Pearse, ML Engineer at BinIt.AI

Cleanlab used to win 28th place (out of 757 teams) in Kaggle competition.

“This is a weakly supervised multi-label classification problem and a code competition. Given images of cells from our microscopes and labels of protein location assigned together for all cells in the image, Kagglers will develop models capable of segmenting and classifying each individual cell with precise labels. If successful, you’ll contribute to the revolution of single-cell biology! We use the Kaggle provided dataset and the public dataset to train and evaluate using different model architectures. The public tools used include Fastai, Opencv, CellSegmentator, Cleanlab, etc.”

Kaggle competitor Sky walker in a discussion on Human Protein Atlas - Single Cell Classification (Kaggie competition entry.)

Engineer at Appen uses Cleanlab to create ML-You-Can-Use Wikidata Occupations labeled dataset.

“Wikidata is a great resource of free data. However to interact with it meaningfully most people will find it necessary to clean the data. For more details on how some data was labeled manually, how BERT embeddings were used to build a classifier and how Cleanlab was used to detect problematic labels, please visit the ML-You-Can-Use notebooks regarding the label provenance.”

Todd Cooke, Senior Software Engineer at Appen

Petco Researcher and Engineer writes blogpost on Cleanlab technology.

“Similar to the deep learning framework PyTorch, Cleanlab is a framework for machine learning and deep learning with error labels. […] It can be used to describe, find and learn label errors. The cleanlab Python package is free and open source.”

Xiaoyao Xi, Senior Manager Data Science at PetCo.

Calmcode.io tutorial features Cleanlab as the go-to solution.

“It’d be a shame if our machine learning models are merely optimal because they overfit on the bad labels. That’s why we’re going to explore heuristics to find bad labels in our training data so that we may try to improve the quality of our training data. This will also give us the opportunity to explore cleanlab, which is made by the creators of the label errors website to help spot bad labels.”

Bad Labels: Introduction course on Calmcode.io

ByteDance Computer Vision Engineer uses Cleanlab to deliver high quality video tagging models.

“At TikTok, I deploy models for video tagging at an enormous scale. My expertise lies in Large-scale ML Ops operations. I’ve witnessed the transformative impact of enhancing data quality, often overshadowed by flashier methods. At TikTok, I actively utilize Cleanlab to swiftly identify incorrect annotations, consistently delivering high-quality models on schedule.”

Computer Vision Engineer on the TikTok Video Understanding Team.

Consulting Data Scientist / Machine Learning Engineer finds label errors in NLP with Cleanlab.

“We use the Python package cleanlab which leverages confident learning to find label errors in datasets and for learning with noisy labels. Its called cleanlab because it CLEANs LABels. cleanlab is: fast (single-shot, non-iterative, parallelized algorithms), robust (provable generalization and risk minimization guarantees, including imperfect probability estimation), general (works with any probabilistic classifier), and unique (the only package for multi-class learning with noisy labels or finding label errors for any dataset / classifier).”

Tobias Sterbak, Machine Learning consultant in Depends on the Definition blog

Participant of Data-Centric AI course shares impact of course.

“As the course comes to a close, I would like to take a moment to express my sincerest gratitude for your guidance and support throughout the lectures. It took me about a month to complete all nine lectures, including labs and notes, but I can say that this path has been the most illuminating experience in my educational life. Your unwavering dedication to teaching and commitment to my learning experience has not gone unnoticed. You have inspired me to continue learning and growing beyond the classroom. Your generosity with your time and knowledge has made a significant impact on my journey. Thank you once again for all that you have done for us.”

Yunus Gümüssoy, participant in Cleanlab’s Data-Centric AI course

AMBL Blog uses Cleanlab to detect noisy labels.

“It takes time and effort to check each image and manually remove the noise image, but it seems to be easy by using cleanlab. I think it is also convenient that it can be used regardless of the framework.”

Translated quote from Colors by AMBL blog.

GitHub user praises cleanlab (open-source).

“I’m just starting to get the hang of this and read on how it works. But right now from the first results it looks like pure black magic… So thank you for this!”

GitHub user NorthyFN’s Discussions post in the cleanlab GitHub repository

Twitter user praises Cleanlab Vision Library.

“I collected custom image data from the internet for one of my pet projects. When i went through the data, i saw a lot of duplicate images. Initially i was deleting them all manually (not fun at all). This library was a game changer. Just one function and everything is done.”

Twitter user @wannabemonk

Share your story with us!

Please let us know how you are using Cleanlab, we’d love to hear about your experience and what you love about Cleanlab.

Resources

Open-Source Growth over time How scientists use Cleanlab