Get started today
Start your 2-week free trial today. No credit card needed.
Cleanlab Open-Source
GitHub
Limited Python API Access
Automatically detects issues
No auto-fix
Learn more
Cleanlab Studio
Free trial
Sign up now
No code / ML engineering needed
Web interface and API access
Auto-fix data issues
Image, text, document, and tabular data
AI-automated data labeling
Trustworthy Language Model (TLM)
Analytics
AutoML model training/deployment
Contact sales
Cleanlab Studio Enterprise
Contact sales
Everything in free trial
More ML and data correction tasks
Project-optimized AutoML
Image segmentation
Object detection
VPC and cloud integration
Hosted deployment / inference
Priority for new feature requests
Scale to massive datasets
Dedicated support engineer
Book demo
* Per-token pricing for Trustworthy Language Model (TLM) is also available. Contact sales to learn more.
Compare plans
Find the plan that fits your business needs.
Open-Source
Documentation
Open-Source
API access
Open-Source
Finds data and label issues
Open-Source
Support for image, text, and tabular data
Open-Source
Slack community of 1000+ ML engineers and data scientists
Open-Source
Support for audio, video, and PDF
Open-Source
Image segmentation and object detection
Free Trial
Start free trial
Free Trial
API access
Free Trial
Finds data and label issues
Free Trial
Support for image, text, and tabular data
Free Trial
Slack community of 1000+ ML engineers and data scientists
Free Trial
No code and no ML engineering needed
Free Trial
Supercharged AI-automated data labeling (10x faster)
Free Trial
Train reliable ML models and LLMs
Free Trial
Find and remove ambiguous data
Free Trial
Find and fix near-duplicate data
Free Trial
Find and filter low-quality images and text
Free Trial
Find and filter inappropriate content (PII, toxic language, slang, NSFW images)
Free Trial
Find and fix out of distribution and outlier data
Free Trial
1-click AutoML model training and deployment
Free Trial
Trustworthy Language Model (TLM)
Enterprise
Contact sales
Enterprise
API access
Enterprise
Finds data and label issues
Enterprise
Support for image, text, and tabular data
Enterprise
Slack community of 1000+ ML engineers and data scientists
Enterprise
No code and no ML engineering needed
Enterprise
Supercharged AI-automated data labeling (10x faster)
Enterprise
Train reliable ML models and LLMs
Enterprise
Find and remove ambiguous data
Enterprise
Find and fix near-duplicate data
Enterprise
Find and filter low-quality images and text
Enterprise
Find and filter inappropriate content (PII, toxic language, slang, NSFW images)
Enterprise
Find and fix out of distribution and outlier data
Enterprise
1-click AutoML model training and deployment
Enterprise
Trustworthy Language Model (TLM)
Enterprise
Support for audio, video, and PDF
Enterprise
Image segmentation and object detection
Enterprise
Scale to massive datasets
Enterprise
Monitor additional data for issues as it comes into data store
Enterprise
On cloud, on premises, VPC (virtual private cloud), and cloud integration options available
Enterprise
Priority for new feature requests
Enterprise
Dedicated support engineer
Enterprise
Commercial license of Cleanlab Open-Source available
Open-Source
Enterprise
API access
Open Source
Open Source
Open Source
Finds data and label issues
Open Source
Open Source
Open Source
Support for image, text, and tabular data
Open Source
Open Source
Open Source
Slack community of 1000+ ML engineers and data scientists
Open Source
Open Source
Open Source
No code and no ML engineering needed
Open Source
Open Source
Open Source
Supercharged AI-automated data labeling (10x faster)
Open Source
Open Source
Open Source
Train reliable ML models and LLMs
Open Source
Open Source
Open Source
Find and remove ambiguous data
Open Source
Open Source
Open Source
Find and fix near-duplicate data
Open Source
Open Source
Open Source
Find and filter low-quality images and text
Open Source
Open Source
Open Source
Find and filter inappropriate content (PII, toxic language, slang, NSFW images)
Open Source
Open Source
Open Source
Find and fix out of distribution and outlier data
Open Source
Open Source
Open Source
1-click AutoML model training and deployment
Open Source
Open Source
Open Source
Trustworthy Language Model (TLM)
Open Source
Open Source
Open Source
Support for audio, video, and PDF
Open Source
Open Source
Open Source
Image segmentation and object detection
Open Source
Open Source
Open Source
Scale to massive datasets
Open Source
Open Source
Open Source
Monitor additional data for issues as it comes into data store
Open Source
Open Source
Open Source
On cloud, on premises, VPC (virtual private cloud), and cloud integration options available
Open Source
Open Source
Open Source
Priority for new feature requests
Open Source
Open Source
Open Source
Dedicated support engineer
Open Source
Open Source
Open Source
Commercial license of Cleanlab Open-Source available
Open Source
Open Source
Open Source
Yiwen Jiang
I've found that the app corrects mislabelling very well, but I didn't get the results I was looking for when I used your open source library Cleanlab directly in Python. It turns out that your app on the web version works much better than the library in Python!
Yiwen Jiang | Data Engineer at Orange
Data Engineer at Orange
Andrew Ng
Question: There’ve been many Model-Centric breakthroughs that have excited and inspired the field. What are some of your favorite examples of Data-Centric breakthroughs or wins that will inspire the field?
Answer: “The Cleanlab stuff out of MIT”
Andrew Ng, Keynote talk at ICML 2023 Workshop on Data-Centric Machine Learning
Keynote talk at ICML 2023 Workshop on Data-Centric Machine Learning
Fredrik Olsson
Cleanlab Studio is a very effective solution to calm my nerves when it comes to label noise!
Fredrik Olsson | PhD. Head of Data Science at Gavagai
PhD. Head of Data Science at Gavagai
Lukas Lodes
I got significantly better results using Cleanlab Studio than the cleanlab open-source package, mainly because it’s so much easier to use.
Lukas Lodes | AI Researcher at AIMotion Institute
AI Researcher at AIMotion Institute
Cher Simon
Manually inspecting and fixing potential label errors can be time-consuming. We can train a better model using Cleanlab to filter noisy data.
Cher Simon | Principal Solutions Architect at Amazon AWS
Principal Solutions Architect at Amazon AWS
Steven Gawthorpe
We used Cleanlab Studio for document curation and saved our legal clients millions of dollars. Using Cleanlab in litigation discovery, we can accomplish with 5 lawyers what previously required 50 lawyers.
Steven Gawthorpe, Associate Director of Data Science at Berkeley Research Group
Associate Director of Data Science at Berkeley Research Group
Yiwen Jiang
I've found that the app corrects mislabelling very well, but I didn't get the results I was looking for when I used your open source library Cleanlab directly in Python. It turns out that your app on the web version works much better than the library in Python!
Yiwen Jiang | Data Engineer at Orange
Data Engineer at Orange
Andrew Ng
Question: There’ve been many Model-Centric breakthroughs that have excited and inspired the field. What are some of your favorite examples of Data-Centric breakthroughs or wins that will inspire the field?
Answer: “The Cleanlab stuff out of MIT”
Andrew Ng, Keynote talk at ICML 2023 Workshop on Data-Centric Machine Learning
Keynote talk at ICML 2023 Workshop on Data-Centric Machine Learning
FAQs
Can I do _ _ _ _ with your software?Accordion Arrow

Start by looking through our documentation. If you don’t find what you’re seeking, do note that: Cleanlab provides significantly more functionality for Enterprise customers than is available in the generally available public version of our software. Learn more in a demo.

How can I best use your software in my Data / AI projects?Accordion Arrow

For Enterprises, we offer Proof-of-Value services where our engineers will run the software on your data to show you why thousands of data scientists across all industries use Cleanlab software. Learn more in a demo. We also offer non-AGPL licenses for the Open-Source packages that come with support from our engineers, as properly using these packages requires ML expertise.

How does Cleanlab Studio compare to existing software and fit into my workflows?Accordion Arrow

Since Cleanlab Studio is a novel type of Data-Centric AI platform based on years of research from our team, it can be tricky to understand in terms of existing tools (no other tool like this exists). Read our blogpost that situates Cleanlab Studio in the landscape of existing software tools and Data/AI workflows.

Where can I learn more about the differences between Cleanlab Studio and the Open-Source package?Accordion Arrow

Read our blogpost that details the differences.

Do you offer discounts for academic research and small startups?Accordion Arrow

Absolutely, we enthusiastically support research (and publish a lot ourselves) and new innovative organizations. Just email: sales@cleanlab.ai

Can one person really handle: labeling & curating a big dataset + training and deploying a reliable ML model on it?Accordion Arrow

Absolutely, there have been many stories of a single data scientist or engineer building deployed ML applications from scratch at their company in days. This is possible because Cleanlab Studio automates lots of the work for you, leaving you to only handle the most impactful decisions (and you can do it all without having to write code). Cleanlab Studio offers automated support for: data labeling, data issue detection and correction, ML model training with hyperparameter-tuning and model-selection, and ML model deployment to serve predictions.