Using Cleanlab Studio to Audit Public Datasets

The Cleanlab Studio Audit uses AI to auto-detect problems in a given dataset. Like a PSA, the CSA is a recurring series to inform the community about issues in popular datasets — all automatically found and corrected with Cleanlab Studio.

Cleanlab Studio can just as easily help you improve your own image, text, or tabular/CSV/Excel dataset. Try it now!

If you find interesting issues in any dataset, they can be featured here! Just fill out this form.



View more errors detected with Cleanlab in famous ML benchmark datasets at labelerrors.com
Get started today
TLM is free to try and adds a reliabilty layer to RAG and GenAI systems in a few lines of code.
More resources
Explore applications of Cleanlab via blogs, tutorials, videos, and read the research that powers this next-generation platform.
Join us on Slack
Join the Cleanlab Community to ask questions and see how scientists and engineers are practicing Data-Centric AI.