Software Engineer (Data/ML Platform)

Location: San Francisco (3 days per week in office)

Cleanlab’s mission is to democratize safe, trustworthy, and reliable AI. We believe the most promising path to such AI follows a data-centric approach. We build open-source algorithms and no-code SaaS solutions based on our research to help individuals and teams systematically and algorithmically curate high-quality datasets, which enables AI systems to train reliably on real-world messy and error-prone data.

The role

As a software engineer working on Cleanlab’s data platform and ML platform, you will be responsible for building the data and ML systems that underpin Cleanlab Studio, a user-friendly web app powered by our data-centric ML algorithms.

What you might work on

  • Productionizing the algorithms invented by our ML team by designing and implementing systems that perform data ingestion, data processing, ML training, and ML inference
  • Optimizing code for speed, scalability, security, and reliability
  • Ideating and prototyping novel product features enabled by our research
  • Collaborating with our data scientists and machine learning engineers to integrate feedback

Our data/ML platform tech stack includes Python and libraries like Polars, Arrow, Pandas, NumPy, and scikit-learn.

What we’re looking for

  • Track record of shipping high-quality code in challenging projects
  • Exceptional programming skills
  • Ability to thrive in a fast-paced environment, navigate ambiguity, and adapt to changing priorities

Cleanlab strives to be a place where high-potential individuals come together to build something great. We value grit and the ability to learn quickly as much as skill and experience.

Our software engineering jobs have no hard requirements (like years of industry work experience, or even experience with the programming languages we use). Instead, we evaluate each application individually and consider any aspects that might stand out. Do you have some amazing open-source contributions? Did you write an impactful paper? Did you do some amazing work at a past job?

If you’re a great hacker, you believe in our mission, and you identify with our culture, we encourage you to apply!

How to apply


Benefits

Working at Cleanlab is awesome! Beyond the opportunity to work at a well-funded AI startup with an incredible, friendly founding team of MIT graduates, all full-time employees receive the following:

  • Premium health insurance (+ dental and vision)
    • We provide a fantastic $4 (we cover the rest) health insurance option. We also provide a $0 deductible 100% coverage premium health care option for those who prefer the best health insurance.
  • Work with a talented team unified by a vision to solve challenging problems in artificial intelligence.
  • Professional development stipend to keep up with the latest innovations in ML and software.
  • Competitive salary (+ equity offering for certain roles), with regular opportunities for a raise if things are going well.
  • Relocation bonus to move to San Francisco

The compensation range for this role is $150,000 to $250,000. The final offer details are determined by several factors including candidate experience/expertise and may vary from the pay range provided.

About us

Prior to Cleanlab, our founders (3 ML PhDs from MIT) worked at OpenAI, Google, Microsoft, Amazon, AWS, Facebook AI Research (FAIR), Dropbox, Oculus, Palantir, NASA, General Electric, MIT Lincoln Laboratory, MIT, Harvard, and Stanford – at every place we worked we repeatedly encountered the same issue – AI solutions failed to work reliably on real-world, human-centric data due to label errors and poor data quality. So, we spent eight years of PhD research at MIT inventing a new field to solve this problem and after successful pilots with world-leading organizations, Cleanlab emerged.

Everything we do at Cleanlab is guided by our north star – to improve the world’s ML data more easily and quicker than any other solution – enabling AI systems to train more reliably on real-world, messy, error-prone data. We develop next-generation data-centric AI, open-source algorithms and provide no-code SaaS enterprise solutions to help individuals and teams at companies (across all industries) diagnose/fix issues in their datasets and produce more reliable ML models by providing clean labels for training.

While many companies can help store/manage data or develop ML models, there exist few solutions today to improve the quality of existing data, which is the core asset of the modern enterprise. This is where you come in. At Cleanlab, you’ll be able to take ownership of critical projects that pioneer the future of data-centric AI.

We are a hybrid company, with over half of our team (and office) located in San Francisco.

  • Read about the Cleanlab team here.
  • Read how Cleanlab went from MIT PhD research to tech used by Amazon, Google, etc here.
  • See what Google, Tencent, and other Cleanlab users think here.