
Experienced Cloud Engineer (Data Platform)

Location: San Francisco (flexible/hybrid)

NOTE: this position is for experienced candidates only. If you have less than ~10 years of experience, please see our other openings, where you will likely be better set up for success!

At Cleanlab, you’ll get to

  • Engineer a data platform and work on cutting-edge MLOps with guidance from MIT PhDs who are prominent in systems and ML research.

  • Pioneer novel software systems for the rapidly growing field of data-centric AI. Our tools enable data scientists and engineers across all industries to effectively diagnose and fix issues in their datasets, thus improving the quality of their business’s core asset.

  • Work with the latest tooling and ML models at a dynamic startup.

In this position, you will

  • Drive the future of our data platform by owning high-priority projects that will increase our platform’s value, resilience, and scalability. You will also make critical technical decisions for our backend, comprising the data ingestion, processing, and query pipelines.

  • Provide technical leadership to other engineers as you make architectural platform decisions. We work with a modern tech stack, including the latest tooling and ML models, and will look to you to help evolve this further.

  • Identify reliability and security risks and design solutions to address them. You will also guide leadership on how to prioritize these initiatives (short and long-term) and decide how they fit into our roadmaps.

  • Embrace the startup chaos! You will help us discover and define how we streamline collaboration with product, sales, and marketing teams to create the best solutions for our current and future customers!

We are looking for

  • Strong skills in software engineering and in data platform architecture and engineering. You have likely built significant distributed backend systems.

  • Multiple years of experience running production code and models in cloud environments. You should be comfortable with the unique challenges of running large software systems at scale.

  • Several years of experience with cross-team initiatives in a platform or infrastructure-focused environment. You should have led impactful technical initiatives where first-order concerns include performance, data security, cost efficiency, ease of use, and operability.

  • You enjoy and are good at mentoring and coaching other engineers to become even stronger. Teams win!

  • This is a senior role. Success is more likely if you have at least 10 years of real-world experience.


Working at Cleanlab is awesome! Beyond the opportunity to work at a well-funded AI startup with an incredible, friendly founding team of MIT graduates, all full-time employees receive the following:

  • Annual travel stipend
    • Travel enhances our empathy with different cultures and enables us to work together more effectively. It’s how we grow and learn: traveling is an essential part of what makes us human. At Cleanlab, every two months you will receive a reimbursable travel benefit. This is a unique benefit that lets you work from Paris for a week in February, then take a backpacking trip in the Andes for a weekend in March.
  • Premium health insurance (+ dental and vision)
    • We provide a fantastic $4 (we cover the rest) health insurance option. We also provide a $0 deductible 100% coverage premium health care option for those who prefer the best health insurance.
  • Stipend for attending conferences to keep up with the latest innovations in ML and software.
  • Competitive salary (+ equity offering for certain roles), with regular opportunities for a raise if things are going well.

The compensation range for this role is $170,000 to $230,000. The final offer details are determined by several factors including candidate experience/expertise and may vary from the pay range provided.

About Us

Prior to Cleanlab, our founders (3 ML PhDs from MIT) worked at OpenAI, Google, Microsoft, Amazon, AWS, Facebook AI Research (FAIR), Dropbox, Oculus, Palantir, NASA, General Electric, MIT Lincoln Laboratory, MIT, Harvard, and Stanford. At every place we worked, we repeatedly encountered the same issue: AI solutions failed to work reliably on real-world, human-centric data due to label errors and poor data quality. So we spent eight years of PhD research at MIT inventing a new field to solve this problem, and after successful pilots with world-leading organizations, Cleanlab emerged.

Everything we do at Cleanlab is guided by our north star: to improve the world’s ML data more easily and quickly than any other solution, enabling AI systems to train more reliably on real-world, messy, error-prone data. We develop next-generation data-centric AI and open-source algorithms, and provide no-code SaaS enterprise solutions that help individuals and teams at companies (across all industries) diagnose and fix issues in their datasets and produce more reliable ML models by providing clean labels for training.

While many companies can help store and manage data or develop ML models, few solutions exist today to improve the quality of existing data, which is the core asset of the modern enterprise. This is where you come in. At Cleanlab, you’ll be able to take ownership of critical projects that pioneer the future of data-centric AI.

We are a hybrid company, with over half of our team (and office) located in San Francisco.

  • Read about the Cleanlab team here.
  • Read how Cleanlab went from MIT PhD research to tech used by Amazon, Google, etc. here.
  • See what Google, Tencent, and other Cleanlab users think here.

How to Apply