Loading organizations...

§ Private Profile · San Francisco, CA, USA
LLM evaluation platform for evaluating, monitoring, and optimizing LLM applications for AI teams to improve quality.
Confident AI has raised $2.0M across 1 funding round.
Key people at Confident AI.
Confident AI was founded in 2024 by Kritin Vongthongsri (Co-Founder) and Jeffrey Ip (CEO & Cofounder).
Confident AI has raised $2.0M in total across 1 funding round.
Based in San Francisco, California, Confident AI develops an open-source platform for evaluating, monitoring, and optimizing large language model applications. The company provides engineering, quality assurance, and product teams with tools like DeepEval to benchmark and safeguard artificial intelligence quality within continuous integration and deployment pipelines. Operating with a team of seven employees, the startup's core open-source framework currently processes two million evaluations daily, generates over three million monthly downloads, and has accumulated 12,600 GitHub stars. Backed by lead investor Y Combinator as part of its Winter 2025 batch, the enterprise software provider has raised $2.2 million in total seed funding to date. Its commercial SaaS platform is utilized by major corporate enterprise clients including Microsoft, AstraZeneca, AXA, and the Boston Consulting Group. Confident AI was founded in 2024 by Jeffrey Ip and Kritin Vongthongsri.
Confident AI has raised $2.0M across 1 funding round. Most recently, it raised $2.0M Seed in March 2025.
| Date | Round | Lead Investors | Other Investors | Status |
|---|---|---|---|---|
| Mar 1, 2025 | $2M Seed | — | Flexcap, Third Prime, Aaron King, Kunal Shah, Robert Wuttke | Announced |
Key people at Confident AI.
Confident AI was founded in 2024 by Kritin Vongthongsri (Co-Founder) and Jeffrey Ip (CEO & Cofounder).
Confident AI has raised $2.0M in total across 1 funding round.
Confident AI's investors include Flexcap, Third Prime, Aaron King, Kunal Shah, Robert Wuttke.
Confident AI is an open-source company that builds DeepEval, a unit testing and evaluation framework specifically designed for large language model (LLM) applications such as chatbots, agents, and retrieval-augmented generation (RAG) pipelines. Their product suite includes the DeepEval open-source package and a cloud platform that enables engineering teams to benchmark, safeguard, and improve LLM applications with best-in-class, use-case-specific evaluation metrics and guardrails. This platform helps teams save hundreds of hours weekly on fixing regressions, cut inference costs by up to 80%, and confidently deploy AI systems with continuous evaluation integrated into CI/CD pipelines. Confident AI primarily serves engineering teams and data scientists in enterprises across sectors like healthcare, insurance, finance, and technology, addressing the critical problem of the "black box" nature of LLMs by providing transparent, automated, and scalable testing solutions. Their growth is marked by widespread adoption of DeepEval (with millions of evaluations weekly) and enterprise clients including Microsoft, BCG, AstraZeneca, and AXA[1][3][4][6].
Confident AI was founded by Jeffrey Ip and Kritin Vongthongsri. Jeffrey Ip brings experience from Google, where he scaled YouTube's creator studio infrastructure, and Microsoft, where he worked on document recommenders for Office 365. Kritin Vongthongsri is an AI researcher with a background in NLP pipelines for fintech startups and research in self-driving cars and human-computer interaction at Princeton. The idea for Confident AI emerged from the founders' recognition of the challenges in reliably evaluating LLM applications, which often behave like opaque black boxes. They created DeepEval to provide deterministic, use-case-specific evaluation metrics and later built the Confident AI cloud platform to bring these capabilities to production environments, enabling continuous evaluation and improvement. Early traction includes significant open-source adoption and enterprise usage, validating the need for specialized LLM testing tools[3][6].
Confident AI rides the wave of rapid LLM adoption across industries, addressing a critical bottleneck: rigorous, scalable evaluation of AI models. As enterprises increasingly deploy LLMs in mission-critical workflows, the need for transparent, reliable testing to prevent regressions and optimize performance is paramount. The timing is ideal given the explosion of LLM use cases and the complexity of managing AI quality at scale. Confident AI’s focus on evaluation-first workflows aligns with market forces emphasizing AI safety, compliance, and operational excellence. By providing open-source tools and cloud infrastructure, they influence the ecosystem by setting new standards for AI testing, fostering trust, and enabling faster, data-driven AI innovation[3][4][6][7].
Looking ahead, Confident AI is poised to expand its impact by deepening enterprise integrations, enhancing automated prompt and model optimization, and publishing case studies that demonstrate ROI. Trends such as increasing regulatory scrutiny, demand for AI explainability, and the proliferation of LLM-powered applications will shape their journey. Their influence may evolve from a niche evaluation tool to a foundational platform for AI governance and continuous improvement across industries. As the complexity of AI systems grows, Confident AI’s mission to demystify and safeguard LLM applications will remain critical, potentially catalyzing broader adoption of rigorous AI testing practices and raising the bar for AI reliability and trustworthiness[3][6].