Loading organizations...

§ Private Profile · San Francisco, CA, USA
Building the open source standard for evaluating LLM Applications
Ragas has raised $500K across 1 funding round.
Key people at Ragas.
Ragas was founded in 2023 by Jithin James (Founder) and Shahul ES (Founder).
Ragas has raised $500K in total across 1 funding round.
The fragmented and proprietary evaluation tools today are leading to significant inefficiencies and confusion among developers. The world needs a standard everyone can rely on and that is why we are building Ragas as the open-source standard.
We have 4k stars on GitHub, 1.3k members in our discord community, and over 80+ external contributors. We also have partnerships with key AI companies like Langchain, Llamaindex, Arize, Weaviate and more to help create a standard.
We already process 5 million evaluations monthly for engineers from companies like AWS, Microsoft, Databricks, and Moody’s and it is growing at 70% month over month.
We are building LLM application testing and evaluation infrastructure for Enterprises.
Ragas is an open source framework designed to test and evaluate large language model (LLM) applications, particularly those using Retrieval-Augmented Generation (RAG) workflows. It provides automatic metrics, synthetic test data generation, and evaluation workflows that help developers and organizations quantitatively measure the performance, robustness, and accuracy of their LLM applications[1][2][4]. By enabling continuous monitoring and detailed diagnostics, Ragas helps improve LLM-based products by identifying issues like hallucinations or irrelevant retrievals, thus enhancing user trust and application reliability.
For an investment firm, Ragas represents a mission-driven technology focused on setting an open standard for evaluating LLM applications, reflecting an investment philosophy centered on transparency, reliability, and innovation in AI tooling. Its key sector is AI infrastructure and developer tools, impacting the startup ecosystem by enabling better benchmarking and quality assurance for LLM-powered products, which accelerates innovation and reduces risk in AI deployments.
For a portfolio company, Ragas builds a developer-centric evaluation platform that serves AI product teams, researchers, and enterprises deploying LLM applications. It solves the problem of measuring and improving LLM application performance in a systematic, reproducible way, addressing challenges like model hallucination and retrieval relevance. Its growth momentum is evidenced by adoption in production environments, integration with platforms like Amazon Bedrock and Elasticsearch, and active community contributions[1][2][4].
Ragas was founded by AI practitioners and researchers who recognized the need for a standardized, open source framework to evaluate LLM applications, especially as RAG workflows became more prevalent. The idea emerged from practical challenges in assessing the accuracy and reliability of LLM outputs enhanced by external data retrieval, which existing tools did not adequately address[1][3]. Early traction came from integration with major platforms such as Elasticsearch and Amazon Bedrock, and from community adoption through GitHub and open source contributions[1][2][3].
Key contributors include Pavan Belagatti, who has publicly shared tutorials and code examples demonstrating Ragas’ capabilities, helping to build awareness and adoption in the AI developer community[3]. The project has evolved from a simple evaluation toolkit to a comprehensive framework supporting synthetic data generation, continuous monitoring, and detailed metric reporting[4][8].
Ragas rides the wave of rapid LLM adoption and the rise of Retrieval-Augmented Generation workflows, which combine LLMs with external knowledge sources to improve accuracy and relevance. As LLM applications proliferate across industries, the need for robust, standardized evaluation frameworks becomes critical to ensure quality, reduce hallucinations, and build user trust.
The timing is crucial because many organizations are moving from experimental LLM use to production deployments, where continuous monitoring and performance diagnostics are essential. Market forces such as increasing regulatory scrutiny, demand for explainability, and the complexity of multi-component AI systems favor tools like Ragas that provide transparency and actionable insights.
By enabling systematic evaluation and benchmarking, Ragas influences the broader ecosystem by raising the bar for LLM application quality, accelerating innovation cycles, and reducing the risk of deploying unreliable AI systems[1][2][6][8].
Looking ahead, Ragas is poised to expand its influence as the de facto open source standard for LLM application evaluation, potentially integrating with more AI platforms and cloud providers. Trends shaping its journey include the growing complexity of LLM workflows, the push for AI governance and compliance, and the increasing importance of continuous AI observability.
Its future may involve deeper automation, more sophisticated synthetic data generation, and enhanced support for multi-modal and multi-agent LLM applications. As LLMs evolve, Ragas’ role in ensuring trustworthiness and performance will become even more critical, making it a foundational tool for AI product teams and investors focused on sustainable AI innovation.
Ragas was founded in 2023 by Jithin James (Founder) and Shahul ES (Founder).
Ragas has raised $500K in total across 1 funding round.
Ragas's investors include 468 Capital, Heavybit, Y Combinator, Lisha Li.
Key people at Ragas.
Ragas has raised $500K across 1 funding round. Most recently, it raised $500K Exploding Gradients - Seed in April 2024.
| Date | Round | Lead Investors | Other Investors | Status |
|---|---|---|---|---|
| Apr 1, 2024 | $500K Seed | — | 468 Capital, Heavybit, Y Combinator, Lisha LI | Announced |