Arize AI: $70 Million (Series C) Closed For AI Observability And LLM Evaluation Company

Arize AI, a leader in AI observability and LLM evaluation, announced a $70 million Series C round to accelerate its goal of making AI work reliably in production. The funding round was led by Adams Street Partners, with participation from M12 (Microsoft’s venture fund), Sinewave Ventures, OMERS Ventures, Datadog, PagerDuty, Industry Ventures, and Archerman Capital. Existing investors Foundation Capital, Battery Ventures, TCV, and Swift Ventures also cemented their confidence in Arize’s vision.

AI adoption is rapidly increasing. But while AI models are more powerful than ever, most LLMs struggle to perform reliably in real-world applications like voice assistants. And many AI models are trained and optimized using synthetic data—data generated by other AI models rather than real-world sources. But what happens when those models cannot evaluate the results of their own synthetic data?

In a research effort called OpenEvals, Arize has shown that LLMs struggle to assess the correctness of synthetic datasets reliably compared to non-synthetic data—a major blind spot as enterprises rush to scale generative AI. These findings highlight serious risks in AI model training and self-improvement loops, where unchecked errors in synthetic data can compound over time. LLMs are still a black box for engineering teams—unpredictable, difficult to troubleshoot, and prone to failures that can derail entire projects.

With the industry grappling with these challenges, AI engineers need better tools to ensure their models aren’t built on faulty foundations. With Arize’s AI observability and LLM evaluation platform, teams can test, troubleshoot, and course-correct AI systems before failures escalate into real-world consequences. This is especially important as enterprises race to implement semi-autonomous multi-agent systems, voice assistants, and increasingly sophisticated consumer-facing AI applications.

Since launching a few years ago, Arize has become an AI observability and evaluation backbone for the world’s top enterprises and government agencies, including Booking.com, Condé Nast, Duolingo, Hyatt, PepsiCo, Priceline, TripAdvisor, Uber, and Wayfair, among hundreds more. Arize Phoenix, the company’s open-source offering, has emerged as the most widely adopted AI observability and evaluation library for development, with over two million monthly downloads.

Arize’s partnership with Microsoft is also expanding, with M12’s investment reinforcing a long-standing partnership. And the company recently launched deeper integrations with Azure AI Studio and the Azure AI Foundry portal, SDK, and CLI, making it easier than ever for AI engineers to integrate observability and evaluation into their workflows.

KEY QUOTES:

“Building AI is easy. Making it work in the real world is the hard part. Enterprises can’t afford to deploy unreliable AI. Engineering teams need better infrastructure to test, evaluate, and troubleshoot their models before they impact customers. That’s exactly what Arize delivers—whether through our enterprise platform, Arize AX, or our open-source offering, Arize Phoenix.”

Jason Lopatecki, CEO and Co-Founder of Arize AI

“As AI research and real-world applications accelerate, Arize will continue to pioneer new tools, like our recent first-to-market launch of audio evaluation for voice assistants, to help engineers working on these systems better evaluation, debug, and improve what they build.”

Aparna Dhinakaran, Chief Product Officer and Co-Founder of Arize

“We believe AI observability is the missing piece in making AI truly enterprise-ready. As AI adoption accelerates, companies need robust, cohesive tools to ensure their AI systems are performant, reliable, and aligned with business goals. Through our research and diligence in this market, we believe Arize AI has built the category-defining platform for AI observability and evaluation, trusted by leading enterprises and AI-first organizations. We’re excited to support their vision as they scale to meet the growing demand for production-grade AI.”

Fred Wang, Partner at Adams Street Partners

“Arize AI’s innovative approach to AI observability and LLM evaluation is transforming the way enterprises deploy and manage AI systems. Our investment reflects our confidence in their ability to set new standards in the industry and empower AI engineers and developers to achieve real-world results.”

Todd Graham, Managing Partner at M12

“Tripadvisor’s billion-plus reviews and contributions are becoming even more important in a world of AI search and recommendations where travel experiences are more conversational, personal and even agentic. As we build out new AI products and capabilities, having the right infrastructure in place to evaluate and observe AI is important. Arize has been a valuable partner on that front.”

Rahul Todkar, Head of Data and AI at Tripadvisor

“With GenAI, we’re facilitating more tailored experiences that adapt and respond to travelers’ needs faster than ever before. As we continue to innovate, our technical teams blend an approach of pioneering new tools in-house and using platforms like Arize to help in testing, evaluating and tracing new AI-powered applications and workflows.”

Jeroen Hofman, ML Engineering Manager at Booking

“Arize AI deserves a lot of credit for pioneering AI observability and creating a de facto standard for enterprises that want to achieve real-world results with generative AI. We’re proud to continue to back the company as it scales.”

Brett Wilson, General Partner at Swift Ventures