Coval evaluates AI voice and chat agents like self-driving cars


What do AI voice agents and self-driving cars have in common? Their performance can be evaluated in the same way, argues Brooke Hopkins, a former tech lead at Waymo. Coval, Hopkins’ new startup, looks to do just that.

“When I left Waymo, I realized a lot of these problems that we had at Waymo were exactly what the rest of the AI industry was facing,” Hopkins (pictured above in the center) told TechCrunch. “But everyone was saying that this is a new paradigm, we’re having to come up with testing practices from first principles and that basically we all have to recreate everything. And I looked at that and said, wait, we’ve spent the last 10 years in self driving figuring out how to do this.”

In 2024, she decided to launch Coval, a platform that builds simulations for AI voice and chat agents that tests and evaluates how they perform tasks in the same way Hopkins tested self-driving cars at Waymo. Coval can run thousands of simulations simultaneously, like having the agent make a restaurant reservation or having the agent respond to a customer service question asked in an indirect way.

Coval’s tech evaluates the agents on a general set of metrics, but companies can also customize what they are looking for and use Coval to continue to evaluate for regressions. Users can also take this data, and the insights they gleam off of it, and bring it to their end-customers either for a demo or as a monitoring tool to show their customers the agent is working as intended.

Coval Screenshot SentimentAnalysis 1

“One of the biggest blockers to agents being adopted by enterprises is them feeling confident that this isn’t just a demo with smoke and mirrors,” Hopkins said. “Choosing between vendors is a really complicated task for these executives because it’s just very hard to know what you even ask or how do you even prove that these agents are doing what you expect. And so this gives our companies the ability to really show that and demonstrate it.”

Hopkins really formulated the idea behind Coval during the Y Combinator Summer 2024 batch before launching the product publicly in October 2024. She said that demand has been strong and has become explosive in the last two months, with customers asking how quickly they can get their agents evaluated.

The San Francisco-based startup is now announcing a $3.3 million seed round led by MaC Venture Capital with participation from Y Combinator and General Catalyst. The startup will use the capital to build out its engineering team and work to achieve product-market fit. Hopkins added that the company will also be working toward enabling its users to evaluate other types of AI agents, like web-based agents, in the future.

Coval comes on the scene while both momentum — and hype — around AI agents appears to be at an all-time high. Enterprise tech leaders like Marc Benioff have been praising (and marketing) the technology by saying Salesforce will deploy more than a billion of its AI agents by next year. OpenAI is rumored to be releasing its take on an AI agent very soon.

There are also numerous startups building in the space, too. There were more than 100 startups building AI agents across Y Combinator’s three 2024 cohorts alone. Some AI agent startups have landed sizable venture funding rounds too. One, /dev/agents, raised a $55 million seed round at a $500 million valuation in November 2024, less than a year after it was founded.

This momentum means it’s likely that there will be more companies looking for help to evaluate their agents too. Hopkins said Coval has a good shot at standing out from the pack because, unlike the inevitable new entrants, Coval has a head start.

“I think where we really stand out is I’ve been working in this space for half a decade and I’ve built these systems over and over,” she said. “We’ve built multiple iterations and we’ve seen how they fail and how they scale and we’re building the same concepts into Coval and all of those learnings.”



Source link

About The Author

Scroll to Top