There is a particular kind of frustration that builds slowly, over many sprints, across hundreds of model outputs, until it becomes impossible to ignore. For Ankur Goyal, that frustration arrived while he was running the AI/ML platform at Figma, the company that had acquired his previous startup, Impira, in 2022.
Every team building with AI ran into the same wall: how do you actually know if the model is getting better? You tweak a prompt, you run it, you look at the outputs, you ask someone on the team whether they "feel" like it improved. That feeling is not a system. Goyal, who had spent years ensuring that SingleStore's distributed database performed reliably at scale, found the informality of AI quality measurement genuinely alarming.
He founded Braintrust in 2023 because the problem wasn't abstract: he had lived it personally. The company he built addresses the gap between a working AI demo and a production-grade AI product. That gap, it turns out, is mostly evaluation infrastructure.