Vellum

Who They Are Now

A quiet layer under a very loud industry

It is 2026, and everyone is shipping AI. The demos are dazzling. The launch posts are breathless. And somewhere in a New York office, a 33-person company is doing the part nobody tweets about: making sure the dazzling thing still works on Tuesday, after the model changed, when a real customer asks it something nobody anticipated.

That company is Vellum. It does not sell magic. It sells the floor under the magic - a development platform where teams build, evaluate, deploy, and monitor large language model applications without crossing their fingers. More than 150 companies now run AI on it, from compliance software to real estate to telecom to healthcare. Most of their users have never heard the name. That is rather the point.

Vellum is the part of the AI boom that doesn't photograph well - and may be the part that lasts.- The thesis, in one line

The Problem They Saw

The demo lies. The production system tells the truth.

Building an AI demo is easy. Embarrassingly easy. You wire up a model, write a clever prompt, get a jaw-dropping result, and feel like a genius for an afternoon. Then you try to ship it to real users, and the genius act collapses.

The prompt you wrote last week behaves differently this week. You have no record of what changed, no way to measure whether the new version is better or worse, and no warning when it quietly breaks for one user in a thousand. The model provider pushes an update; your app's behavior shifts underneath you. There is, in other words, no ground to stand on.

Developing AI feels like writing software in quicksand - the ground keeps shifting and teams struggle just to stay afloat.- Akash Sharma, Co-Founder & CEO

The founders had lived this exact frustration. Before Vellum, the three worked together at Dover, a Y Combinator company, where they spent two-plus years building real LLM features - recruiting emails, job descriptions, the unglamorous stuff. They kept hitting the same wall: they couldn't version their prompts in production, couldn't measure quality, couldn't tell whether a change helped or hurt. The pain was specific. It was also, they suspected, universal.

The Founders' Bet

What if AI just needed software's old habits?

Three people made the wager. Akash Sharma, who had spent five years at McKinsey's Silicon Valley office before deciding the more interesting problem was building, not advising. Sidd Seethepalli and Noa Flaherty, both MIT engineers - Sidd from Quora's ML platform team, Noa from DataRobot's MLOps group. People who had seen, up close, how real machine learning gets shipped and maintained.

Their bet was almost contrarian in its modesty. While much of the industry chased bigger models and flashier capabilities, Vellum bet on discipline. Testing. Version control. Monitoring. The decades-old practices that made traditional software trustworthy - applied, finally, to the weird new world of probabilistic AI. They called it test-driven development for AI, and entered Y Combinator's Winter 2023 batch to build it.

It's one thing to build a demo. It's another thing entirely to get it production-ready.- Vellum's founding observation

Akash Sharma

Co-Founder & CEO

Five years at McKinsey's Silicon Valley office before trading slide decks for a product. Announced both the seed and the Series A.

Sidd Seethepalli

Co-Founder & CTO

MIT engineer, formerly on Quora's ML platform team. Drives the platform's technical architecture and security posture.

Noa Flaherty

Co-Founder & CTO

MIT-trained, came through DataRobot's MLOps team and product engineering at Dover. The MLOps instinct made flesh.

The Vellum Milestones

Demo → Dependable, charted

2022 — The itch

Building LLM features at Dover

Akash, Sidd, and Noa spend 2+ years shipping production AI and discover prompts you can't version, measure, or trust.

Winter 2023

Vellum joins Y Combinator

The three found Vellum to bring software rigor to LLM development and enter YC's W23 batch.

July 2023

$5M seed round

Backed by Rebel Fund, Eastlink Capital, Pioneer Fund, Y Combinator and angels as demand for AI tooling surges.

2024

Enterprise trust, earned

SOC 2 Type 1 & Type 2 attestations plus HIPAA compliance - the table stakes for handling sensitive customer data.

2025

Vellum for Agents

An agent builder that takes a plain-language task, asks follow-ups, connects your tools, and shows its work.

July 2025

$20M Series A

Led by Leaders Fund with YC, Socii, Rebel and Pioneer. 150+ companies on board; new verticals and markets next.

The Product

One platform for the whole messy AI lifecycle

Vellum's pitch is not a feature; it is a sequence. The entire arc of building an AI application - from first prompt to live monitoring - lives in one place, so the thing you tested is the thing you shipped, and the thing you shipped is the thing you can watch.

Workflow Builder

A visual UI plus an SDK for designing and testing AI logic - including RAG and document retrieval - so the wiring is legible instead of buried.

Evaluation Suite

Quantitative testing that measures prompt and model quality and catches edge-case failures before a customer ever does.

Deployment & Versioning

Update AI behavior without a risky redeploy, with full version control. Roll forward; roll back; sleep.

Monitoring

Real-time observability with feedback loops - even nightly evaluation jobs - so regressions surface while you sleep.

Prompt & Model Lab

Write, compare, and fine-tune across every major LLM provider. Swap models without rewriting your app.

Vellum for Agents

Describe a task in plain language. It asks questions, connects your tools, handles the logic, and shows exactly what it did.

The thing you tested is the thing you shipped - a sentence that should be obvious and somehow isn't.- Why one platform beats five

The Proof

Receipts, not adjectives

Anyone can claim reliability. The interesting question is whether customers behave as though it's true - and Vellum's do. The platform's users skew toward companies that cannot afford to be wrong: compliance, healthcare, real estate, telecom.

Drata Redfin Swisscom Headspace DeepScribe Rely Health Rentgrata GravityStack

Take Drata, the compliance-automation company. Answering security questionnaires used to eat 3 to 5 hours each. Built on Vellum, that dropped by 97%. Drata now runs more than 28,000 isolated vector databases through Vellum and fires off nightly evaluation jobs to catch AI drift before morning. Redfin pushed Vellum-powered features to millions of users across 14 markets. DeepScribe cut the time clinicians spend iterating on notes by 20 to 40%.

What "production-ready" buys you

Reported customer outcomes on Vellum • sources: Vellum, Drata case studies

Drata
questionnaire time cut

97% faster

DeepScribe
note-iteration time cut

up to 40%

Drata
vector databases run

28,000+

Companies
building on Vellum

150+

Bars are scaled for readability, not a shared axis - each measures a different thing. The numbers, though, are the customers' own.

Drata runs its AI checks nightly on Vellum - which means the robot's homework gets graded while everyone is asleep.- The least glamorous superpower in AI

The Mission

Make AI development boring - the good kind

Vellum's mission reads almost like an anti-pitch: help teams bridge the gap from proof-of-concept to production. No promises of artificial superintelligence. No manifesto about transforming humanity. Just the deeply unfashionable belief that AI, like every powerful technology before it, becomes useful only when it becomes dependable.

There is a quiet irony here. The most hyped technology of the decade turns out to need the least hyped thing in software - rigor. Vellum's wager is that when the novelty wears off, what remains is the work: testing, versioning, watching, fixing. The company built its whole business on the unsexy middle of that sentence.

Things that amuse and inform

The founders met building AI at another YC startup - their product is basically the tool they wished they'd had.
Two MIT engineers and a McKinsey operator walked into a YC batch. The output now powers 150+ AI products.
"Test-driven development for AI" borrows a decades-old software discipline for a brand-new problem.
Most people who use software built on Vellum will never know the name - infrastructure's quiet curse.

Why It Matters Tomorrow

When the demos stop being enough

Here is the bet on the future, stated plainly. The AI industry is sprinting through its demo phase. Eventually - sooner than the hype suggests - the demos stop being enough. Customers will want AI that works the hundredth time, the thousandth time, the time it really counts. The companies that can deliver that will need the boring machinery underneath. The ones that can't will be the cautionary tales.

The next frontier is agents - AI that doesn't just answer but acts. Acting is far less forgiving than answering; a wrong word is awkward, a wrong action is expensive. Vellum's Series A is, in part, a bet that the discipline it built for prompts is exactly what agents will require, only more so.

In a gold rush, the people who get rich are not always the ones panning. Sometimes they sell the floor everyone stands on.- The unglamorous case for Vellum

Return to that New York office, 2026. Everyone is still shipping AI. The demos are still dazzling. But somewhere downstream of all that noise, a small team keeps making sure the dazzling thing works on Tuesday - and the ground, for once, holds. Vellum did not make AI smarter. It made AI trustworthy enough to ship. In an industry drunk on possibility, that may be the most valuable sobriety on offer.

Watch

Demos & conversations

Product walkthroughs and founder talks - see the platform move instead of taking our word for it.

► Vellum on YouTube - product demos & tutorials ► Search: Vellum AI product demo ► Search: Akash Sharma founder interviews

Vellum.

A quiet layer under a very loud industry

The demo lies. The production system tells the truth.

What if AI just needed software's old habits?

The Vellum Milestones

Building LLM features at Dover

Vellum joins Y Combinator

$5M seed round

Enterprise trust, earned

Vellum for Agents

$20M Series A

One platform for the whole messy AI lifecycle

Workflow Builder

Evaluation Suite

Deployment & Versioning

Monitoring

Prompt & Model Lab

Vellum for Agents

Receipts, not adjectives

What "production-ready" buys you

Make AI development boring - the good kind

Things that amuse and inform

When the demos stop being enough

Demos & conversations

Find Vellum