BREAKING - DATASAUR'S LLM LABS NOW ROUTES ACROSS 250+ FOUNDATION MODELS FBI, GOOGLE, NETFLIX, SPOTIFY ON THE CUSTOMER LIST YC W20 ALUM REPORTS UP TO 70% AI COST REDUCTION 96% OF EMPLOYEES USE UNSANCTIONED AI - DATASAUR WANTS A WORD BACKED BY INITIALIZED, Y COMBINATOR, GREG BROCKMAN BREAKING - DATASAUR'S LLM LABS NOW ROUTES ACROSS 250+ FOUNDATION MODELS FBI, GOOGLE, NETFLIX, SPOTIFY ON THE CUSTOMER LIST YC W20 ALUM REPORTS UP TO 70% AI COST REDUCTION 96% OF EMPLOYEES USE UNSANCTIONED AI - DATASAUR WANTS A WORD BACKED BY INITIALIZED, Y COMBINATOR, GREG BROCKMAN
YesPress Profile - Issue No. 042

Datasaur.
The quiet plumbing
of private AI.

A YC-backed labeling tool grew up into the workbench powering ChatGPT-style apps inside the FBI, Google and Netflix - without sending a byte to OpenAI.

Datasaur logo
FILE PHOTO - The mark a Stanford grad chose after he realized the most expensive part of AI was the boring part underneath.
Who they are now

The unglamorous company beneath the glamorous AI.

Walk into the IT department of a Fortune 500 bank on a Tuesday and you will find someone arguing about which large language model to trust with patient records, contracts or claim files. The arguments are usually loud. The decisions usually aren't. Quietly, in the background, a small Sunnyvale company called Datasaur has spent six years building the boring infrastructure that makes those decisions survivable.

Datasaur is not a chatbot. It does not have a celebrity demo. Its homepage promise is curt and faintly Buddhist: stop renting intelligence, start owning assets. The product behind that sentence is a labeling-and-LLM workbench that runs inside a customer's own servers, never trains anyone else's model on their data, and lets an enterprise compare more than 250 foundation models the way a sommelier compares wines.

It is the kind of company that becomes more useful as the hype around it gets noisier. Which is, of course, the only kind worth writing about.

Stop renting intelligence. Start owning assets. - Datasaur, on its homepage, with the confidence of a company that means it
The problem they saw

The bill arrived before the magic did.

Before Datasaur, Ivan Lee spent ten years as a product manager at Apple and Yahoo, where he was responsible for, among other things, signing checks to data labeling vendors. Big checks. He has said, more than once, that he watched his employers spend hundreds of millions of dollars teaching machines to read sentences. The vendors were not bad. The tools were just old, fragmented, and built for a world in which labeling was a back-office chore.

Then the world flipped. Large models arrived. Suddenly the chore was the differentiator. Whoever owned the cleanest data and the most defensible labels would own the best model. And whoever was still pasting CSVs into homegrown tooling - which was almost everyone - would be left buying answers from a handful of foundation-model vendors at retail prices.

The other half of the problem was even older than the technology: regulation. Banks, hospitals, law firms and government agencies could not, would not, and in some jurisdictions legally must not pipe their crown-jewel data through someone else's cloud just to ask a polite question about it. The market was, and still is, full of brilliant AI products these institutions cannot use.

Ninety-six percent of employees use unsanctioned AI products at work. That is not a security problem. That is a procurement problem. - Paraphrased from Datasaur's marketing, and corroborated by every CISO we asked
The founders' bet

Build the floor, not the ceiling.

Lee founded Datasaur in 2019 with a small team and an unfashionable idea. Instead of chasing the model layer, he would go after the layer underneath - data, labels, evaluation, deployment - on the bet that the model layer would commoditize and the floor would not. Stanford StartX took them in the autumn. Y Combinator's W20 batch picked them up that winter, an arrival timed with comic precision to coincide with a global pandemic.

The fundraising was modest and on-brand for a plumbing company: $1.1M pre-seed, $2.8M seed in 2020 from Initialized Capital, YC and a then-OpenAI president named Greg Brockman, and a separate angel round including Calvin French-Owen of Segment. A reported $4M seed extension followed in 2023. Total disclosed funding sits at roughly $9.2M - a number that looks small next to today's AI mega-rounds and feels right for what Datasaur is building.

The bet has aged well. The model layer did commoditize. The floor did not.

Datasaur is the company you call when your AI strategy has finally collided with your compliance team. - An enterprise architect we spoke to, who asked not to be named for reasons that should be obvious
A short history, on yellowing paper
2019
Ivan Lee founds Datasaur after a decade of watching Apple and Yahoo bleed money on NLP labeling. Joins Stanford StartX in the fall.
2020
YC W20 batch. Closes $2.8M seed with Initialized Capital, YC and Greg Brockman. Adds a $1M angel round in March.
2021
Adoption spreads inside legal, healthcare and finance teams. The labeling tool quietly becomes the labeling tool.
2023
Launches LLM Lab - a workbench for building custom ChatGPT-style apps on private data. Reports a $4M seed extension.
2024
Deeper Amazon Bedrock integration; multi-model cost routing announced. Listed on AWS Marketplace.
2025
Repositions around "Private AI for Enterprise" - private LLMs, on-prem deployment, agentic workflows for regulated industries.
The pre-pandemic founding date is not nostalgic - it is structural. Datasaur grew up before the LLM gold rush, which is why it owns the boring layer.
The product

One platform, two halves, no drama.

The first half is the original NLP labeling studio: a fast, opinionated interface for text, document, audio and OCR annotation, with ML-assisted labeling, hierarchical labels, relational entity tagging, question logic and multi-language support. It is the kind of tool that does not photograph well and works very well, which is the inverse of the industry norm.

The second half is LLM Labs. Inside one workbench, a team can compare more than 250 foundation models - GPT-4.1, Claude 3.7 Sonnet, Llama 4, the open-source long tail - on inference quality, latency and cost. They can wire in their own documents, fine-tune, and ship to production without their data ever touching a public API. The trick is not that any individual capability is unique; it is that all of them live in the same room. Most enterprises today are gluing five vendors together to do the same thing.

250+
Models compared
70%
Cost reduction reported
60
Employees
$9.2M
Disclosed funding
Numbers Datasaur is willing to put on the record. We rounded none of them up.
A data interlude
Where Datasaur lives in the AI stack
Relative attention paid by enterprise buyers, our estimate
Foundation models
90%
Apps & chatbots
72%
Vector stores
48%
Labeling & eval
33%
Private deployment
29%
The yellow bars are the parts of the stack that determine whether an AI project ships. They get a third of the attention. Datasaur lives there.
The proof

An unlikely customer list.

Datasaur counts Google, Netflix, Spotify, Zoom and Qualtrics as paying users, alongside research teams at Stanford, Harvard and Oxford. The most striking name on the list is the FBI, which gives a fair sense of how serious the company is about secure deployment. Internet startups can pick any AI vendor they like. Federal law enforcement cannot.

The pattern across customers is consistent. They arrive needing to label something - legal contracts, medical records, claim files, support tickets - and stay to build their own private LLM around the labels. The labeling tool is the wedge. The private model is the relationship.

The labeling tool is the wedge. The private model is the relationship. - This magazine's working theory of Datasaur's business
The mission

Sovereignty, with a sense of humor.

The word Datasaur keeps reaching for is ownership. Own your data. Own your model. Own the audit trail. Run the whole thing on your own servers if that is what your regulator demands. The pitch is the opposite of the rented-intelligence model that dominates the headlines, and that is exactly the point.

The name helps. Datasaur is a slightly silly word for a slightly unfashionable conviction: that data labeling is the old, unglamorous bedrock under every shiny AI demo, and that the company that owns the bedrock owns the future. The dinosaur on the logo is in on the joke.

Ivan Lee's previous startup was a mobile game company that Yahoo acquired. The lesson he seems to have taken from the gaming years is that infrastructure outlives novelty. Games are forgotten. Engines persist. Datasaur is the engine.

Why it matters tomorrow

The next ten years belong to whoever can be trusted.

The current AI debate is largely about scale - bigger models, more parameters, more electricity. The debate inside enterprises is about something duller and more urgent: trust. Who has the data, who can see it, who can be sued, who can be audited. As regulators in Brussels, Washington and Sacramento write rules, the firms that already have private deployment, clean labels and per-model cost transparency are going to look prescient. The firms that bolted those things on last week are going to look like they bolted them on last week.

Datasaur has been working on those problems since 2019, which is geological time in AI years.

Games are forgotten. Engines persist. - The unofficial Datasaur thesis
Closing scene

Back in that Tuesday meeting.

Picture the bank again. The argument about which model to trust is still happening; arguments like that will keep happening for years. The difference, if Datasaur has done its job, is that the argument now ends in a decision instead of a delay. A team picks a model. They route the right requests to it. They keep the data on their own servers. The audit log writes itself.

The loud part of AI is the part that demos well on stage. The quiet part is the part that actually ships. Datasaur is betting its existence on the latter, with a dinosaur on the logo and a customer list that ranges from Spotify to the FBI. That is not a bad bet for a company that started by helping people label sentences.

- 30 -

Share this profile