Snorkel AI raises $100M Series D at $1.3B valuation Alex Ratner: "Better data builds better models" Harvard physics to Stanford AI to unicorn CEO Snorkel AI hits $148M ARR in 2025 Expert Data-as-a-Service launches for frontier LLM developers What started as an "afternoon project" became a billion-dollar company Affiliate Professor at University of Washington Paul G. Allen School Snorkel technology deployed at Google, Apple, and Intel Snorkel AI raises $100M Series D at $1.3B valuation Alex Ratner: "Better data builds better models" Harvard physics to Stanford AI to unicorn CEO Snorkel AI hits $148M ARR in 2025 Expert Data-as-a-Service launches for frontier LLM developers What started as an "afternoon project" became a billion-dollar company Affiliate Professor at University of Washington Paul G. Allen School Snorkel technology deployed at Google, Apple, and Intel
Latest Snorkel AI closes $100M Series D - $1.3 billion valuation - May 2025
Co-Founder & CEO / Snorkel AI

Alexander Ratner

The physicist who taught machines to learn without asking for permission.

He started Snorkel as an afternoon project in a Stanford lab. A decade later it's a unicorn - and the infrastructure layer quietly powering how the world's largest enterprises train AI.

Data-Centric AI Weak Supervision Enterprise AI Stanford PhD Harvard Physics
Alexander Ratner, Co-Founder and CEO of Snorkel AI
Share: Twitter/X LinkedIn Facebook
$1.3B
Valuation (2025)
$148M
Annual Revenue
$338M
Total Funding
1,100+
Employees

The Man Who Made Data the New Moat

There is a sentence Alex Ratner has heard more times than he can count: "We have the model. We just don't have the data." He built a billion-dollar company around the fact that everyone keeps saying it and almost nobody knows what to do next.

Ratner's advisor at Stanford, Christopher Re, described it to him in 2015 as an "afternoon project." Build a tool, Re suggested, that lets researchers label training data without hand-annotating every single example. The two of them stood at a whiteboard. The math got complicated fast. The afternoon project took four and a half years - and became Snorkel, the open-source library that quietly rewired how the AI field thinks about data.

The key insight was almost perversely simple: instead of asking humans to label thousands of examples one by one, what if you could write rules - heuristics, knowledge bases, distant supervision - and let those rules do the labeling at scale? The labeled data would be noisy. Ratner's framework would clean it up statistically. The resulting model would still be good. Often, it would be very good.

The vast majority of data has no labels - or, at least, no useful labels for your application.

- Alex Ratner

To test the idea, two researchers used Snorkel to label 20,000 documents in a single day. The same task, done by hand, would have taken more than ten weeks. That was the number that mattered. Not the theory. Not the whiteboard math. The ten weeks turned into one day.

Before Stanford, Ratner studied physics at Harvard - the kind of education that teaches you to strip a problem down to its governing equations. He graduated in 2011, went into consulting, and found himself writing scripts to dig through patent databases. "I was fascinated," he has said, "by all this human knowledge locked inside unstructured text." The fascination outlasted the consulting job. He went back to graduate school and found Christopher Re's lab, where that fascination found its application.

The Snorkel paper, published at VLDB in 2018, earned a "Best Of" designation. Google deployed it internally under the name Snorkel DryBell. Apple, Intel, and U.S. government agencies followed. The research was working. The question was whether it could become a company.

The Dad Jokes Clause

When Ratner's second child was born, his team at Snorkel AI formalized something that had been happening informally. They granted him two dad jokes per day. Not one. Not unlimited. Two - a negotiated ceiling that reflects something real about how the company runs: with warmth, and with precision about things that matter.

Snorkel AI was incorporated in March 2019, the same year Ratner completed his PhD. He took the role of CEO - an unusual choice for an academic who had spent five years with his head in statistical learning theory. He describes his evolution into the role with characteristic directness: "You can have extremely kind empathetic, friendly people who are also very hard-charging and type A." He was building toward a culture where both are true at once.

The timing was not obvious. In 2019, the dominant framing of AI progress was all about model architectures. Transformers were ascendant. BERT had just landed. The field was in a kind of model-centric rapture. Ratner was making a bet in the opposite direction: that the real bottleneck was not the model. It was the data. And that enterprises, in particular, would run headlong into that bottleneck as they tried to deploy AI on their own proprietary domains.

The buck has shifted from model development to data labeling and development.

- Alex Ratner

He was right, and the market moved toward him. In 2021, Snorkel AI raised $85 million at a $1 billion valuation - unicorn status, three years in. The round was co-led by Addition and BlackRock. The company had built a platform that enterprises actually used to deploy AI at scale, not just experiment with it.

In May 2025, Snorkel AI closed a $100 million Series D at a $1.3 billion valuation, with investors including Prosperity 7 Ventures, Greylock, Lightspeed, BNY, and QBE Ventures. Alongside the round, Ratner launched two new products: Snorkel Evaluate, for measuring AI model performance, and Snorkel Expert Data-as-a-Service, which pairs domain experts with Snorkel's programmatic platform to produce high-quality datasets for frontier LLM developers. The company was reporting $148 million in annual revenue.

The context matters: agentic AI had become the obsession of 2025. Every enterprise wanted AI agents that could take actions, not just answer questions. Ratner's read was precise: "We are seeing a surge of momentum around agentic AI, but specialized enterprise agents aren't ready for production in most settings." The gap between demo and deployment was, again, the data. Snorkel was positioned exactly at that gap.

Alongside running the company, Ratner holds an appointment as Affiliate Assistant Professor at the University of Washington's Paul G. Allen School of Computer Science and Engineering. He corrects people who overstate the title - "I'm not the professor yet" - with the kind of precision you'd expect from someone who spent five years in a research lab where words mean exactly what they say.

He got into programming as a child, drawn to it for two reasons he still articulates: the instant feedback loop, and the fact that you can build things without asking anyone for permission. The second part has aged into something like a philosophy. Snorkel itself is an infrastructure play built on the premise that enterprises shouldn't have to wait for months of manual labeling every time they want to build a new AI application. The permission slip, in other words, is the data bottleneck. Ratner built a company to eliminate it.

From Physics to Machines

The Lawrenceville School
Secondary Education
2003 - 2007
Harvard University
A.B. in Honors Physics
2007 - 2011
Stanford University
PhD in Computer Science
Advisor: Christopher Re
2014 - 2019

Snorkel AI - Funding Trajectory

2019
Seed
Founded
2020
Series A/B
~$35M
2021
Series C - Unicorn
$85M @ $1B
2025
Series D
$100M @ $1.3B

Total funding: $338M | Lead investors include Addition, Greylock, Lightspeed, BlackRock

A Decade of Making Data Work

2007
Enrolled at Harvard University to study Honors Physics - the training ground for seeing systems clearly.
2011
Graduated Harvard, moved into consulting - writing code to parse patent databases and realizing how much human knowledge was trapped in unstructured text.
2014
Returned to academia; enrolled in Stanford's Computer Science PhD program under Christopher Re.
2015
Started the Snorkel open-source project at Stanford AI Lab - an "afternoon project" that would take four and a half years.
2016
Published "Data Programming" at NeurIPS 2016, the foundational paper establishing weak supervision as a rigorous ML methodology.
2018
Snorkel paper wins VLDB 2018 "Best Of." Google deploys the technology internally as Snorkel DryBell.
2019
Completed Stanford PhD. Co-founded Snorkel AI with Christopher Re and colleagues. Named Affiliate Assistant Professor at University of Washington.
2021
Snorkel AI raises $85M Series C at $1B valuation from Addition and BlackRock - achieving unicorn status two years after founding.
2025
$100M Series D at $1.3B valuation. Launched Snorkel Evaluate and Expert Data-as-a-Service. Company reports $148M ARR with 1,100+ employees.

Quotes That Define the Thinking

It all started with a massive con from my advisor and co-founder Chris - he suggested this as an 'afternoon project.'

You get to build a lot without having to ask anyone for permission!

The buck has shifted from model development to data labeling and development.

We are seeing a surge of momentum around agentic AI, but specialized enterprise agents aren't ready for production in most settings.

You can have extremely kind empathetic, friendly people who are also very hard-charging and type A.

I see Snorkel becoming a trusted partner for all large enterprises that are serious about AI.

Five Things That Tell the Story

01

He studied Honors Physics at Harvard before pivoting to computer science. The physics training - stripping problems to governing equations - still shapes how he frames AI challenges.

02

Two researchers labeled 20,000 documents in one day using Snorkel. The same task by hand: over 10 weeks. That single data point drove the company's early enterprise pitch.

03

His PhD advisor Christopher Re is also a co-founder of Snorkel AI. The company grew directly out of a research collaboration, not a garage startup.

04

The team formally granted Ratner two dad jokes per day upon the birth of his second child. The ceiling is negotiated and respected.

05

He holds the Stanford Bio-X Morgridge Family SIGF Fellowship - an interdisciplinary award bridging biology, medicine, and computing. His work on biomedical NLP earned it.

06

Before AI, he was in consulting, writing scripts to analyze patent databases. That early encounter with unstructured text containing locked human knowledge became the seed of his research vision.

Alex Ratner on Video

Follow Alex Ratner

- - - YesPress Profile - Alexander Ratner - Snorkel AI - Data-Centric AI Pioneer - - -