BREAKING PULSE PROCESSES OVER ONE BILLION PAGES SEED $3.9M LED BY NFDG, FEB 2025 BATCH Y COMBINATOR S24 HQ 465 CALIFORNIA STREET, SAN FRANCISCO STACK IN-HOUSE VLM, OCR, LAYOUT, READING ORDER CUSTOMERS SAMSUNG, CLOUDERA, UC BERKELEY, HOWARD HUGHES BREAKING PULSE PROCESSES OVER ONE BILLION PAGES SEED $3.9M LED BY NFDG, FEB 2025 BATCH Y COMBINATOR S24 HQ 465 CALIFORNIA STREET, SAN FRANCISCO STACK IN-HOUSE VLM, OCR, LAYOUT, READING ORDER CUSTOMERS SAMSUNG, CLOUDERA, UC BERKELEY, HOWARD HUGHES
YesPress · Company File No. 003

PULSE

The quiet API turning the world's paperwork into structured data - one billion pages at a time.

● San Francisco ● Founded 2024 ● YC S24 ● 33 people
Pulse AI brand image
PULSE AI · DOCUMENT VISION · SHOT FOR YESPRESS, 2026
The Scene

A billion pages later

Somewhere on the 4th floor of an office tower at 465 California Street, a server quietly inhales a 312-page insurance claim with handwritten margin notes, a fax cover sheet from 1998, and a spreadsheet whose author retired in 2014. Eleven seconds later, it spits out JSON. Clean keys. Typed values. Reading order intact. The tables - the impossible, three-merged-cell, footnoted tables - parsed. This is what Pulse does. It does it about a billion times.

"Production-grade unstructured document extraction." — Pulse's one-line pitch, which, refreshingly, is also what it does
The Premise

The unsexiest problem in AI, solved properly

Every large language model demo is glittering and easy. The unglamorous truth is that the demo works because someone, somewhere, parsed the PDF first. RAG pipelines are only as smart as the parser feeding them. Hallucinations are often not hallucinations at all - they are punishment for upstream sloppiness.

Pulse is the upstream. Founded in 2024 by Sid Manchkanti and Ritvik Pandey - two engineers who left Tesla, NVIDIA, D.E. Shaw, and Goldman Sachs to build OCR, of all things - the company trained its own vision-language model from scratch. The bet: that the data-ingestion layer of the AI stack is not a commodity, and that the company that owns it owns a quiet kingdom.

So far the bet is holding. The API now sits inside Fortune 10 enterprises and AI-native startups in finance, healthcare, insurance, legal, real estate, and supply chain. Samsung uses it. Cloudera uses it. Howard Hughes uses it. UC Berkeley uses it. Most of them never tweet about it.

That is the Pulse personality in a sentence: shipped, deployed, indispensable, unbothered.

The Numbers

By the count

1B+
Pages processed
$3.9M
Seed round
33
People on team
S24
YC batch

Where Pulse Shows Up

Public customer + use-case footprint, by vertical
Finance
92
Healthcare
84
Insurance
78
Legal
66
Supply Chain
59
Real Estate
44

Indexed vertical mix, YesPress estimate from public materials. Not audited.

The Goods

What you actually get

API

Pulse API

Send a PDF, Word, Excel, image, or scan. Receive structured JSON ready for an LLM, a database, or a human who finally has their afternoon back.

Model

Pulse Studio VLM

An in-house vision-language model purpose-built for documents and spreadsheets. Layout detection, OCR, reading order, table parsing, chart conversion.

Schemas

Custom JSON Schemas

Invoices, tax forms, clinical notes, financial statements, contracts. Define the shape you want; Pulse extracts to it.

Deploy

Anywhere It Has To Live

Cloud API, VPC-isolated, on-prem, Docker, Kubernetes. SOC 2 Type II, ISO 27001, GDPR, HIPAA BAA. Built for the buyer who reads every clause.

The People

Two engineers, one wager

Co-Founder · CEO

Sid Manchkanti

UC Berkeley CS. Previously at NVIDIA and D.E. Shaw. Runs the company from San Francisco and answers email at sid@runpulse.com - which is itself a small data point about Pulse.

Co-Founder · CTO

Ritvik Pandey

Georgia Tech CS and Math. ML work at Tesla, plus a stint at Goldman Sachs. Leads the vision model and inference stack.

Customers, in public

SamsungClouderaFountain Grand CharterSyntra Howard Hughes CorpUC BerkeleyProof

Investors on the cap table

NFDG (Nat Friedman · Daniel Gross) Y Combinator Sequoia Scout Soma Capital Liquid 2 Ventures Olive Tree Capital Tiferes NVIDIA · OpenAI · Ramp execs
The Trajectory

A short, dense history

2024 · Summer
Pulse is founded in San Francisco. Joins the Y Combinator S24 batch with an unusually clear thesis: own the parser.
2024 · H2
First enterprise design-partners go live. The team trains its own document VLM rather than wrap someone else's.
2025 · February
Announces $3.9M seed led by NFDG, with YC, Sequoia Scout, Soma, Liquid 2, Olive Tree, and execs from NVIDIA, OpenAI, and Ramp.
2026 · Today
Past one billion pages processed. Team at 33. SOC 2 Type II, ISO 27001, HIPAA BAA, GDPR. Deploys from cloud API down to air-gapped on-prem.
The Footnotes

Things that amuse us

Naming

The name

"Pulse" - the heartbeat of structured data inside the enterprise. A small joke that the company takes seriously.

Roster

Pedigree-to-headcount ratio

Tesla, NVIDIA, D.E. Shaw, Goldman, AWS, Berkeley, Georgia Tech - in a team of 33. Density over scale.

Choice

They trained their own VLM

A sub-35-person company building a foundation model for documents is rare. Pulse did it anyway.

Back To The Scene

The server, eleven seconds later

Back at 465 California Street, the server has moved on. The claim is JSON now. A model downstream summarizes it, an adjuster reviews it, a payment clears. The fax cover sheet from 1998 is, somewhere in a row of a database, finally machine-readable. Nobody at Pulse will tweet about this particular page. There are 999,999,999 others behind it, and a few more arriving each second. The unsexiest problem in AI, getting quietly handled - on Cloudflare DNS, NVIDIA TensorRT, Kubernetes, and a vision model that did not exist two years ago. The work, as Pulse seems to prefer, speaks for itself.

Share Pulse

LinkedIn Twitter / X Facebook Instagram Copy URL