BREAKING
$81M raised • $700M valuation • November 2025 270 Fortune 500 companies use Wispr Flow 10% word error rate vs. 27% for OpenAI Whisper 220 words per minute — 4x faster than typing $10M+ ARR milestone reached October 2025 100+ languages including Hinglish Discussing $2B valuation • May 2026 80% six-month user retention 125 new enterprise customers per week
$81M raised • $700M valuation • November 2025 270 Fortune 500 companies use Wispr Flow 10% word error rate vs. 27% for OpenAI Whisper 220 words per minute — 4x faster than typing $10M+ ARR milestone reached October 2025 100+ languages including Hinglish Discussing $2B valuation • May 2026 80% six-month user retention 125 new enterprise customers per week
Company Profile AI & Productivity SaaS Series A — $81M Raised
Wispr Flow logo
San Francisco, CA • Founded 2021

Wispr
Flow

The AI that listens while you think, then types while you breathe. Voice dictation rebuilt from the neural layer up - for the 150-year-old keyboard's nervous successor.

220 Words / Minute
10% Error Rate
100+ Languages
270 Fortune 500 Clients
$700M Valuation (Nov '25)
SHARE LinkedIn Twitter / X Facebook

The Cursor Blinks. The Hands Don't Move.

Picture a lawyer in Chicago. It's 8 p.m., she has 400 words left to draft a contract, and her wrists ache from ten hours at a keyboard. She closes her laptop screen, opens Wispr Flow on her phone, and starts talking. Three minutes later, the contract is done - grammar corrected, clauses formatted, tone adjusted for the formality of the document. She never typed a word.

This is the scenario Wispr Flow was built around: the moment where speaking is simply faster, smarter, and less painful than typing. The company calls it "voice-first." The keyboard industry calls it a problem.


A 150-Year-Old Interface Nobody Questioned

The QWERTY keyboard was designed in 1874. It was optimized for mechanical typewriters - specifically, to prevent adjacent keys from jamming when typed in sequence. The mechanical problem has been gone for decades. The keyboard stayed anyway.

The average person types 40-60 words per minute. Speaking happens at 130-150 words per minute naturally, and comfortably faster for practiced users. For most of computing history, voice-to-text technology was too inaccurate, too slow, or too rigid to replace the keyboard. Dragon Dictate required training sessions. Siri required patience. Google Voice required low expectations.

The real bottleneck, it turns out, wasn't microphones or bandwidth. It was the AI. The models weren't good enough - and then, suddenly, they were.

"The keyboard is not inevitable. It's a historical accident that became a default. We exist to fix that."
- Tanay Kothari, CEO & Co-Founder, Wispr Flow

Two Stanford Roommates, One Very Large Bet

Tanay Kothari and Sahaj Garg met at Stanford. They shared a dorm, studied AI, and both ended up publishing research at Stanford's AI Lab under Andrew Ng - the researcher who helped put machine learning on the map. Kothari went on to found FeatherX, a personalization platform that was acquired. Garg published with Google Research and Columbia Psychiatry, building expertise in computational neuroscience. In 2021, they founded Wispr AI, Inc. together.

The bet they made was specific: general-purpose speech recognition models like OpenAI's Whisper were good but not great. They hallucinated words, struggled with accents and code-switching, and didn't adapt to the user over time. Kothari and Garg believed the gap between "tolerable" and "addictive" was worth building a company around.

Tanay Kothari
CEO & Co-Founder

Stanford CS + AI master's. Taught Stanford's Deep Learning course alongside Andrew Ng. Previously founded FeatherX (acquired by Cerebra Technologies). Personally onboarded over 500 early Wispr users via Google Meet calls - probably the only Series A CEO who schedules onboarding sessions himself.

Sahaj Garg
CTO & Co-Founder

Stanford engineering - Henry Ford II Scholar Award, the school's highest academic honor. Research in generative modeling, computational neuroscience, and photonic hardware. Published with Google Research and Columbia Psychiatry. The kind of CTO who actually understands what the model is doing.


Not Transcription. Something Smarter.

Wispr Flow is not a transcription app in the traditional sense. Transcription just converts speech to text. Wispr Flow converts speech to appropriate text. The system is context-aware: it knows you're in Slack and keeps things casual. It knows you're in Google Docs drafting a report and formats accordingly. It knows you're in VS Code and doesn't autocorrect variable names.

The AI auto-edits feature cleans up the filler words, false starts, and rambling that comes with natural speech - without removing your meaning. A personal dictionary learns your clients' names, your technical jargon, your niche acronyms. Snippet shortcuts let you speak a phrase and expand it to a paragraph you'd typed a hundred times before.

On all four major platforms - Mac, Windows, iOS, Android - it works inside any app. No copy-paste. No switching tabs. The voice goes in, the text comes out, right where your cursor was.

🎙
AI Auto-Edits

Cleans up rambling speech into clear, formatted prose without changing what you meant to say.

📖
Personal Dictionary

Learns your terminology, proper nouns, and industry jargon. Gets better the more you use it.

Voice Shortcuts

Speak a trigger phrase, expand to a full paragraph. Templates meet voice dictation.

🌐
100+ Languages

Including Hinglish - real-world Hindi-English code-switching, not lab-condition bilingualism.

🔒
Enterprise Security

SOC 2 Type II certified. HIPAA-compliant. Private cloud. Multi-region deployment. Used in healthcare, legal, and finance.

🖥
Cross-Platform

Mac (Oct '24), Windows (Mar '25), iOS (Jun '25), Android. One subscription, everywhere you work.

Word Error Rate Comparison — Voice Dictation Platforms
Wispr Flow
10%
OpenAI Whisper
27%
Apple Native
47%

Lower is better. Data from Wispr Flow internal benchmarks, 2025.

The gap between "barely usable" and "I'll never type again" turns out to be about 37 percentage points.

From Dorm Room to Fortune 500 Layer

Three and a half years. $81M raised. The keyboard's lunch, slowly being eaten.

The Numbers That Don't Require Spin

The clearest sign that Wispr Flow is doing something right is the retention data. After three months of using the app, people dictate over half of everything they write. Not some of it. Not "I use it when I remember." More than half. The keyboard becomes the backup.

At six months, 80% of users are still active. That's a number that takes product people years to earn. Reid Hoffman - the LinkedIn co-founder who has spent thirty years thinking about how professionals communicate online - publicly switched from typing to talking. Clay's go-to-market team reported 20% more customer calls per day after adoption. One CEO dictated 70% of a board document using the app.

The enterprise traction tells a parallel story. Wispr is adding 125 new enterprise customers per week, with 270 of the Fortune 500 already on board. The conversion rate from free to paid sits at 19% - unusually high for a productivity tool where the competition includes "just use what you already have."

80% 6-month retention
50%+ Characters via voice after 3 months
19% Paid conversion rate
125/wk New enterprise clients
"I was able to dictate ~70% of our Q2 board doc with Flow. It was a massive time saver."
- CEO of a Wispr Flow enterprise customer

The Investors Who Voted First

Wispr Flow has raised $81M across three rounds as of late 2025, with reports of a further $260M round in discussion at a $2B valuation. The investor list reads like a who's who of people who take seriously the idea that voice will eventually win: Menlo Ventures led the Series A, with NEA, 8VC, and notable individual backers including Evan Sharp (Pinterest co-founder), Henry Ward (Carta CEO), and Steven Bartlett's Flight Fund joining the cap table.

Round Amount Date Lead Investor
Seed / Early ~$12M 2021 - 2024 AIX Ventures, NEA, 8VC, Neo
Series A $30M June 2025 Menlo Ventures
Series A Extension $25M November 2025 Notable Capital
Series B (In Discussion) ~$260M 2026 Menlo Ventures (reported)

A Voice Interface for a Billion People

The current product - a dictation app - is, in Wispr's own framing, a stepping stone. The roadmap includes voice-to-action: not just converting speech to text, but using it to trigger workflows, schedule meetings, update CRMs, and control software. The terminology the company has started using is "voice operating system."

That framing is ambitious, probably intentionally so. But the data suggests it's not entirely science fiction. When 25-30% of your user base voluntarily opts in to share voice data to help train your models, you have something rarer than users: you have participants. The personalization flywheel - your dictionary grows, your shortcuts multiply, the model learns how you phrase things - makes each account progressively harder to leave.

The India expansion adds another dimension. Hinglish - the spontaneous code-switching between Hindi and English that hundreds of millions of people do naturally - is a notoriously hard problem for speech AI. Wispr Flow is building a model for it anyway, betting that whoever solves the multilingual code-switching problem first will own the voice interface for a large share of the planet.

"Whoever solves voice for a billion people will define the next era of human-computer interaction. We think that company could be us."
- Wispr Flow, Series A Extension announcement, November 2025

The Keyboard Was Never the Point

The keyboard was always a means to an end: getting human thought into a machine. For a very long time, it was the best available means. That is changing. The interface question is becoming live again for the first time since the mouse in the 1980s and the touchscreen in 2007.

Wispr Flow is not trying to make the keyboard slightly better. It's trying to replace the premise that the keyboard is the primary input device. The company is building this from the neural layer up - custom speech models, personalized over time, with context-awareness that no general-purpose transcription tool can replicate.

For knowledge workers - the lawyers, the sales reps, the developers, the executives - the compounding value of removing the keyboard bottleneck is real. If you can think 4x faster than you can type, and an AI can keep up with your thinking, the output of a working day changes. Not a little. Fundamentally.

The lawyer in Chicago closes her Wispr Flow session. The contract is done. She did not type it. She thought it, spoke it, and the machine turned it into the document her client expects by morning.

Three years ago, that sentence would have been fiction. Today, it's a Tuesday night in San Francisco, two Stanford roommates are running a company worth $700 million, and 270 of the Fortune 500's legal, sales, and engineering teams are quietly making the same transition she just did.

The keyboard is still there. It just feels slightly more like furniture than tool.