Tagged Content
Everything on the platform tagged with speech-to-text.
Rev is an American speech-to-text company that pairs the world's most accurate AI speech recognition with a global network of human transcriptionists to deliver transcription, captions, and subtitles at up to 99% accuracy. Founded in 2010 by six MIT-connected entrepreneurs, Rev serves over 100,000 customers and more than a million users across legal, media, education, and enterprise, and has increasingly focused its AI on the legal market with tools for depositions, evidence, and case prep.
Nagish is a New York-based assistive-technology company that uses proprietary AI to caption phone calls in real time, converting speech to text and text to speech so people who are deaf or hard of hearing can make and receive calls independently and privately - without a human relay operator. Its name means 'accessible' in Hebrew. The company is one of the few firms certified by the FCC to provide telecommunication relay services and offers its consumer app for free.

Tomer Aharoni is the co-founder and CEO of Nagish, a New York startup using AI to caption phone calls in real time so Deaf and hard-of-hearing people can place and receive calls by typing and reading, with no human operator in the loop. The idea began with a phone ringing during a class at Columbia and a question he couldn't shake: how do you take a call if you can't hear or speak? Nagish (Hebrew for 'accessible') is FCC-certified, is free to users through federal subsidies, and has raised $16 million. Aharoni builds the product hand-in-hand with the Deaf community and is now pushing into AI sign-language translation.
Deepgram builds foundational voice AI - speech-to-text, text-to-speech, and full voice-agent APIs - used by more than 1,300 enterprises including NASA, Spotify, Twilio and Citibank to give machines the ability to listen, understand, and respond in real time.
Krisp is a voice AI company that strips background noise, background voices, and echo from live calls using deep learning, then layers transcription, meeting notes, accent conversion, and real-time translation on top. Founded in 2017 by ex-Twilio engineers, it now processes 75+ billion minutes of audio a month for contact centers, BPOs, Discord, and millions of remote workers.
Vapi is a San Francisco developer platform for building, testing, and deploying conversational voice AI agents over phone and web. It abstracts the messy plumbing of speech-to-text, LLMs, text-to-speech, and telephony so developers can ship human-sounding voice agents in minutes, with sub-500ms latency and enterprise-grade compliance.
Scott Stephenson is the co-founder and CEO of Deepgram, the voice AI company building foundational speech-to-text, text-to-speech, and voice agent models from scratch. A particle physicist who once helped build a dark-matter detector two miles underground, he now runs an AI company used by NASA, Spotify, and Twilio.
Otter.ai builds AI meeting assistants that join your Zoom, Teams, and Google Meet calls, transcribe them in real time, summarize the noise, and pull out action items - so people stop scribbling and start paying attention.

Tanay Kothari is the co-founder and CEO of Wispr Flow, a San Francisco-based AI company building the voice interface for the AI era. A four-time founder who taught himself to code at age nine in New Delhi, Kothari holds BS and MS degrees from Stanford in Computer Science and AI, taught Deep Learning alongside Andrew Ng, and published medical AI research. After selling his first startup, FeatherX, to Cerebra Technologies straight out of college, he co-founded Wispr in 2021 with Stanford classmate Sahaj Garg. The company's flagship product, Wispr Flow, transforms spoken ramblings into polished writing across 100+ languages with sub-second latency. It has achieved 50% month-over-month growth and a 20% paid conversion rate - five times the industry standard. Wispr has raised $81 million total, including a $30 million Series A led by Menlo Ventures and a $25 million extension led by Notable Capital, at a $700 million valuation. Named to Forbes 30 Under 30 in 2023, Kothari is betting that keyboards will be vintage store items within five years.

Willow is a San Francisco-based AI voice dictation startup that replaces the keyboard with voice input across any app. Built by two Stanford dropouts, the product delivers sub-500ms latency, 40%+ higher accuracy than built-in dictation tools, and context-aware transcription that handles technical jargon and proper nouns. Backed by Y Combinator and a $4.5M seed round, Willow targets knowledge workers - engineers, managers, sales teams - helping them type 4x faster by speaking naturally. Enterprise customers include Uber, Gusto, Canva, and GitHub.