A collaborative, AI-powered video editor for everyone who never wanted to learn the timeline keyboard shortcuts.
Somewhere in a coworking space in Austin, a social media manager has 47 minutes before her standup. She drags a raw iPhone clip into a Chrome tab. Kapwing transcribes it, drops in burned-in captions, removes a long pause where she said "so basically" three times, resizes the whole thing for a 9:16 Reel, and spits it out with the brand kit colors already locked in. She shares the link with her boss. Her boss leaves a comment at 00:42. She replies. They publish.
This is what video editing looks like in 2026 - and most of it happens without anyone downloading anything. Kapwing is the quiet protagonist of that scene. It is not the loudest name in the AI video category, and it has never tried to be. But it is one of the few that started as a browser-first tool, grew on raw word of mouth, and arrived at the AI wave with the muscle memory of a product that has been editing video in the cloud since before that was a normal thing to do.
The Kapwing pitch is small enough to fit in a tweet and stubborn enough to have driven a company for nine years: video editing should feel like a Google Doc.
Julia Enthoven and Eric Lu met at Google, where both were associate product managers shipping things you have probably used and never thought about - Google Identity, Image Search. They left in 2017 to build something simpler. The first version of Kapwing did exactly one thing: put text on top of a GIF. It was a meme maker, and the business model was three dollars to remove a watermark.
The meme maker worked. People searched for it. They paid. Enthoven and Lu kept adding adjacent micro-tools - resize, subtitle, trim, convert. Each one ranked on Google. Each one fed users back into the same editor. By the time CRV's Saar Gur led an $11M Series A in September 2019, traffic had grown 10x in a year and Kapwing was already the editor that creators texted to each other when someone asked, "wait, how did you do that?"
A multimedia studio masquerading as a website. None of it requires an install, a render farm, or a manual.
Multi-track timeline, real-time collaboration, templates - the core canvas where everything else happens.
Detects silences and filler words. One click deletes them. Your half-hour rambling Loom becomes a tight nine.
Re-shapes a speaker's mouth to fit translated or new audio. Dubbing without the uncanny valley.
Builds an emotion-aware digital replica of your voice for narration, dubbing, or that podcast intro you keep redoing.
Auto-generates accurate subtitles and burns them in. Style them once, batch the rest.
Translates and dubs into 100+ languages, lip sync included. The dubbing industry has noticed.
Cuts subjects out of video and stills. No green screen, no chroma key, no patience required.
Turns prompts, scripts, or articles into short videos with stock, voice, and captions wired in.
Approximate, drawn from public reviews and case-study patterns.
Enthoven studied computer science at Stanford and joined Google's APM program. Lu was on the same cohort track. They worked together on Identity and Image Search, then quit to build a product they wished they had as PMs - one where shipping a video for a launch did not require a coworker to install something and learn keyframes. Enthoven runs the company; Lu runs engineering.
The hiring story has not been linear. Enthoven has spoken openly about how hard it was to recruit during the early years, before the brand had its current weight. The team has stayed deliberately small - around 37 people last public count - and weighted heavily toward product, design, and ML engineering.
Stanford CS. Ex-Google APM. Forbes 30 Under 30 (Consumer Tech, 2020). Writes most of Kapwing's public-facing essays.
Engineer behind the browser-first architecture that Kapwing has bet its entire roadmap on.
Kapwing has raised about $14.4M across a seed and a single Series A. That is conspicuously little for a category where competitors have raised tens or hundreds of millions. The bet has been efficiency: a small team, a high-margin SaaS, and a free tier that does most of the marketing.
The Series A in September 2019 was led by CRV. Village Global, Shasta Ventures, Sinai Ventures, Jane VC, Harry Stebbings, Vector, and the Xoogler Syndicate joined. Six years later that same group is still on the cap table, and Kapwing has not announced a follow-on - a decision that, depending on how you read it, is either restraint or runway.
It is 9:12am in Austin. Standup is in three minutes. The Reel is published, the captions are clean, the boss has signed off in a comment thread that lives next to the video itself. There is no AfterEffects window open in the background. There is no .mov sitting on a desktop waiting to be re-rendered.
Nine years ago, that whole sequence would have required an installer, a tutorial, a tripod of expensive software, and at least one frustrated Slack message about a missing codec. Kapwing did not invent video. It changed the gesture. Video editing used to be a posture - hunched over a desktop, late, alone. Now, increasingly, it is a tab among tabs, shared like a doc, opened like a doc, closed like a doc.
That is the trick. And Kapwing is still the one quietly performing it.