The man who watched Google Translate launch - and decided the really hard problem hadn't even started. Co-founder and CEO of LILT AI, he's building the translation infrastructure the internet actually needs.
There's a specific kind of engineer who goes to Stanford, gets the PhD, and still doesn't feel like the problem is solved. Spence Green is that engineer. He watched Google Translate go live and felt not satisfaction but urgency - because the organizations that create most of the world's information still weren't publishing it in every language their audiences needed.
Green grew up in a world where language barriers are not abstractions. After 9/11, he began learning Arabic - not as a career move, but as an attempt to understand a world that suddenly felt less legible. That instinct - toward connection, toward comprehension - runs directly through everything he has built since.
Before the startup, he was a defense engineer at Northrop Grumman, writing software for avionics systems and a national air defense network. Then a researcher at Johns Hopkins. Then a software engineer at Google, specifically on Translate, specifically working on the English-to-Arabic pipeline. He noticed a gap between what the technology could do and what organizations actually needed - not just a translation, but a verified, quality-assured, brand-consistent localization at scale.
He enrolled at Stanford, where he worked with Chris Manning (one of the foundational figures of modern NLP) and Jeff Heer (a leading visualization and HCI researcher). The combination is unusual - most NLP researchers don't spend much time thinking about how humans interact with their systems. Green did. His dissertation focused on mixed-initiative translation: systems where the machine generates candidates and humans make decisions, each improving the other.
That idea became LILT.
In 2015, Green and fellow Google Translate alumnus John DeNero incorporated LILT in San Francisco. The premise was simple but technically demanding: combine neural machine translation with professional human translators in a tight feedback loop where each pass makes both the machine and the human faster and more accurate. LILT called its prediction engine adaptive - meaning it learns from every correction a translator makes.
"Until we can get that down to a dollar or something like that, we have not solved the problem."
- Spence Green, on translation costs and the mission still ahead
Early exposure to formal NLP research, laying groundwork for later translation work.
Built software for a national air defense system and avionics packages for naval aircraft. Defense rigor would later inform LILT's work with U.S. government clients.
Worked directly on Google Translate, developing a shallow syntactic language model to improve English-to-Arabic translation. Met future co-founder John DeNero.
Gained early exposure to venture capital and the startup ecosystem while completing his doctoral studies.
Graduated with research focused on mixed-initiative translation systems, working with advisors Chris Manning and Jeff Heer. Published on statistical machine translation, language parsing, and human-computer interaction.
Incorporated in San Francisco alongside John DeNero. Built the company as an AI-first enterprise from day one.
Closed Series B funding and won an AFWERX SBIR Phase II contract with the U.S. Air Force for language translation supporting air, space, and cyber operations.
Raised the largest round in LILT's history with participation from Sequoia Capital, Intel Capital, Redpoint Ventures, and XSeed Capital. Total raised reached $95.5M.
One-on-one conversations with business and tech leaders on the future of AI, translation, and global enterprise communication.
* Investors include: Sequoia Capital, Intel Capital, Redpoint Ventures, Four Rivers, Sorenson Capital, CLEAR Ventures, Wipro Ventures, XSeed Capital, In-Q-Tel
Most translation tools work in one direction: machine generates, human checks, content ships. LILT's architecture flips the dynamic. As a human translator works, the engine watches - updating its predictions in real time, learning correction patterns, adapting to the specific domain, tone, and terminology of that translator's workflow. The result is a human-machine team that gets faster over time rather than plateauing.
Green describes this as the core insight: machine translation systems can generate but cannot verify. An enterprise shipping product documentation in 47 languages can't afford to find out there was an error after the fact. LILT's model keeps humans in the decision loop not because the AI isn't capable, but because the stakes require it.
In 2020, LILT was awarded an AFWERX SBIR Phase II contract - a U.S. Air Force program designed to identify and scale dual-use technology. The contract tasked LILT with tracking international aerospace and cyber developments, translating foreign-language intelligence to inform Air Force decision-makers. The company later partnered with In-Q-Tel, the CIA's venture arm, to serve additional government agencies.
This is where Green's Northrop Grumman background becomes legible. He knew how to build software that governments trust - software with strict quality requirements, defined accountability structures, and consequences for failure. Defense translation isn't forgiving. A mistranslated document in an intelligence context is worse than no document.
LILT's public sector business now runs in parallel with its enterprise commercial division, serving Fortune 500 companies localizing websites, marketing content, software products, and customer support.
In 2021, LILT's research team - approximately 15 researchers split between SF and Berlin - won the Best Paper award from the North American Chapter of the Association for Computational Linguistics (NAACL). The paper described a neural automatic review and correction system designed to automate the reviewer step in translation workflows - the most expensive and time-consuming quality assurance bottleneck in the industry.
Academic translation research and commercial translation practice are historically disconnected. Green considers bridging that gap a core part of LILT's competitive advantage. The research team meets with the services team monthly - an intentional structure that prevents the company from drifting into academic irrelevance or commercial mediocrity.
Green has been blunt about the target: translation needs to cost roughly one dollar per piece before the problem is genuinely solved. Today's enterprise translation costs are prohibitive - which means the world's information is still siloed by language, still inaccessible to most of the people who need it.
This isn't altruism dressed up as a business plan. It's a market thesis. If translation costs drop far enough, the total addressable market expands from large enterprises with localization budgets to every organization that creates content. The potential is enormous. The technology path is difficult.
Green's view on AI hype is consistent with this pragmatism. He critiques claims about sentient machines and fully autonomous vehicles as distractions from the actually hard problems - like making it cheap and reliable for a hospital in Indonesia to publish accurate patient instructions in Bahasa. The stakes are high. The solutions need to be real.
"The Last Mile" - Green's podcast hosted for LILT - takes its name from the hardest part of translation: not the machine pass, but the final human judgment that makes content actually work in context. The show explores AI adoption across industries, featuring conversations with executives and technologists navigating the shift.
It's also a signal about where Green places LILT in the market: not as a translation vendor, but as a voice in the broader conversation about how enterprises implement AI in workflows that matter. The company spent years building before the GenAI wave arrived. Green's bet - that human oversight plus machine efficiency, not automation alone, is the right architecture for high-stakes language work - now looks prescient.
The world belongs to the discontented.
Machine translation has a higher impact on humanity than many other speculative AI applications.
Just keep pushing.
The system is getting eight out of ten words right that are being predicted.
Our goal is to build a solution that combines the best of human ingenuity with machine efficiency.
Language is not just a tool but a bridge to cultural understanding.
Until we can get that down to a dollar or something like that, we have not solved the problem.
LILT's research team won Best Paper from the North American Chapter of the Association for Computational Linguistics for a neural auto-review and correction system.
AFWERX SBIR Phase II award to provide language translation for air, space, and cyber operations - a rare government validation for an AI startup.
LILT included in the 2022 Gartner Market Guide for AI-Enabled Translation Services, a key enterprise analyst endorsement.
Led LILT through Seed, Series A, B, and C rounds with backing from Sequoia Capital, Intel Capital, Redpoint Ventures, and Four Rivers.
Partnered with the CIA's investment arm to serve U.S. intelligence community clients - a signal of LILT's trust and security credentials.
Built direct integrations into enterprise content management systems, code repositories, and workflow tools - eliminating the traditional translation management layer.
Green began learning Arabic after 9/11 - not as a career strategy, but to understand a world that suddenly felt harder to read. He became fluent. That instinct toward comprehension became LILT's mission.
Before AI startups, he wrote avionics software and a national air defense system at Northrop Grumman. That background explains why the U.S. military actually trusts him with translation intelligence.
He graduated from the University of Virginia with "highest distinction" - the equivalent of summa cum laude. Then went to Stanford and did it again.
His Stanford advisors were Chris Manning (world-leading NLP researcher) and Jeff Heer (HCI pioneer). That unusual combination - language and interface - is basically LILT's product philosophy in two names.
LILT's 15-person research team split across San Francisco and Berlin has published award-winning papers while running a commercial translation platform. Most research teams do one or the other.
Green named his podcast "The Last Mile" after the hardest part of translation: not the machine's first pass, but the final human judgment that makes content actually land. It's also a metaphor for every problem he finds interesting.
He credits Peter Drucker and Andy Grove with teaching him how to be a manager. That's not a common answer from a Stanford PhD. It suggests he actually read the books.
LILT was incorporated in March 2015 and described itself as "an AI company at its core" from day one - about six years before "AI company" became the default startup description.
Green hosts "The Last Mile," a podcast featuring business and technology leaders navigating the transition to AI-powered operations. Episodes explore everything from AI in marketing and design to the future of global enterprise communication. Recent guests have included investors like Tomasz Tunguz and AI product leaders from companies like Canva.