Upstage

On a quiet Tuesday in July 2025, the analyst firm Artificial Analysis updated its public leaderboard. Buried among the usual suspects - OpenAI, Anthropic, Google, Meta - a new name appeared in the "Frontier Model" tier. The name was Solar Pro 2. The company behind it, Upstage, has a US address on North 1st Street in San Jose and a much older one in Seoul. Until that update, almost no one outside Korea had heard of it. Now its 31-billion-parameter model was sitting five points above GPT-4.1 on the Intelligence Index, and the AI industry was suddenly paying attention to a Korean startup with roughly 150 employees.

If that sounds like a Hollywood arc, the reality is more grinding. Upstage didn't arrive at the frontier by accident. It got there by picking a fight that nobody else seemed interested in - building enterprise-grade AI that big regulated companies could actually deploy - and refusing to let go for five straight years.

The plan was never to be OpenAI. The plan was to be the model your bank's compliance officer trusts at 4pm on a Friday. - the working theory inside Upstage HQ

Chapter OneThe problem they saw

Enterprise AI has a paperwork problem. Insurance carriers run on PDFs. Banks run on PDFs. Hospitals run on PDFs, prescriptions scribbled in three different scripts, and the kind of multi-page forms that make grown adults reach for coffee. Every large language model demo in the past three years has politely ignored this fact, opting instead for poetry and trivia. The companies that actually have money to spend on AI, meanwhile, have entire departments whose job is to retype scanned documents into spreadsheets.

The founders of Upstage - Sung Kim, Hwalsuk Lee, and Lucy Park - had spent years at Naver Clova, the AI lab inside Korea's largest internet company, watching this exact friction play out at scale. Sung Kim had been a computer-science professor at Hong Kong University of Science and Technology before that, the kind of academic who spends more time deploying things than publishing about them. Hwalsuk Lee led Naver's visual-AI team, which mostly meant OCR. Lucy Park led modeling on Papago, Naver's translation product. Three people whose entire professional lives had been spent watching messy documents and language collide.

They left in 2020 and started Upstage with what looked, at the time, like a niche thesis: the next decade of enterprise AI was going to be won by whoever could do the boring parts well.

Everyone wants to build the brain. Almost nobody wants to build the eyes.

Chapter TwoThe founders' bet

There is, of course, a particular brand of irony in three machine-learning researchers deciding their differentiator would be reading bad PDFs. But that's exactly what they bet on. The early Upstage product wasn't a chatbot. It was Document AI - an OCR-and-extraction engine that ingested forms, contracts, and scans and produced clean, structured JSON on the other end. Tables intact. Charts read. Korean and Japanese and English, all in the same document if that's what you needed.

For about two years they were that company. Korean insurers signed on. Then a few banks. Then, somewhat improbably, Samsung. In 2023, with the LLM gold rush in full swing, Upstage made a second bet that should have been a distraction and instead turned out to be a fulcrum: they built their own language model. They called it Solar.

The Two-Product Theory

Most AI startups pick a lane. Upstage picked two and bolted them together. Document AI extracts the structure. Solar reasons over it. The combined pitch - "give us your worst paperwork, get back decisions" - is what closes deals nobody else is closing.

Diagram described in words because the actual architecture is, mercifully, in production.

Solar's distinguishing feature, then and now, is its size. The first version landed at 10.7 billion parameters - small enough to run on a single GPU, large enough to be useful. The thesis was simple and slightly contrarian: enterprises with regulated data don't want to send their documents to OpenAI. They want models they can deploy in their own data centers, on their own hardware, behind their own firewalls. Smaller models with sharper benchmarks would win those deals.

That bet aged well. Solar Pro 2, released in July 2025, weighs in at 31 billion parameters - still less than half the size of the frontier-class behemoths it now competes with. And yet on the Artificial Analysis Intelligence Index, it posts a score of 58. GPT-4.1 sits at 53. The benchmark community took a moment to recheck the numbers. They held.

Small is not a limitation. Small is a feature for everyone who has ever read a SOC 2 report. - the unspoken pitch to every CISO

Solar Pro 2 vs. the field

Artificial Analysis Intelligence Index, July 2025

Solar Pro 2 (31B)

GPT-4.1

DeepSeek R1

Mistral Small 3.2

Qwen 3

Solar Pro 2 enters the "frontier" tier as the first Korean-developed model. Numbers are approximate and shift week to week, as benchmarks tend to do.

Chapter ThreeThe milestones

Five years, three founders, one frontier model

2020

Sung Kim, Hwalsuk Lee, and Lucy Park leave Naver Clova and incorporate Upstage in Seoul. The first product is Document AI.

2022

Series A closes around $28M. Early enterprise wins with Korean insurers and banks.

2023

First Solar LLM (10.7B) released. Tops a Hugging Face leaderboard. Samsung becomes a customer.

2024

Series B of $72M led by SK Networks and Shinhan Venture Investment. The US office in San Jose opens.

2025 (Jul)

Solar Pro 2 released. Artificial Analysis classifies it as a frontier model - a first for Korea.

2025 (Aug)

$45M Series B bridge closes. Amazon and AMD join Korea Development Bank on the cap table. Total raised crosses $157M.

2026

Solar Pro 3 announced, with deeper reasoning at production scale. US expansion accelerates.

Chapter FourThe proof

Benchmarks are pretty. Revenue is real. The case Upstage has been quietly assembling looks something like this: Samsung is using their document intelligence. A majority of the largest Korean insurers run claims through their models. Korean banks lean on them for KYC document processing. AWS lists their Document AI on its Marketplace and, more recently, on the AWS AI Agent Marketplace - the new shelf where Amazon expects enterprise buyers to actually go shopping for AI agents.

Amazon also bought equity in Upstage in 2025, as did AMD. That is unusual. Neither company has a habit of investing for the cocktail-party value. Amazon wants Solar on Bedrock-adjacent rails for enterprise customers who want a model that can read a 300-page insurance contract without losing track of clause 14.2(b). AMD wants Solar trained on AMD silicon. The Series B bridge round of $45M pushed total funding past $157M and put two of the largest names in cloud and compute on the same cap table.

When Amazon and AMD show up in the same round, something architectural is happening underneath. - a venture investor, off the record

Solar LLM

31B parameter frontier model. Hybrid Chat / Reasoning mode. Multilingual fluency in Korean, Japanese, English.

Document Parse

OCR plus structure understanding. Tables, charts, multi-page logic. Outputs clean JSON.

Information Extract

Schema-aware extraction. Pull exactly the fields you specify. Built for claims, contracts, medical forms.

Solar Console

Developer platform. Fine-tune Solar, deploy in cloud, or take it on-prem behind your firewall.

Chapter FiveThe mission

Ask the founders what Upstage is for and you get an answer that is shorter than you'd expect. They want to make AI beneficial. Not transformative, not revolutionary - beneficial. The phrasing is deliberate, and slightly unfashionable in a market that prefers manifestos. The implied argument is that most AI hype is allergic to the small print, the regulated industries, the parts of the economy where AI has to be both good and provably good. That gap is where Upstage lives.

There is a culture clue in how they ship. The company publishes openly on Hugging Face. It competes on public leaderboards. It loses some of those competitions, and it talks about them anyway. There is something refreshing about a frontier-model lab that still acts a little like a research group that hasn't quite caught on to its own importance.

AI that is good is interesting. AI that is provably good is bankable.

Chapter SixWhy it matters tomorrow

The conventional take on enterprise AI in 2026 is that the foundation-model layer will be a winner-take-most market dominated by three or four American labs. Upstage's counter-take is that this is true for chatbots and false for everything else. The places where AI will actually get deployed - regulated, on-prem, multilingual, paperwork-heavy - are the places where size, sovereignty, and document literacy matter more than benchmark sheen.

If that read is right, the company most American observers haven't heard of is going to keep showing up in the corners of the market that have the highest dollar density and the lowest media coverage. The deals will be quiet. The growth will be ugly-pretty - cohorts of insurance companies, hospital networks, manufacturers. The kind of revenue you can't tweet about.

Which brings us back to that Tuesday in July, and the analyst firm that quietly added a fifth name to its frontier list. The interesting thing about that update wasn't Solar Pro 2's score. The interesting thing was who put it there, and where they were from, and what they planned to do with it. Five years ago, three people walked out of Naver to read PDFs for a living. The PDFs are still being read. The model that reads them now happens to be one of the best in the world.

The frontier moved. It just didn't move in the direction most people were looking.

Upstage vs.
the giants.

Chapter OneThe problem they saw

Chapter TwoThe founders' bet

Solar Pro 2 vs. the field

Chapter ThreeThe milestones

Five years, three founders, one frontier model

Chapter FourThe proof

Solar LLM

Document Parse

Information Extract

Solar Console

Chapter FiveThe mission

Chapter SixWhy it matters tomorrow

Where to find Upstage

Quick facts: Upstage

Upstage vs.the giants.

Chapter OneThe problem they saw

Chapter TwoThe founders' bet

Solar Pro 2 vs. the field

Chapter ThreeThe milestones

Five years, three founders, one frontier model

Chapter FourThe proof

Solar LLM

Document Parse

Information Extract

Solar Console

Chapter FiveThe mission

Chapter SixWhy it matters tomorrow

Share Upstage

Where to find Upstage

Quick facts: Upstage

Upstage vs.
the giants.