Company Dossier · AI Data Solutions

Thoth.AI

"Data That Makes AI Work."

The quiet workforce behind the loud machines - collecting, labeling, and stress-testing the data that turns raw models into systems you can actually ship.

EST. SINGAPORE ~880 PEOPLE 200+ LANGUAGES 100+ COUNTRIES

EXHIBIT A - The mark of Thoth AI. The company borrows its name from the ancient Egyptian god of writing and measurement. The job description hasn't changed much: record the world, then measure it.

The Lead Story

Somewhere right now, a model is failing - and Thoth AI is the reason it won't.

It is 3 a.m. in three different time zones at once. A chatbot in Manila is being taught the difference between sarcasm and a complaint. A driving model in Sapporo is being shown the same pedestrian a thousand times, in rain, at dusk, behind a bus. A speech engine in Lisbon is learning to hear an accent it has never met. None of this is glamorous. None of it trends. And all of it is the actual work of making artificial intelligence behave.

This is the world Thoth AI lives in. The company does not build the headline-grabbing model. It builds the thing the model is starving for - clean, labeled, edge-case-rich data, plus the human judgment to tell the machine when it is wrong. In an industry obsessed with bigger brains, Thoth AI sells better food.

"The model is only as good as its data." Thoth AI took that cliche and turned it into a payroll of roughly 880 people.

Headquartered in Singapore with research operations planted in Silicon Valley, Thoth AI is a global AI data solutions company. Translation: when an AI team needs millions of images annotated, a dataset assembled in Yoruba, a model evaluated for the ways it quietly breaks, or a flood of user content moderated before it does damage - Thoth is who they call. The work spans images in 2D and 3D, video, 3D point cloud, text, audio for speech recognition and synthesis, machine-translation post-editing, search relevance, and OCR. If a model needs to read it, Thoth helped label it.

There is a polite fiction in artificial intelligence that the machines taught themselves. They did not. Behind every fluent answer is a mountain of human decisions about what counts as right, polite, dangerous, or true - decisions made by people in offices that rarely appear in the keynote. Thoth AI is built on that unfashionable truth. It does not pretend the data assembles itself. It hires the humans, trains them, routes the work across time zones, and runs quality control on the output as if the model's reputation depended on it - because, increasingly, it does.

The Operation

Five jobs, one obsession: data you can trust

Annotation

Data Collection & Annotation

Collect, clean, and label images, video, 3D point cloud, text, and audio (ASR/TTS) - plus MTPE, search relevance, POI, AR/VR, and OCR - across 200+ languages.

Alignment

Generative AI & RLHF

Reinforcement learning from human feedback, supervised fine-tuning, and DPO - the human nudges that align large language and multimodal models.

Evaluation

Model Evaluation

Benchmarks and evaluation frameworks for LLMs, VLMs, and multimodal systems, built around the edge cases where models quietly fall apart.

Safety

Trust & Safety

Content moderation, fraud prevention, and risk management - guarding the messy edges of AI products and platforms.

Experience

CX Management

Multilingual customer service, product testing, and global system implementation and operation support.

Reach

Applied AI & Robotics

Datasets and evaluation for real-world systems - including robotics - where a labeling error becomes a physical one.

Why It Matters

The unglamorous part is the whole game

The Mission

To help organizations build and deploy reliable AI systems by providing high-quality data, evaluation, and operational support that meet production requirements.

Quality & Reliability - the product is trust
Practical Innovation - useful beats clever
Responsible Execution - safety is a line item
Global Delivery Excellence - everywhere, on time

The Edge

Anyone can label a cat. The hard money is in the long tail - Yoruba, Quechua, Tigrinya, Hmong, and the one weird frame out of ten thousand where a model would have crashed. Thoth AI's pitch is coverage of the cases nobody else wants to scope.

Pair that with evaluation - measuring exactly where a model breaks - and you have a company that profits from being honest about AI's failures rather than hyping its wins.

Consider the economics the company writes about openly. Annotating a high-resource language is a solved problem; there are oceans of English text and an army of annotators. Annotating a low-resource language is a different animal - fewer speakers, fewer reference materials, more ambiguity per sentence, and a cost curve that bends sharply upward. Thoth AI's own field notes describe the gap in stark multiples. Most vendors avoid that work because the margins are thin and the coordination is hard. Thoth treats it as the moat. The harder the language, the fewer competitors can do it well, and the more a serious AI lab will pay to have it done right.

The same logic applies to evaluation. It is easy to demo a model that dazzles. It is hard - and unglamorous - to map the exact conditions under which it embarrasses you. Thoth AI sells that map. Its evaluation frameworks for LLMs, VLMs, and multimodal systems are designed to surface the edge cases that production traffic will eventually find anyway, just less expensively and less publicly. In a market that rewards optimism, there is something almost subversive about a company whose product is rigor.

By The Numbers

Where the work lands

An illustrative breakdown of Thoth AI's modality mix, drawn from its stated service lines. Figures are approximate and meant to show emphasis, not exact revenue share.

Vision (2D/3D)

~88

Text / NLP

~80

Audio (ASR/TTS)

~68

RLHF / Eval

~74

Trust & Safety

~60

The Players

Who runs it, who needs it

At the helm

Eric Zhang - Chief Executive Officer
HQ - Singapore
R&D - Silicon Valley
Backing - Green Union Capital

A globally distributed team across Manila, Jakarta, Hanoi, Bangkok, Madrid, Los Angeles, Lisbon, Kuala Lumpur, Sapporo, Istanbul, Seoul, Beijing, and more.

Who it serves

AI labs, model builders, and enterprises shipping LLMs, VLMs, multimodal systems, and applied AI like robotics - across e-commerce, healthcare, finance, and consumer internet.

The competitive set: Scale AI, Appen, Sama, Surge AI, iMerit, TELUS, and TaskUs. Thoth AI's wedge is multilingual depth plus evaluation, not just raw labeling volume.

From The Newsroom

Latest dispatches

2026 · JUN 18

"The 42x number that should change how you scope annotation projects" - a hard look at why low-resource language work costs what it costs.

2026 · JUN 11

"One Name, Two Jobs: The Quiet Split Happening Inside Data Labeling" - on how the labeling discipline is fracturing into two crafts.

2026 · JUN 05

"AI Text Detectors Fail in the Wild. Now We Know Why." - a field report on why detection breaks outside the lab.

2026 · MAY 27

"The Mirror Test for LLMs: Why Self-Detection Falls Apart Right When It Matters."

Watch & Listen

On the record

▶

Meet Eric, CEO of Thoth AIExclusive interview · Instagram

▶

Inside the data pipelineDemos & clips · @aitho.th

The Margins

Things you'd only know if you asked

NAMEThoth is the Egyptian god of writing, wisdom, and measurement - which is, more or less, the entire business plan.

RANGEThe work reaches languages most platforms ignore: Yoruba, Quechua, Tigrinya, Hmong.

CLOCKThe team spans more than a dozen cities and nearly every time zone - from Sapporo to Lisbon.

BRIDGESingapore HQ, Silicon Valley R&D - deliberately built between Asian operational scale and US model research.

The Closer

Back to 3 a.m.

Return to those three dark rooms. The chatbot in Manila now knows a complaint from a joke. The driving model in Sapporo has seen that pedestrian in every weather a city can throw at it. The speech engine in Lisbon finally hears the accent it kept missing. The work is still unglamorous. It still doesn't trend. But the models built on top of it are quietly more reliable than they were a shift ago - and that is the entire point.

Thoth AI's bet is simple and slightly contrarian: in a gold rush obsessed with the next bigger model, the durable business is in the data underneath. Measure the world carefully, label it honestly, and tell the machine the truth about where it fails. The god of writing would approve.

What makes the company worth watching is not a single breakthrough - it is the unflashy decision to industrialize judgment. Anyone can scale compute by writing a bigger cheque. Scaling human discernment across 200 languages, a dozen cities, and five service lines without the quality collapsing is a genuinely hard operations problem, and it is the one Thoth AI has chosen to make its own. For an AI team, that is the difference between a model that demos well and one that survives contact with real users. For Thoth, it is a business model that gets sturdier the more the rest of the industry sprints toward AGI - because the faster the models advance, the more they need someone keeping the data honest.

The loud part of AI gets the magazine cover. The quiet part - the data - gets the model to actually work.

The Rolodex

Find Thoth AI

WEBSITEaithoth.com LINKEDIN/company/aithoth X / TWITTER@AiThoth INSTAGRAM@aitho.th FACEBOOK/aiTHOTH BLOGField notes CONTACTinfo@aithoth.com CEOEric Zhang

Dossier compiled from public sources including aithoth.com, LinkedIn, and company blog posts. Figures marked approximate are illustrative. Some details (founding year, exact headcount, funding totals) are not publicly disclosed and were omitted rather than guessed.