Breaking

Distributional closes $19M Series A led by Two Sigma Ventures Industry's first dedicated enterprise AI testing platform debuts a16z and Operator Collective double down Behavioral drift is now a measurable engineering problem Founder Scott Clark: SigOpt → Intel → Distributional Five PhDs on a 25-person team Distributional closes $19M Series A led by Two Sigma Ventures Industry's first dedicated enterprise AI testing platform debuts a16z and Operator Collective double down Behavioral drift is now a measurable engineering problem Founder Scott Clark: SigOpt → Intel → Distributional Five PhDs on a 25-person team

Volume I · Issue 001 · The AI Reliability Beat

Distributional.

The enterprise AI testing platform turning the slippery, non-deterministic behavior of large language models into something an engineering team can actually measure, alert on, and fix.

San Francisco · Palo Alto Founded 2023 Series A · $19M Enterprise AI

The team. Roughly twenty-five people, five PhDs, alumni of Intel, SigOpt, Yelp, Google, Meta, Bloomberg. Photographed, probably, on a day someone's agent stopped misbehaving.

A scene, mid-2026

Somewhere, an agent is misbehaving.

Not crashing. Not throwing an exception. Just - drifting. Yesterday it summarized contracts in three crisp paragraphs. Today it adds a fourth, and the fourth one is slightly wrong. Nobody on the platform team will notice for nine days. Distributional notices in nine minutes.

The company sells what nobody quite had a name for two years ago: a test suite for software that does not return the same answer twice. Their thesis is unglamorous and overdue. Generative AI broke the unit test. The deterministic assertion - assertEqual(output, "expected") - was an artifact of a deterministic world. Probabilistic software needs probabilistic tests. Distributional builds them.

Origin

The Intel detour that became the company.

Scott Clark co-founded SigOpt in 2014 - a Bayesian optimization platform for tuning models when nobody used the word "model" in polite company. Intel bought it in 2020. Clark stayed, ran a 200-person AI software organization, watched the generative wave arrive, and noticed a pattern: enterprises wanted to ship LLMs, but they had no idea how to prove the things were behaving.

So in September 2023, he started Distributional with six co-founders. By February 2024, Andreessen Horowitz and Operator Collective had written an $11M seed check. By October, Two Sigma Ventures led a $19M Series A. Total raised in under a year: $30M. Total time spent explaining what AI testing means: still ongoing.

The Founder

Scott Clark

Co-founder & CEO. Cornell math, PhD in applied mathematics from Cornell. Software lead in Yelp's ad-targeting era. Co-founder of SigOpt. Former VP/GM at Intel. Now: making AI behave.

The Cap Table

Who's on board

Two Sigma Ventures (lead, Series A). Andreessen Horowitz. Operator Collective. Oregon Venture Fund. Essence VC. Alumni Ventures. Plus the angels you'd expect when an Intel alum starts shipping.

What it does

Three parts, one bet.

The product is a platform - self-hosted in your VPC, multi-tenant SaaS, or single-tenant. A Python SDK plugs it into your CI, your orchestrator, your data store, your alerting. What it provides, in plain English, is three things working together.

01 · Framework

Extensible tests

Collect production traces. Augment the data. Write tests on distributions, not single outputs. Alert when the distribution shifts. Triage. Resolve.

02 · Dashboards

Test repositories

Collaborate on tests, analyze results, calibrate thresholds, capture audit trails, and produce the governance reports your compliance team has been quietly asking about.

03 · Automation

Adaptive calibration

A preference-learning loop tunes data augmentation, test selection, and calibration. The test suite gets smarter as the model changes - which it will.

Why this exists

Behavioral drift is the bug you can't see.

Drift used to mean a feature distribution shifting in training data - a problem MLOps tools had reasonable tooling for. Generative AI inflicted a more uncomfortable variant: the model still works, the prompts still land, and the outputs are still grammatical - but they have started, somehow, to be worse. A little more hedged. A little less accurate. A little more inclined to invent regulatory citations.

You cannot detect this with logs. You cannot detect it with unit tests. You can detect it by treating model behavior as a distribution and watching how that distribution moves over time. That is, essentially, the company's whole pitch, and so far the Fortune 500 has been picking up the phone.

Customer Type

Fortune 500 platform teams

Companies with operational or reputational risk attached to generative AI applications - which, by 2026, is most of them.

Deployment

Built for compliance teams

VPC self-host, single-tenant VPC, or multi-tenant SaaS. Pick the option that lets your security review go home on time.

Timeline

A short company, in moments.

Sep 2023

Founded

Scott Clark and six co-founders incorporate Distributional after Clark leaves Intel.

Feb 2024

$11M Seed

Andreessen Horowitz and Operator Collective back the bet on a new category.

Oct 2024

$19M Series A

Two Sigma Ventures leads. Platform launches publicly. The category has a name.

2025

Team scales

Headcount grows toward 25; research and platform engineering deepen in parallel.

2026

The current scene

Somewhere, an agent is misbehaving. Distributional is watching the distribution.

For practitioners

What you can actually do with it.

Ship an LLM app and sleep

Wire Distributional into your CI. Define behavioral tests. Get paged when production drifts from the baseline you signed off on.

Debug an agent that started lying

Use DBNL's adaptive analytics on production logs to surface where the agent's behavior changed - and which tool, prompt, or model version is responsible.

Test a new model version safely

Run side-by-side behavioral tests when migrating from one model to another. Catch the regressions a benchmark will miss.

Hand your governance team a paper trail

Audit trails, test outcomes, calibration history - the artifacts a compliance review actually wants to see.

A scene, mid-2026, again

Somewhere, an agent is still misbehaving.

But this time the platform team knows. A test fired at 09:14 - the distribution of contract-summary lengths shifted by 1.8 standard deviations from baseline. By 09:23 someone is in the dashboard, watching which production traces tripped the alert. By 10:01 the team has rolled back the prompt change that did it. The fourth paragraph, the slightly-wrong one, never makes it to the next customer.

That is the scene Distributional is selling. Not the absence of failure - generative AI will fail - but the closing of the gap between the moment the model misbehaves and the moment a human can do something about it. The deterministic test suite was an invention of the 1990s. The probabilistic one is being written now, in San Francisco, by a small team with a Greek letter or two on the whiteboard.

The agent never stops drifting. The point is what happens next.

Distributional.

Somewhere, an agent is misbehaving.

A young company, in receipts.

The Intel detour that became the company.

Scott Clark

Who's on board

Three parts, one bet.

Extensible tests

Test repositories

Adaptive calibration

Behavioral drift is the bug you can't see.

Fortune 500 platform teams

Built for compliance teams

A short company, in moments.

Founded

$11M Seed

$19M Series A

Team scales

The current scene

What you can actually do with it.

Ship an LLM app and sleep

Debug an agent that started lying

Test a new model version safely

Hand your governance team a paper trail

Somewhere, an agent is still misbehaving.

The links worth keeping.

Distributional.

Somewhere, an agent is misbehaving.

A young company, in receipts.

The Intel detour that became the company.

Scott Clark

Who's on board

Three parts, one bet.

Extensible tests

Test repositories

Adaptive calibration

Behavioral drift is the bug you can't see.

Fortune 500 platform teams

Built for compliance teams

A short company, in moments.

Founded

$11M Seed

$19M Series A

Team scales

The current scene

What you can actually do with it.

Ship an LLM app and sleep

Debug an agent that started lying

Test a new model version safely

Hand your governance team a paper trail

Somewhere, an agent is still misbehaving.

The links worth keeping.

Share this profile