Breaking
NEW YORK — HoneyHive ships the observability layer for production AI agents $7.4M total seed raised, led by Insight Partners Running in production at CBA, NVIDIA, MongoDB and Pinecone OpenTelemetry-native across 100+ LLMs and agent frameworks Requests logged grew 50x in a single year Founded by Columbia roommates Mohak Sharma & Dhruv Singh NEW YORK — HoneyHive ships the observability layer for production AI agents $7.4M total seed raised, led by Insight Partners Running in production at CBA, NVIDIA, MongoDB and Pinecone OpenTelemetry-native across 100+ LLMs and agent frameworks Requests logged grew 50x in a single year Founded by Columbia roommates Mohak Sharma & Dhruv Singh
Company Profile · AI Infrastructure
HoneyHive logo

HoneyHive.

The observability layer for production AI agents - it records not just what an agent said, but how it got there, and where it broke.

A logo, a bank, and a very specific promise. Photographed here as the mark of a 19-person company that talked its way into a Global Top 10 bank's production stack - and now watches the agents there like a flight recorder watches a cockpit.

Est. 2022 New York, USA AI Observability Seed · Insight Partners
$7.4M
Total Seed Funding
2022
Founded
~19
Team Members
50x
Logging Growth / Yr
The Story

A Flight Recorder for the Age of Agents

There is a genre of corporate anxiety that goes like this: the demo worked, everyone clapped, and then you put the thing in front of real customers and it did something you cannot explain. In ordinary software this is a bug, and bugs have stack traces. In AI agents - systems that reason across many steps, call tools, retrieve documents, and improvise - the failure is often stranger. The output looks plausible. It is also wrong. And nobody in the room can tell you why.

HoneyHive is a company built almost entirely around that specific discomfort. Its pitch, stripped of jargon, is that if you are going to let a language model make decisions inside a business process, you should be able to watch it think. Not the marketing version of watching - a dashboard with a green light - but the forensic version: every step the agent took, every tool it called, every place the reasoning went sideways, laid out as a trace you can actually read.

The company calls itself "the observability layer for production agents," which is the kind of phrase that sounds like plumbing until you realize plumbing is exactly the point. The interesting AI companies of this cycle are not all building models. Some of them are building the boring, load-bearing infrastructure that lets everyone else's models be trusted with something that matters - a loan decision, a customer's money, a medical form. HoneyHive is firmly in the second camp, and it seems to prefer it there.

The Observability Layer for Production Agents.
— HoneyHive's own tagline, which is refreshingly literal

Two roommates and a shared complaint

The founders, Mohak Sharma and Dhruv Singh, met as roommates at Columbia and, in the manner of people who become co-founders, kept circling back to the same problem. Sharma had gone off to build data and AI platforms as a product manager at Templafy. Singh had a rarer vantage point: he worked as a software engineer on the OpenAI Innovation team at Microsoft, which is to say he saw the raw unpredictability of large language models early and up close. Both walked away with the same conviction - that shipping these systems to production was terrifying, and that the terror was fixable with better tooling.

That is a useful origin story because it explains the product's temperament. HoneyHive is not trying to make AI more magical. It is trying to make AI more accountable, which is a less glamorous ambition and a more durable one. The team it assembled reflects that: seasoned engineers from Microsoft, Amazon, JP Morgan, and Patreon - people who have shipped real systems and watched them fail in real ways.

Observability, but for things that reason

Traditional observability - the Datadog-and-Grafana world - was built for software that behaves the same way every time. You send a request, you get a response, and if something breaks there is a log line telling you where. Agents break that assumption. The same prompt can produce different outputs. Failures are semantic, not just technical. A trace that only shows you inputs and outputs is like a black box recorder that only captured takeoff and landing.

HoneyHive's answer is to instrument the whole journey, and to do it on OpenTelemetry - the open industry standard for telemetry data. This is a quietly strategic choice. Building on an open standard rather than a proprietary format is the difference between asking an enterprise to trust a black box and handing them something they can inspect, self-host, and audit. For a company that wants to sell into banks, that distinction is the entire sales conversation.

The Product

What You Can Actually Do With It

HoneyHive folds two things that usually live in separate tools - watching agents in production and testing them offline - into one continuous loop. Trace, experiment, monitor, annotate, repeat.

See it

Distributed Tracing

OpenTelemetry-native tracing across 100+ LLMs and agent frameworks. Every step of the decision journey, captured and readable.

Test it

Experiments

Offline evaluation against curated datasets with custom evaluators, regression detection, and CI/CD integration.

Watch it

Monitoring & Alerts

Real-time online evaluation and drift detection that pages you when an agent starts misbehaving in production.

Judge it

Annotation Queues

Human-in-the-loop review that lets domain experts - not just engineers - shape what "correct" means.

Version it

System of Record

Versioning for traces, prompts, tools, datasets, and evaluators across the entire Agent Development Lifecycle.

Replay it

Playground & Replays

Prompt experimentation and full session replays for root-cause analysis when something inevitably goes wrong.

In Production

Watching Agents Where the Stakes Are Real

HoneyHive reports powering observability across dozens of mission-critical AI systems at Commonwealth Bank - a deployment that touches 17M+ consumers - alongside a roster of AI-forward enterprises.

Commonwealth Bank NVIDIA MongoDB Pinecone
The Money

Someone Sensible Wrote a Check

Here is a small, telling detail about this particular AI company: the money came for the unsexy part. In April 2025 HoneyHive announced its public launch and $7.4M in total seed funding led by Insight Partners, with George Mathew joining the board. The venture math on this is straightforward and slightly contrarian - Insight did not back a flashy generative play, it backed the tooling that makes generative plays safe enough to deploy. If you believe agents are going into production everywhere, then whoever sells the seatbelts has a good decade ahead.

The round breaks into two parts: a $5.5M seed led by Insight with 468 Capital and MVP Ventures, and a previously unannounced $1.9M pre-seed led by Zero Prime Ventures. The angel list is a nice tell about who takes this problem seriously - Jordan Tigani, CEO of MotherDuck, and Savin Goyal, CTO of Outerbounds. These are people who build data infrastructure for a living, which is roughly like getting endorsed by other plumbers.

$5.5M
Seed · Apr 2025
Led by Insight Partners. With participation from Zero Prime Ventures, 468 Capital, and MVP Ventures. George Mathew of Insight Partners joined the board.
$1.9M
Pre-Seed
Led by Zero Prime Ventures. With AIX Ventures, Firestreak Ventures, and angels Jordan Tigani (MotherDuck) and Savin Goyal (Outerbounds).

Figures per company announcements and press coverage (PRNewswire, AlleyWatch, FINSMES), April 2025. Revenue and headcount figures elsewhere on this page are third-party estimates and should be treated as approximate.

The People

Who Built It

MS

Mohak Sharma

Co-Founder & CEO

Former product manager at Templafy, where he built data and AI platforms before turning his frustration with unpredictable AI into a company.

DS

Dhruv Singh

Co-Founder & CTO

Former software engineer on the OpenAI Innovation team at Microsoft, where he got an early, up-close look at how LLMs actually behave.

The Angle

What Makes HoneyHive Different

Open by default

OpenTelemetry-Native

Not a proprietary black box. Built on the open standard, which is exactly why regulated buyers can self-host and audit it.

One loop

Observability + Evaluation

Most tools pick monitoring or testing. HoneyHive refuses to, folding both into a single continuous improvement loop.

Not just engineers

Domain Experts Included

Annotation queues let the people who actually know what "correct" means review the same traces engineers debug.

Deployable

Enterprise-Grade Privacy

SOC 2 Type II, GDPR, and HIPAA, with logically isolated data planes, granular RBAC, and self-hosting options.

Semantic

The Decision Journey

It shows how a conclusion was reached and where systems falter - not just inputs and outputs, but the reasoning between them.

Small & deep

A 19-Person Bench

Seasoned engineers from Microsoft, Amazon, JP Morgan, and Patreon - people who have shipped and watched things break.

Watch & Explore

Demos & Interviews

Search HoneyHive on these channels for product walkthroughs, founder talks, and platform demos.

Margin Notes

Things Worth Knowing