The engineer in the room
Snorkel AI has three types of co-founders: the researchers who conceived the academic work, the operators who scaled the business, and the engineers who built what customers actually use. Ehrenberg sits squarely in that third category, serving as Head of Engineering and responsible for the technical strategy that keeps Snorkel's platform competitive in one of the fastest-moving markets in technology.
The engineering challenge at Snorkel isn't trivial. The company's platform needs to work for a compliance team at a bank, a clinical NLP team at a hospital system, a document intelligence team at an insurer, and a research lab training foundation models - each with different data schemas, security requirements, deployment constraints, and evaluation criteria. Building software that works across that range without becoming a mess of enterprise configuration switches is genuinely hard.
Ehrenberg's approach has been to stay close to the research side of the company while managing a production engineering organization. He has co-authored blog posts on enterprise GenAI evaluation, contributed to thinking about specialized evaluators as scalable proxies for subject matter experts, and maintained the academic rigor that set Snorkel apart from less principled competitors.
The Data-Centric AI Thesis
While the industry debates model architectures and parameter counts, Ehrenberg's team argues that the quality and structure of training data is the primary determinant of whether an AI system works in production. Snorkel's growth from research project to unicorn is the empirical evidence for that claim.
He spoke at the AI & Big Data Expo North America in 2024, bringing the company's perspective on where enterprise AI actually gets stuck - not at inference time, not at deployment, but at the training data pipeline. It's a message that resonates differently when it comes from someone who co-authored the research papers that started the conversation.
His GitHub handle is "henryre" - a small nod to his middle name, a tiny detail that hints at the person behind the LinkedIn profile. He joined Twitter in February 2013, well before Snorkel AI existed, and keeps a relatively quiet presence. The work speaks. His Google Scholar profile shows a researcher who went into industry without abandoning the rigor that comes with peer review.
In 2022, his alma mater University Prep in Seattle featured him in their alumni magazine as one of four "Creative Thinkers" among their graduates. It's the kind of recognition that matters more than a TechCrunch headline: the school that knew him before any of this is still proud enough to say so.