Where 99% Accuracy Isn't Good Enough
There was a weapons-detection AI model that passed every benchmark thrown at it. Its aggregate accuracy score: 99%. In a live demo, it failed. A different model - one that scored only 97.2% overall - performed better on the exact scenarios that mattered. Mohamed Elgendy was in the room. He understood immediately that the industry wasn't just measuring poorly; it was measuring the wrong things entirely. Kolena was the answer he built.
Elgendy grew up in Egypt, studied systems and biomedical engineering at Cairo University, then spent a few years working in that field before a different pull took hold. He relocated to the United States, worked through a series of engineering roles - Independence Blue Cross, Aspen Dental Management, Yale University - and eventually landed in the architecture of large-scale software systems. He joined Twilio, then Amazon, where something specific happened that would shape the rest of his career.
At Amazon, Elgendy didn't just build AI products. He designed and taught a deep learning for computer vision course at Amazon's Machine Learning University. He ran Amazon's computer vision think tank. He wasn't only building - he was translating the mechanics of machine learning into language that teams could actually use. That instinct for translation would later produce a book that sold 20,000 copies.
"AI lacks trust from both builders and the public. The genie isn't going back in the bottle, but we can make sure we make the right wishes."
Mohamed Elgendy - Co-Founder & CEO, Kolena

After Amazon, he went to Synapse Technology Corporation as Head of Engineering, leading the development of a proprietary threat detection platform. Synapse was acquired by Palantir. Then came Rakuten, where he served as VP of Engineering for the AI Platform, building and managing the ML infrastructure for all of Rakuten Mobile's operations. By 2020, his book - "Deep Learning for Vision Systems" (Manning Publications) - was out in the world, reaching engineers who needed the concepts explained in terms they could implement.
In 2021, Elgendy co-founded Kolena with Andrew Shi (CTO) and Gordon Hart (CPO). The founding insight was sharp: software engineering had spent decades developing rigorous testing methodologies - unit tests, regression tests, scenario coverage. Machine learning had essentially borrowed the concept of accuracy and stopped there. Kolena's pitch was to port the entire discipline of software testing into the AI development lifecycle.
"This is testing on steroids," Elgendy has said. "Not just ticking boxes, but diving deep into the nuances of model accuracy." The platform lets teams test AI models at the scenario level - not just asking whether a model is accurate overall, but how it performs on the specific slices of data that matter for the actual use case. A healthcare AI should be tested on the edge cases a clinician encounters. An autonomous vehicle model should be tested on the specific road conditions where it will operate. Aggregate metrics lie by omission.
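The arithmetic behind "aggregate metrics lie by omission" is easy to demonstrate. The sketch below is a hypothetical illustration (not Kolena's API, and the scenario names and counts are invented): a toy test set where one small but critical slice fails badly while the headline number still looks excellent.

```python
# Hypothetical toy example: aggregate accuracy hides a slice failure.
# Each scenario maps to a list of per-case outcomes (True = correct).
predictions = {
    "clear_view":    [True] * 95,                          # 95 easy cases, all correct
    "occluded_item": [True, False, False, False, False],   # 5 hard cases, only 1 correct
}

def accuracy(results):
    """Fraction of correct outcomes in a list of booleans."""
    return sum(results) / len(results)

# Headline metric pools everything together.
all_results = [r for slice_results in predictions.values() for r in slice_results]
print(f"aggregate:     {accuracy(all_results):.0%}")   # 96% - looks fine

# Scenario-level view exposes the failure.
for name, results in predictions.items():
    print(f"{name}: {accuracy(results):.0%}")          # occluded_item: 20%
```

A model scoring 96% overall can still be wrong four times out of five on the slice that matters most - exactly the gap between the 99% model and the 97.2% model in the demo Elgendy witnessed.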
Kolena's clients include Fortune 500 companies, government organizations, European AI standardization institutes, and startups in robotics, healthcare, autonomous vehicles, and banking. By September 2023, the company had raised $15M in a Series A led by Lobby Capital, with participation from SignalFire and Bloomberg Beta - bringing total funding to $21M. At the time of the raise, Kolena had 28 full-time employees.
Elgendy has articulated a framework for thinking about AI trust that divides the problem into three distinct communities: builders (who lack the testing tools and visibility to trust their own systems), buyers (who are misled by aggregate accuracy metrics that hide critical failures), and regulators (who have no pre-deployment validation frameworks to work from - what he compares to "the FDA doing away with clinical trials"). Kolena's ambition is to close all three gaps simultaneously.
Beyond the platform, Elgendy co-founded AIQCON - the AI Quality Conference - with MLOps Community. The inaugural event drew more than 10,000 attendees: AI builders, investors, regulators, and journalists, all gathered to wrestle with what responsible AI deployment actually requires. Creating a conference wasn't a marketing exercise. It was Elgendy doing in public what he has always done: turning a hard problem into a shared discipline.
He works 6 AM to 5 PM daily, a discipline he credits as foundational. He identifies trustworthiness and consistency as instrumental character traits. He credits his wife Amanda El-Dakhakhni as his primary support system. These are the kinds of details that rarely appear in TechCrunch coverage but tend to explain a lot about how someone builds something real over time.
Mohamed Elgendy's bet is that the AI industry is about to face exactly what the weapons-detection model faced: the moment when headline metrics meet the real world, and the gap becomes impossible to ignore. He is building the infrastructure to close that gap before it causes serious harm. Whether you call that product-market fit or moral clarity may depend on your vantage point. From where Elgendy stands, it's just the right problem to solve.