Pokee AI - The Agent That Actually Clicks the Buttons

The Scene

It is a Tuesday, and the agent is busy

Somewhere in an enterprise account, a request goes in as a single plain-English sentence: pull last quarter's numbers, build the deck, draft the summary, post it. No macros. No if-this-then-that recipe stitched together by hand. A Pokee agent reads the sentence, plans a route through a dozen different apps, reaches for the right tool at each step, checks its own work, and cites where the numbers came from. Then it stops - because it is finished.

That last part is the whole point. The AI industry spent a few loud years proving that models can write, summarize, and hold a conversation. Pokee AI is built around a quieter, harder question: can a model reliably act? Not narrate the plan - execute it, tool after tool, without fumbling the handoff. In Bellevue, Washington, a team of roughly eighteen people decided that gap was the whole business.

The Founder

From a Stanford lab to Meta's RL team to a founder's chair

Pokee AI was founded in 2024 by Zheqing (Bill) Zhu, who earned a Stanford PhD in reinforcement learning and then led Meta's applied reinforcement-learning team. Reinforcement learning is the branch of machine learning where a system learns by trial, error, and reward rather than by predicting the next word. It is how software learns to play games at superhuman levels and how robots learn to move. Zhu's wager is that it is also how agents should learn to use tools.

His diagnosis of the current crop of agents is blunt: they are not really intelligent, they are guessing. When an agent picks a tool by asking a language model to "call a function," it is essentially autocompleting its way through a task. That works until the task gets long, the tools get numerous, and one wrong step derails the rest. Pokee's answer is to train the tool-picking itself - to reward the sequences that work and penalize the ones that don't.

"Your AI. Your data. Your compute."

- Pokee AI's operating principle for enterprises

What It Does

One agent, wired into your whole stack

At the center is a foundation tool-use model - a model trained specifically to plan, reason, and select the right tool at the right moment. Pokee says it can extend to more than 6,000 tools and, on function-calling, outperforms GPT-4o, Claude 3.7, and Gemini 2.5 Pro. Around that model sits a platform that connects to 90+ SaaS apps through one-click OAuth, covering well over a thousand integrations, and runs a plan-act-verify-cite loop so the agent shows its work.

Product 01

Agent Platform

Executes multi-step workflows across 90+ apps from a plain-English prompt. Deploy in private cloud, on-premise, or on-device - with end-to-end encryption.

Product 02

Foundation Model

An RL-trained tool-use model that plans, reasons, and calls tools - the engine the company says beats leading LLMs at function calling.

Product 03

PokeeResearch-7B

An open-source 7B deep-research agent that reasons across the web, retrieves evidence, and self-verifies. Weights on Hugging Face, Apache 2.0.

The Claim, Charted

A small model punching up

Pokee's marquee open release, PokeeResearch-7B, is a seven-billion-parameter model - tiny by frontier standards. The company reports it leads the field of open 7B deep-research agents, topping 7 of 10 benchmarks. The illustrative bars below show the pitch: state-of-the-art among its weight class, trained with reinforcement learning from AI feedback.

PokeeResearch-7B

SOTA

Peer 7B agents

—

Benchmarks led

7/10

Illustrative. Figures reflect Pokee AI's own reported results across 10 open deep-research benchmarks; independent verification recommended.

By The Numbers

The shape of a seed-stage bet

$12M

Seed round, 2025

2024

Year founded

~18

Employees

6,000+

Tools reachable

The $12M seed was led by Point72 Ventures, with participation from Qualcomm Ventures, Samsung NEXT, Salience Capital, SCB 10X, and angels including Typeface founder Abhay Parasnis and semiconductor veteran Lip-Bu Tan. Qualcomm and Samsung are not just money - both have obvious stakes in agents that can run on-device, which is exactly one of Pokee's deployment modes.

The Wager

Zapier automates rules. LLMs guess. Pokee learns.

To place Pokee, line up the alternatives. Rule-based automation - think Zapier - is reliable but rigid: someone has to build the recipe in advance. LLM-based agents are flexible but brittle: they improvise tool calls and can wander off. Pokee's argument is that reinforcement learning threads the needle - flexible enough to handle a task it has never seen, trained enough to keep picking the right tool.

There is a second wager stacked on the first: privacy. Pokee sells to enterprises with the promise that the agent runs inside their own walls - private cloud, on-premise, or on-device - so sensitive data never has to leave. In a market where "where does my data go?" is the first question a security team asks, "nowhere" is a strong answer.

The gap between "AI can talk" and "AI can do" is where the next fortunes get made. Pokee planted its flag squarely in the middle.

- The Pokee AI thesis, in one line

The Story So Far

A short history, moving fast

2024

Bill Zhu leaves Meta's applied RL team and founds Pokee AI in the Seattle area, betting on RL-driven tool use over LLM function-calling.

2025 · JULY

Announces a $12M seed round led by Point72 Ventures, with Qualcomm and Samsung participating. Platform reaches public beta with enterprise design partners.

2025 · OCTOBER

Open-sources PokeeResearch-7B - a state-of-the-art small deep-research agent - releasing weights and inference code on Hugging Face and GitHub under Apache 2.0.

Marginalia

Things worth knowing

The name is a job description. "Pokee" hints at what the product does - poking the right buttons across your apps, so you don't have to.
It runs against the grain. While much of the industry scales ever-larger language models, Pokee trained a smaller model to act.
It gave away its research model. The enterprise foundation model stays proprietary; PokeeResearch-7B's weights are free under Apache 2.0 - confidence used as strategy.
Two chip giants are on the cap table. Qualcomm and Samsung both invested - a tell that on-device agents are part of the plan.

Watch & Listen

Demos and interviews

YouTube · Channel Pokee AI on YouTube - product demos & walkthroughs →

Product Hunt · Launch Pokee AI: Vibe Your Agentic Workflows →

The Directory

Find Pokee AI

Website · pokee.ai↗ LinkedIn↗ X / Twitter · @Pokee_AI↗ GitHub · Pokee-AI↗ Hugging Face · PokeeAI↗ YouTube · @pokeeai↗ Reddit · r/PokeeAI↗ GeekWire · $12M seed coverage↗

Back To The Scene

The same Tuesday, quietly changed

Return to that plain-English request from the start. A year ago it would have been a person's afternoon - hopping between tabs, exporting a spreadsheet, wrestling a template, copy-pasting a summary, remembering to post it. Now it is a sentence, and a Pokee agent working the route while the person does something else. The deck lands. The numbers are cited. The task, unglamorously, gets done.

That is the whole ambition, and it is smaller and stranger than "artificial general intelligence." Pokee AI is not promising a mind. It is promising a worker that can be trusted to take the next correct step, and the one after that, across the ordinary sprawl of software people already use. Eighteen people in Bellevue think that trustworthy action - not clever conversation - is the part of AI still up for grabs. The Tuesday is the proof they are chasing.

Pokee AI: the agent that clicks the buttons.

It is a Tuesday, and the agent is busy

From a Stanford lab to Meta's RL team to a founder's chair

One agent, wired into your whole stack

Agent Platform

Foundation Model

PokeeResearch-7B

A small model punching up

The shape of a seed-stage bet

Zapier automates rules. LLMs guess. Pokee learns.

A short history, moving fast

Things worth knowing

Demos and interviews

Find Pokee AI

The same Tuesday, quietly changed

Share this profile