Breaking Shovels raises $5M seed led by Base10 Partners 130M+ permits indexed 2.3M contractors profiled 1,800+ jurisdictions covered 5M new permits added monthly Charlie AI now live Snowflake · Databricks · BigQuery shares Breaking Shovels raises $5M seed led by Base10 Partners 130M+ permits indexed 2.3M contractors profiled 1,800+ jurisdictions covered 5M new permits added monthly Charlie AI now live Snowflake · Databricks · BigQuery shares
Vol. 01 · Issue 06 · Lafayette, CA

Shovels.
Before the dirt moves.

An AI company quietly turning America's most boring paperwork - the building permit - into the most useful dataset in real estate, climate, and construction.

HQLafayette, CA
Founded2022
Team39 people
Last raise$5M Seed · 2025
Total funding$6.1M+
The Shovels team on a hike in the Bay Area
The team off-grid, between deploys. Bay Area, 2025.
The Scene

A clerk in Bakersfield uploads a PDF. Somewhere in Lafayette, a model wakes up.

It is Tuesday morning, and a permit clerk in Kern County, California is doing what permit clerks have done since the invention of the carbon copy - taking a stapled stack of architectural drawings and scanning them into a county portal that looks like it was built during the Clinton administration. The file lands in a folder. Nobody reads it.

Except, today, somebody does. Inside a quiet office park in Lafayette, an ingestion pipeline at a 39-person company called Shovels picks up the upload, normalizes the schema, extracts the parties involved, geocodes the address, classifies the work type, links it to a contractor profile that already exists in the database, and pushes the record - now structured, queryable, joinable - into Snowflake shares used by climate-tech firms in Brooklyn, building-materials suppliers in Atlanta, and a heat-pump installer in Sacramento who will, by Friday, have a list of every household within thirty miles that just pulled a permit for a new HVAC system.

That whole sequence took ninety seconds. The clerk has no idea.

What Shovels Is

The intelligence layer for the built world.

Building permits are the first verifiable signal that anything in the physical world is about to change - a roof, a panel, a charger, a warehouse, a skyscraper. They are also, almost universally, a mess. Each of the roughly 22,000 jurisdictions in the United States runs its own portal, schema, naming convention, and PDF habit. Some publish CSVs. Some publish nothing. A few still ask you to drive in.

Shovels collects all of it. The company has indexed more than 130 million permits across 1,800+ jurisdictions, profiled 2.3 million contractors, and built coverage that touches roughly 85% of the US population. It pipes the result out through a web app, an API, a CLI, Snowflake, Databricks, BigQuery, and a natural-language interface named Charlie.

If that sounds like infrastructure work, that is the point. Shovels is not trying to be the consumer brand of construction data. It is trying to be the substrate underneath it.

Tagline

"The intelligence layer for the built world."

Borrowed from the website's homepage. The phrase does the work of an entire pitch deck.

AIPermitsGeospatial SnowflakeDatabricksClimate Contractor DataProptech
By the Numbers

Boring data, loud math.

130M+
Permits Indexed
2.3M
Contractors Profiled
1,800+
Jurisdictions
5M
New Permits / Month

Coverage, in plain English

US Population
85%
Top-100 Metros
~92%
Permit Categories
~78%
Contractor Match Rate
~88%

Coverage figures sourced from Shovels.ai. Internal estimates marked with ~.

The Stack

Seven ways in.

Shovels resists the temptation to make customers learn a new app. The data shows up wherever the customer already works.

Web

Shovels Online

Point-and-click exploration and CSV exports. Where the analysts start.

API

Shovels API

Programmatic access for CRMs, lead-routing, and custom apps.

Warehouse

Database Access

Parquet drops and live shares to Snowflake, Databricks and BigQuery.

Natural Language

Charlie AI

Ask, in English, what got permitted last month in your ZIP. Get an answer.

Marketing

Audiences

B2B and homeowner segments built from real permit signals.

Geo

GIS Integration

QGIS, ArcGIS, and the rest of the spatial-analyst toolkit.

Developer

Shovels CLI

An agent-first command line for AI workflows. Designed for LLMs first, humans second - which is an interesting choice, and a deliberate one.

Stack

What's Under the Hood

Python, dbt, DuckDB, Prefect, Postgres, FastAPI, Vue, Fargate, scikit-learn, and Claude. A modern data-engineering greatest-hits.

Who Uses It

If it ends up in a building, they want to know.

Climate Tech

Heat pumps, solar, EV chargers

Every retrofit leaves a permit. Climate operators use Shovels to find the next install before the truck rolls.

Building Materials

Suppliers & manufacturers

Map demand at the ZIP-code level. Stop guessing which markets are heating up.

Real Estate

PE, brokers, appraisers

Permit activity is leading-indicator data for neighborhood movement and asset value.

Home Services

Roofers, HVAC, electricians

Targeted leads based on actual filings, not direct-mail lists from 2014.

Telecom & Infra

Tower & fiber rollouts

Site planning informed by what is being permitted around the parcel.

Construction Software

Embedded data partners

Companies like Beam plug Shovels in to enrich their own products.

Integration of Shovels data resulted in a remarkable 20 to 30% higher likelihood of contractor engagement. Beam, construction software customer
Shovels has a rare and massive opportunity to become the definitive source of truth for high-value, hard-to-access datasets at the local government level. Rexhi Dollaku, General Partner, Base10 Partners
The People & The Money

Two co-founders. One thesis. Six and a half million dollars.

Ryan Buckley · Co-Founder & CEO

A repeat operator who picked the unsexiest possible dataset and ran at it. Buckley is the one publishing under ryan@shovels.ai and writing the long blog posts about why local-government PDFs deserve more respect than they get.

Luka Kacil · Co-Founder & CTO

Runs the ingestion pipeline that eats 5 million records a month and keeps the schema honest. The reason 1,800 jurisdictions look like one table.

Funding History

$6.1M+ raised to date

June 10, 2025

Seed · $5,000,000

Led by Base10 Partners. Used to expand the team, scale ingestion, and push into adjacent local-gov datasets.

2022 - 2024

Pre-Seed

Earlier participation from Coelius Capital and angels.

What You Can Do With It

A short list of unfair advantages.

Find the install before the install

Pull a feed of every solar, heat-pump, or EV-charger permit in your service area, the day it's filed.

Verify a contractor in one call

License status, permit history, jurisdictions worked, recent activity - all from one endpoint.

Size a market without guessing

Build TAM models on actual permit volume instead of survey extrapolations.

Detect permit fraud

Cross-reference filings against contractor licenses and historical patterns.

Plan a tower or a fiber run

Layer permit density over parcels in QGIS or ArcGIS.

Talk to your data in English

Charlie answers questions like "show me ADU permits in Oakland over $200K this year."

Culture

A distributed team that hikes together.

Thirty-nine people, headquartered in Lafayette but spread across time zones. The about-page photos lean less startup-poster and more group-text: team hikes, paddleboards, dinners, the Golden Gate Bridge in the background. There is no foosball table on display, which is probably also the point.

The engineering stack reads like a curated tasting menu of modern data tooling - dbt, DuckDB, Prefect, Snowflake, Athena, Terraform, Docker, Fargate. The use of Claude alongside scikit-learn suggests a pragmatic AI posture: classical ML where it works, LLMs where they earn their keep.

Latest Updates

What's new at Shovels.

June 2025

$5M Seed round closes, led by Base10 Partners

Total funding crosses $6M. Capital earmarked for ingestion, team, and expansion beyond permits into adjacent municipal datasets.

2025

Charlie AI ships

A natural-language interface sits on top of the permit graph. Customers can ask questions instead of writing SQL.

2025

Agent-first CLI released

A command-line tool optimized for LLM agents - one of the first products of its kind to be designed AI-native from the outset.

The Scene, Revisited

Back to Bakersfield.

Three months after that Tuesday-morning upload, the homeowner whose roof inspired the permit gets a knock at the door. It is a sales rep for a solar installer who knew - somehow - that the house was a candidate. The pitch lands. The panels go up. The next permit gets filed. The pipeline picks it up. Ninety seconds later, a battery-storage company in Denver gets a row in its dashboard.

The clerk in Kern County still has no idea. She is, by now, on her four hundredth scan of the morning. But the file she uploaded is no longer a file in a folder. It is a node in a graph that 39 people in Lafayette built, on purpose, because they decided that paperwork is a feature.

Shovels did not invent the building permit. It just made it readable.