Tagged Content
Everything on the platform tagged with unstructured-data.
Anant Bhardwaj is the founder and CEO of Instabase, a San Francisco-based AI platform that helps enterprises extract intelligence from unstructured data - PDFs, images, emails, and documents of every kind. He dropped out of a PhD at MIT in 2015 to build what has become a $1.24 billion company with $277M in total funding, serving clients like NatWest, AXA, Uber, and four of the five largest U.S. banks. Born in Bihar, India, he studied at Pune, Stanford, and MIT before pioneering the idea that the enterprise's biggest untapped asset is the data it can't yet read.
Unstructured is a San Francisco company building the data layer for generative AI. Its open-source library and enterprise platform ingest PDFs, slide decks, emails, images and 70+ other file types, then transform them into clean, structured data that LLMs and RAG pipelines can actually use.
LlamaIndex is a San Francisco company building the data framework and cloud platform that lets enterprises turn messy unstructured documents into knowledge agents powered by large language models. Its open-source library is one of the most-used scaffolds for retrieval-augmented generation, and its hosted product, LlamaCloud, packages parsing, extraction, and indexing for production teams.

Kon Leong is the CEO and co-founder of ZL Technologies, a Milpitas-based enterprise software company he built from the ground up in 1999 to tackle the problem nobody else wanted to solve: the mountain of unstructured human data - emails, documents, messages - that sits at the center of every corporate compliance, legal, and AI challenge. A serial entrepreneur who migrated from China to India to Canada to the US, Leong brought a rare mix of deep IT engineering, Wall Street M&A finance, and startup grit to a market that was just waking up to its own data problem. Today, ZL Technologies serves Fortune 500 companies and government agencies across financial services, healthcare, and the public sector, with a platform that manages data in-place at speeds up to 1,000 times faster than conventional approaches.
Sid Manchkanti is the Co-Founder and CEO of Pulse, a production-grade document intelligence platform that converts complex unstructured documents into LLM-ready structured data. A Berkeley CS graduate with experience at NVIDIA and D.E. Shaw, he co-founded Pulse in 2024 after going through Y Combinator's Summer 2024 batch. The company raised a $3.9M seed round led by Nat Friedman and Daniel Gross, and has processed over one billion pages for Fortune 100 enterprises across finance, healthcare, insurance, legal, and supply chain sectors.