Tagged Content
Everything on the platform tagged with ocr.

Vik Paruchuri is the founder and CEO of Datalab, an AI startup building small, efficient foundation models for document intelligence. A self-taught ML engineer who majored in American History, he previously founded Dataquest, an online learning platform that taught data skills to over 1 million students. His open-source projects (Marker, Surya, Chandra OCR) have earned thousands of GitHub stars and benchmark-topping accuracy scores. He publishes 'The Vik Letter' newsletter covering semiconductors and tech.
LlamaIndex is a San Francisco-based AI infrastructure company and open-source framework that enables enterprises to build intelligent document agents using large language models. Founded in 2022 by Jerry Liu and Simon Suo, it started as a side project called GPT Index and has grown into a full enterprise platform with products like LlamaParse (agentic OCR), LlamaCloud (enterprise SaaS), and a widely used Python/TypeScript SDK. With 25M+ monthly downloads, 48K+ GitHub stars, and customers including Rakuten, Salesforce, and 90+ Fortune 500 companies, LlamaIndex is a leading player in the enterprise RAG and AI agent infrastructure space.