PersonFounderEngineer
Simon Mo
Simon Mo is the CEO and co-founder of Inferact, the startup commercializing vLLM, the open-source inference engine he helped create and now leads as lead maintainer. A PhD student at UC Berkeley's Sky Computing Lab advised by Ion Stoica and Joseph Gonzalez, Mo has spent roughly eight years building high-throughput, memory-efficient model-serving systems, from Ray Serve at Anyscale to vLLM, which now powers inference for companies including Amazon. In January 2026 he and his vLLM co-maintainers raised a $150M seed round at an $800M valuation, co-led by Andreessen Horowitz and Lightspeed.
simon moinferactvllmllm inferencemachine learning systemsuc berkeley