Tagged Content
Everything on the platform tagged with ai-inference.
Positron AI designs purpose-built inference hardware for transformer models, aiming to make Nvidia GPUs optional for running large language models at production scale. Its first product, Atlas, is built in US fabs, and the company claims roughly 3x lower latency and 4x better performance per watt than an Nvidia H100 system.
Zain Asgar is the co-founder and CEO of Gimlet Labs, a San Francisco-based AI infrastructure company building the world's first multi-silicon inference cloud. He holds a PhD in electrical engineering from Stanford, focused on GPU energy modeling. He previously led engineering at Google AI, where his work became Google Lens, and founded Pixie Labs, a Kubernetes-native observability platform acquired by New Relic in 2020. At Gimlet Labs, he is tackling one of AI's most pressing infrastructure challenges: making AI inference 3-10x more efficient by intelligently routing workloads across heterogeneous hardware, including NVIDIA, AMD, Intel, ARM, and specialized accelerators such as Cerebras.