Tagged Content
Everything on the platform tagged with nlp.

Nathan Lambert is a Senior Research Scientist and Post-Training Lead at the Allen Institute for AI (Ai2), where he leads open-source language model development on the OLMo and Tulu series. A UC Berkeley PhD, he previously led the RLHF team at Hugging Face, co-building the TRL library and the Zephyr model. He runs Interconnects AI, a Substack newsletter read by tens of thousands covering post-training, open models, and AI policy, and is the author of The RLHF Book (Manning Publications). With roughly 8,000 academic citations and a reputation for demystifying the hardest parts of modern AI, Lambert is one of the most trusted voices at the intersection of open-source AI research and public education.

Sebastian Raschka is a German-born AI/ML researcher, educator, and author who has built one of the most trusted independent voices in the machine learning community. Through his Substack newsletter 'Ahead of AI' (184,000+ subscribers), bestselling books like 'Build a Large Language Model (From Scratch)', and 91,000+ starred GitHub repositories, he demystifies cutting-edge AI for practitioners worldwide. After a stint as an Assistant Professor at UW-Madison and a role as Staff Research Engineer at Lightning AI, he now runs RAIR Lab as an independent researcher, writer, and consultant.

Elvis Saravia is the co-founder and lead AI researcher at DAIR.AI, a mission-driven organization democratizing AI research and education worldwide. Based in Belize, he is the author of the Prompt Engineering Guide - one of the most widely read AI resources on the internet with 73,000+ GitHub stars and over 3 million learners - and publishes the AI Agents Weekly newsletter. A PhD graduate from National Tsing Hua University in Taiwan, he has contributed to landmark AI projects including the Galactica large language model at Meta AI, and is known for bridging rigorous research with accessible, production-minded education for the next generation of AI builders.

Eugene Yan is a Principal Applied Scientist turned Member of Technical Staff at Anthropic, where he bridges cutting-edge AI research with production-scale systems. Formerly at Amazon for five years building real-time recommendation and LLM-powered systems for Kindle and Search, Eugene is equally well-known for his prolific writing: 209 blog posts, 420,000+ words published, and a newsletter with over 11,800 subscribers. His open-source repository applied-ml on GitHub has become a canonical reference for teams shipping machine learning in production. He lives in Seattle, snowboards on weekends, and writes like someone who actually wants you to understand.

Hamel Husain is a machine learning engineer with 25+ years of experience who built part of the foundation beneath GitHub Copilot - his CodeSearchNet project was early LLM research later used by OpenAI for code understanding. Today he runs Parlance Labs, consults with AI teams across 35+ products, co-authored O'Reilly's 'Evals for AI Engineers', and teaches thousands of engineers how to move beyond vibes and actually measure their AI systems.

Maarten Grootendorst is a psychologist-turned-ML engineer at Google DeepMind, best known for creating BERTopic, KeyBERT, and PolyFuzz - open-source NLP tools with over 15 million combined downloads. Co-author of the Amazon #1 bestseller 'Hands-On Large Language Models' (O'Reilly, 2024) with Jay Alammar, he runs the 'Exploring Language Models' newsletter with 2M+ views and has taught 50,000+ students on DeepLearning.AI. His work bridges the worlds of psychology and AI, making complex language model internals accessible through strikingly visual guides.

Mihail Eric is a Palo Alto-based ML engineer, researcher, educator, and serial founder who has spent a decade bridging cutting-edge AI research and production systems. A Stanford CS alumnus who studied under Christopher Manning and Percy Liang, he built some of Amazon Alexa's earliest large language models, co-founded YC-backed Storia AI, founded Confetti AI (acquired by Towards AI), and now teaches 'The Modern Software Developer' at Stanford while running a newsletter for 17,000+ AI practitioners.

Tim Dettmers is an Assistant Professor at Carnegie Mellon University and Research Scientist at the Allen Institute for AI (AI2), best known for making large language models accessible on consumer hardware. He created the bitsandbytes library (2.2M monthly installs), co-authored QLoRA - a technique enabling fine-tuning of 65B-parameter models on a single GPU - and pioneered LLM.int8() quantization. With over 18,000 citations across his work, Dettmers has become one of the most influential voices in efficient deep learning, consistently arguing that computational democratization - not AGI hype - is where the real progress lives.

Vicki Boykis is a founding ML engineer and one of the most respected voices in applied machine learning. Known for making complex systems legible through rigorous writing and dry wit, she runs the Normcore Tech newsletter, authored a widely-cited deep dive on embeddings, built Viberary (a semantic book recommendation engine), and created Normconf - an unconventional data conference celebrating the unglamorous realities of ML work. She brings an economist's skepticism and a software engineer's discipline to a field that often confuses hype for progress.

Yao Fu (符尧) is an AI researcher at xAI specializing in large language model reasoning, efficient inference, and distributed systems. A PhD graduate of the University of Edinburgh, he previously worked at Google DeepMind on Gemini 3 and Project Astra. With over 5,000 citations and key papers like ServerlessLLM (OSDI '24) and DuoAttention (ICLR '25), Fu bridges systems engineering and ML research. He writes the 'Yao Fu' newsletter on Notion and is known for the Chain-of-Thought Hub benchmark repository, which helped track LLM reasoning progress across the field.

ZeroEntropy is the AI infrastructure company fixing the broken retrieval layer of modern AI applications. Founded in 2024 by Ghita Houir Alami (CEO) and Nicholas Pipitone (CTO), the San Francisco–based startup builds rerankers, embedding models, and end-to-end search infrastructure that outperforms Google, OpenAI, Cohere, and Voyage on public benchmarks. Backed by Y Combinator (W25) and a $4.2M seed round led by Initialized Capital, ZeroEntropy's products — zerank-2, zembed-1, zsearch, and ze-onprem — are used by enterprises including Assembled (serving Stripe, Canva, Robinhood, and Notion). The company's proprietary zELO training methodology, derived from chess Elo ratings and the Thurstone statistical model, produces models with calibrated relevance judgments that binary labels cannot replicate.
LlamaIndex is a San Francisco-based AI infrastructure company and open-source framework that enables enterprises to build intelligent document agents using large language models. Founded in 2022 by Jerry Liu and Simon Suo, it started as a side project called GPT Index and has grown into a full enterprise platform with products like LlamaParse (agentic OCR), LlamaCloud (enterprise SaaS), and a widely-used Python/TypeScript SDK. With 25M+ monthly downloads, 48K+ GitHub stars, and customers including Rakuten, Salesforce, and 90+ Fortune 500 companies, LlamaIndex is a leading player in the enterprise RAG and AI agent infrastructure space.

ElevenLabs is an AI research and product company that builds human-like voice technology. Founded in 2022 by two Polish engineers frustrated by badly dubbed movies, it grew from zero to $200M ARR in three years and reached an $11 billion valuation by February 2026. Its platform covers text-to-speech, voice cloning, AI dubbing, conversational agents, and speech-to-text across 70+ languages, used by everyone from independent creators to over 60% of Fortune 500 companies.

Glean is an AI-powered Work AI platform founded in 2019 by former Google and Rubrik engineers, headquartered in Palo Alto, CA. The company offers enterprise search, an AI assistant, and an agentic platform that connects to 100+ business applications, enabling employees to find information and automate work across their entire digital workspace. With a mission to expand human potential to do extraordinary work, Glean has grown to over $200M ARR, achieved a $7.2B valuation in June 2025, and counts 400+ enterprises including Booking.com, eBay, LinkedIn, and Samsung as customers.

COO of Stratyfy, adjunct faculty at Northeastern University, and one of the most credible voices in the fight against algorithmic discrimination in financial services. Born in Istanbul, educated in engineering and business, Deniz has spent decades bridging the gap between AI technology and human consequences — building interpretable, bias-mitigating AI systems used by 20+ lenders through the Underwriting for Racial Justice programme. She is a serial board member, community convener, and published author who believes fairness and performance are not opposites but the same equation solved correctly.

Perplexity AI is a San Francisco-based AI company that built the world's leading 'answer engine' ? replacing traditional link-based search with real-time, AI-generated responses that cite their sources. Founded in August 2022 by four AI researchers from OpenAI, Meta, Databricks, and Quora, Perplexity has grown from a scrappy post-ChatGPT prototype to a $20B+ company with 45 million monthly active users, over $1.7B in total funding, and a product suite spanning a conversational search engine, a developer API platform, and the Comet AI browser.