Tagged Content
Everything on the platform tagged with sre.

Adam Jacob is the co-founder of Chef (the infrastructure automation company acquired for $220M) and current CEO of System Initiative, a next-generation DevOps platform using digital twins to visually model and simulate infrastructure. A self-taught systems engineer who ran ISPs as a teenager and built automated infrastructure for 15 startups before co-founding Opscode/Chef in 2008, Jacob created the Chef, InSpec, and Habitat open-source tools that helped define the DevOps movement. Today he advocates for rebuilding DevOps from the ground up, arguing that tooling and culture must evolve together - and that Infrastructure as Code, as practiced, is still broken.

Justin Garrison is a platform engineering veteran who helped launch Disney+ from zero to 50 million subscribers, spent 3.5+ years as a Senior Developer Advocate at AWS working on EKS, and now serves as Head of Product at Sidero Labs. He co-authored 'Cloud Native Infrastructure' with O'Reilly, hosts the Ship It! and Fork Around and Find Out podcasts, and is one of the original chairs of the Kubernetes SIG on-prem. He's known for critical, no-hype takes on cloud native trends and a deep commitment to open source community.

Laura Nolan is a Principal Engineer at Stanza Systems, a veteran Site Reliability Engineer, and one of tech's most credible voices on autonomous weapons ethics. After five years at Google - where she contributed to the seminal O'Reilly SRE book and resigned over Project Maven - and seven years at Slack as Senior Staff Engineer, she brings deep technical authority to both the reliability engineering world and the global debate over killer robots. Based in rural Ireland, she speaks at SREcon, QCon, and TED stages alike, writes the Responsible Computing newsletter on Substack, and holds seats on the USENIX Board and the SREcon Steering Committee.

Liz Fong-Jones is a Technical Fellow at Honeycomb.io, renowned SRE practitioner, co-author of 'Observability Engineering' (O'Reilly), and one of the most influential voices in the observability and platform engineering space. With 18+ years in software engineering spanning Google (11 years) and Honeycomb, she bridges deep technical expertise with fierce advocacy for labor rights, trans inclusion, and workplace equity. She led the Google Walkout Strike Fund in 2018, founded the Solidarity Fund by Coworker, and sits on the OpenTelemetry governance committee - all while speaking at every major SRE and DevOps conference on Earth.

Lorin Hochstein is a Staff Software Engineer specializing in reliability at Airbnb, and one of the most respected voices in incident analysis and resilience engineering. Known for rewriting Chaos Monkey at Netflix, co-authoring the O'Reilly book 'Ansible: Up and Running', and contributing to the 'Learning from Incidents' community, he bridges the gap between academic complexity theory and real-world software operations. His blog 'Surfing Complexity' and conference talks challenge engineers to think more deeply about why systems fail and how humans make sense of them.

Roy Rapoport is a veteran engineering leader and writer whose work has quietly reshaped how tech companies think about people, reliability, and operational culture. Best known for his two stints at Netflix (where he built Insight Engineering and its operational platform) and a stint at Slack, Roy popularized the Manager README format and authored influential frameworks on feedback, trust, and performance improvement. He writes on Medium about the subtler mechanics of leadership, raises goats in California, and insists he will never retire.

Theo Schlossnagle is a serial entrepreneur, software engineer, and investor who has spent three decades pushing the edges of distributed systems and scalable infrastructure. Founder of OmniTI (1997), Circonus (2010), and General Partner at L42 Ventures, he is a Distinguished Member of the ACM, an IEEE member, co-chair of ACM's Queue Magazine, and the author of 'Scalable Internet Architectures'. Beyond software, he runs a butcher shop in Maryland, maintains a farm retreat in West Virginia, and has left Twitter for greener (and more federated) social pastures.

Yevgeniy 'Jim' Brikman is the co-founder of Gruntwork and one of the most influential voices in infrastructure-as-code. Author of three O'Reilly books — 'Terraform: Up & Running' (now in its 3rd edition), 'Hello, Startup', and 'Fundamentals of DevOps and Software Delivery' (2025) — he has spent 15+ years building infrastructure that served hundreds of millions of users at LinkedIn, TripAdvisor, Cisco Systems, and Thomson Financial. At Gruntwork, he helped pioneer the idea of treating infrastructure like software: testable, reusable, and versioned.