Tagged Content
Everything on the platform tagged with reliability.

Lorin Hochstein is a Staff Software Engineer specializing in reliability at Airbnb, and one of the most respected voices in incident analysis and resilience engineering. Known for rewriting Chaos Monkey at Netflix, co-authoring the O'Reilly book 'Ansible: Up and Running', and contributing to the 'Learning from Incidents' community, he bridges the gap between academic complexity theory and real-world software operations. His blog 'Surfing Complexity' and conference talks challenge engineers to think more deeply about why systems fail and how humans make sense of them.

Roy Rapoport is a veteran engineering leader and writer whose work has quietly reshaped how tech companies think about people, reliability, and operational culture. Best known for his two stints at Netflix (where he built Insight Engineering and its operational platform) and a stint at Slack, Roy popularized the Manager README format and authored influential frameworks on feedback, trust, and performance improvement. He writes on Medium about the subtler mechanics of leadership, raises goats in California, and insists he will never retire.