Site Reliability Engineering
Past Presentations
Amplifying Sources of Resilience: What Research Says
Building robust software systems means anticipating how failures may occur with components and subsystems and developing answers to the question: “What is needed for the design of systems that prevents or limits catastrophic failure?” Investing in, developing, and...
Tesla Virtual Power Plant
A Virtual Power Plant (VPP) is a network of distributed energy-resources (often solar, wind, and batteries) that are aggregated to provide smarter and more flexible power generation, distribution, and availability. A VPP leverages assets for more than one purpose, and, in doing so, decentralizes...
Managing Systems in an Age of Dynamic Complexity
Why is it that a single server can often have better uptime than a public cloud service?We used to manage systems. Instead, many of us now write and run dynamic control planes: the systems that run our user-facing systems. We find the dynamic control plane pattern in software-defined networking,...