SESSION + Live Q&A

Cultivating Production Excellence - Taming Complex Distributed Systems

Taming the complex distributed systems we're responsible for requires changing not just the tools and technical approaches we use; it also requires changing who is involved in production, how they collaborate, and how we measure success. 

 

In this talk, you'll learn about several practices core to production excellence: giving everyone a stake in production, collaborating to ensure observability, measuring with Service Level Objectives, and prioritizing improvements using risk analysis.


Speaker

Liz Fong-Jones

Site Reliability Engineer

Liz is a developer advocate, labor and ethics organizer, and Site Reliability Engineer (SRE) with 15+ years of experience. She is an advocate at Honeycomb.io for the SRE and Observability communities, and previously was an SRE working on products ranging from the Google Cloud Load Balancer to...

Read more
Find Liz Fong-Jones at:

Location

Fleming, 3rd flr.

Track

Operationalizing Microservices: Design, Deliver, Operate

Topics

MicroservicesSilicon Valley

Share

From the same track

SESSION + Live Q&A Distributed Systems

Complex Event Flows in Distributed Systems

Event-driven architectures enable nicely decoupled microservices and are fundamental for decentralized data management. However, using peer-to-peer event chains to implement complex end-to-end logic crossing service boundaries can accidentally increase coupling. Extracting such business logic...

Bernd Ruecker

Co-founder and chief technologist @Camunda

SESSION + Live Q&A Reactive Programming

Reactive Systems Architecture

Reactive systems architecture promises resilience and scalability, but building and maintaining a globally distributed system introduces considerable challenges. Jan and Matt will share the most important building aspects of systems that spread over multiple data centres as well as multiple AWS...

Jan Machacek

Senior Principal Engineer @waltdisneyco & Founder @muvrhq

Matthew Squire

Technical Team Leader @BamtechMedia

SESSION + Live Q&A Infrastructure

Lessons From 300k+ Lines of Infrastructure Code

This talk is a concise masterclass on how to write infrastructure code. I’ll share key lessons from the “Infrastructure Cookbook” we developed at Gruntwork while creating and maintaining a library of over 300,000 lines of infrastructure code that’s used in production by...

Yevgeniy Brikman

Co-founder @gruntwork_io

SESSION + Live Q&A Microservices

What Lies Between: The Challenge of Operationalising Microservices

The biggest challenge in operationalising microservices is managing the space between them. This is the land of distributed systems: uncertainty and non-determinism. I will present practical approaches that you can use to take microservices into production or increase the value provided by...

Colin Breck

Sr. Staff Software Engineer @Tesla

UNCONFERENCE + Live Q&A Microservices

Microservices Open Space

Ian Robins

View full Schedule