Track Overview
Debug, Analyze & Optimise... in Production!
Reaching production is only the beginning.
Even when we spend the most dedicated effort in design, architecture and testing, there is no perfect substitute to seeing applications run in production, exercised by real-users in the chaos of distributed systems.
In this track, we’ll learn how high-performing teams embrace this unpredictable side of production environments and how they build systems that can be observed and responded to in real-time.
Pierre Vincent
Head of SRE @weareglofoxFrom this track
An Observable Service with No Logs
Tuesday Apr 5 / 10:35AM BST
After working with Honeycomb for a little while and starting to instrument our existing code with events, I’d become enamoured with the level of observability possible with that sort of telemetry. In particular, how easy it became to interactively and visually explore how my systems were...
Glen Mailer
Senior Software Engineer @Geckoboard
Profiles, the Missing Pillar: Continuous Profiling in Practice
Tuesday Apr 5 / 11:50AM BST
With Continuous Profiling (CP) you capture resource usage (such as CPU, memory, I/O, etc.) over time, enabling you to pinpoint the (source) code that is slow or causes an issue. In recent times, CP has become mainstream and a number of open source projects such as Parca, Pyroscope, or CNCF...
Michael Hausenblas
Solution Engineering Lead @AWS
Chaos Engineering Observability with Visual Metaphors
Tuesday Apr 5 / 01:40PM BST
Observability is key in operating a system in production; it’s required during an incident, when an operator has to interrogate, inspect, and piece together what happened to avoid a similar event. In those scenarios, Chaos engineering and Observability are closely connected - providing...
Yury Niño Roa
Cloud Infrastructure Engineer @Google
Slack’s DNSSEC Rollout: Third Time’s the Outage
Tuesday Apr 5 / 02:55PM BST
We all have to manage DNS. DNS changes are inherently high-blast-radius and high-visibility. We present a case study of what happened when a large SaaS company enabled DNSSEC. We did significant planning and testing beforehand. The rollout went smoothly for most of our domains, but one...
Rafael de Elvira Tellez
Senior Software Engineer @Slack
Could Observability-Driven Development Be the Next Leap?
Tuesday Apr 5 / 04:10PM BST
Twenty years ago Kent Beck coined the term “test-driven development”: write tests first, develop the code later. Today, even if not practising true TDD, the idea of writing code without tests is an immediate warning sign to any developer. Yet, most teams still continue shipping code...
Yury Niño Roa
Cloud Infrastructure Engineer @Google
Michael Hausenblas
Solution Engineering Lead @AWS
Glen Mailer
Senior Software Engineer @Geckoboard
Jessica Kerr
Principal Developer Evangelist @honeycombio
Unconference: Observability
Tuesday Apr 5 / 05:25PM BST
Details coming soon.
Speakers from this track
Glen Mailer
Senior Software Engineer @Geckoboard
After spending a bunch of years as a contractor, Glen worked across a variety of roles at all levels of the stack: from infrastructure to frontend with a detour via databases - this has led to a very varied set of experiences to draw from. Most recently he’s worked on build infrastructure...
Read moreFind Glen Mailer at:
Michael Hausenblas
Solution Engineering Lead @AWS
Michael is a Solution Engineering Lead in the AWS open source observability service team. He covers Prometheus, Grafana, and OpenTelemetry upstream and in managed services. Before Amazon, Michael worked at Red Hat, Mesosphere (now D2iQ), MapR (now part of HPE), and prior to that ten years in...
Read moreYury Niño Roa
Cloud Infrastructure Engineer @Google
Software Engineer with 8+ years of experience designing, implementing and managing the development of software applications using agile methodologies such as scrum and kanban. 3+ years of DevOps and SRE experience supporting, automating and optimizing mission-critical deployments, leveraging...
Read moreFind Yury Niño Roa at:
Rafael de Elvira Tellez
Senior Software Engineer @Slack
Rafael is a Senior Software Engineer for the Demand Engineering team at Slack. Demand Engineering enables fast and reliable delivery.Outside work, Rafa enjoys spending time in the mountains climbing, hiking, mountain biking, etc with his friends but also spending time with his pets and...
Read moreFind Rafael de Elvira Tellez at:
Jessica Kerr
Principal Developer Evangelist @honeycombio
Jessica Kerr (@jessitron) is a Principal Developer Evangelist at Honeycomb.io. After twenty years as a developer, she sees software as a significant force in the world. As software engineers, we change reality--including our own, and that's developer experience! Jess lives in St. Louis,...
Read moreFind Jessica Kerr at:
Track Host
Pierre Vincent
Head of SRE @weareglofox
Track Host
Pierre Vincent
Head of SRE @weareglofox
Originally from a software development background, the rise of DevOps drove Pierre Vincent to become more involved in how systems actually run in the real world and how he could make a difference helping others care about the applications they release to production.Pierre is currently Head of SRE...
Read more