Taming Distributed Stateful Pets With Kubernetes

So you've mastered Kubernetes for scheduling and scaling your stateless applications. Your pager has been quieter, life is good. But what about the carefully configured database clusters running on expensive dedicated infrastructure? (And the expensive sysadmin you're paying to maintain it!).

In Kubernetes, there are now many of the building blocks needed to help herd database ‘Pets’, alongside the stateless applications in your cluster. In this talk, we'll explain how we use these features, including StatefulSet and dynamic volume provisioning, to manage the lifecycle of distributed and secure Cassandra clusters for cloud and bare-metal environments with the open source project Navigator.

Building on the Kubernetes API machinery, with custom API server and controllers, a ‘Navigator’ and sidecar ‘Pilots’ codify and automate many of the processes that would previously have been performed by a database administrator. Bootstrap, scale-up and safe scale-down, health checks, load balancing and more. Learn how Navigator uses Kubernetes as its kernel to make it as easy to deploy and manage stateful systems as stateless applications.


Matthew Bates

Co-founder at UK Kubernetes Company Jetstack

Matt’s background is in solutions for the acquisition, management and exploitation of large-scale data. Since its launch, he has contributed widely to the Kubernetes project, both to the technology and to the ecosystem. He was an early employee at NoSQL startup MongoDB, and previously at...

Read more


James Munnelly

Solutions Engineer @Jetstack

I'm a Solutions Engineer at Jetstack, which involves helping customers bend and break Kubernetes to their will. I've created a number of extensions to Kubernetes core, including cert-manager (a kube-lego successor), Navigator (DBaaS for Kubernetes), and built my own simple cloud provider for bare...

Read more


Churchill, G flr.


Distributed Stateful Systems


KubernetesCassandraDistributed SystemsDatabase ManagementImmutable Infrastructure


Slides are not available


From the same track

SESSION + Live Q&A Cloud Computing

Cloud-Native and Scalable Kafka Architecture

Kafka as a distributed stateful service faces serious stability and scalability challenges in cloud environment which favors stateless services. As cluster size grows with traffic, it faces issues of data balancing, high consumer data fan out and time consuming process to scale up or update....

Allen Wang

Senior Software Engineer - Cloud Platform @Netflix

SESSION + Live Q&A Ingestion

Scaling Uber's Elasticsearch Clusters

Uber's Marketplace is the algorithmic brain behind Uber's ride-sharing services, and the brain needs immense amount of real-time data to make timely and sound decisions. Uber's Marketplace Intelligence team has been using Elasticsearch as a real-time OLAP database to serve thousands of internal...

Danny Yuan

Real-time Streaming Lead @Uber

SESSION + Live Q&A Distributed Systems

The Future of Distributed Databases Is Relational

Years ago when working at Amazon on shopping cart infrastructure and the precursor to DynamoDB, my co-founder and I realized that while distributed key value stores were useful for a few use-cases, we missed many of the benefits of relational databases: transactions, joins, and the power of the...

Sumedh Pathak

VP Engineering & Co-Founder @CitusData

SESSION + Live Q&A Machine Learning

Real-Time Decisions Using ML on the Google Cloud Platform

Ocado Technology is providing a full solution to put the world’s retailers online using the cloud, robotics, AI and IoT. Processing tens of thousands of orders every day, we generate millions of events every minute, leading to huge amount of data to be managed. We will present how this Big Data...

Carlos Garcia

Ocado Smart Platform Fraud Team Lead

Przemyslaw Pastuszka

ML Engineer @Ocado

View full Schedule