Chaos testing of a Postgres cluster on Kubernetes
Presented by: Nikolay Sivko
Nikolay Sivko, co-founder and CEO of Coroot, aims to simplify production troubleshooting for developers. He is passionate about Site Reliability Engineering (SRE) practices, observability, and open source. Previously, he headed the Engineering group at a large technology company and founded an observability tool company in Russia, which was successfully acquired. He currently lives in Turkey, where he is building a startup aimed at the international market.
In this presentation, we will explore how to make distributed applications, such as database clusters in Kubernetes, observable. To illustrate this, we will intentionally introduce real failures into a Postgres High Availability (HA) cluster managed by the Postgres Operator for Kubernetes. For each type of failure, we aim to understand how to detect it, evaluate whether the cluster components can handle it automatically, and determine the recovery time.
- Date:
- Duration: 20 min
- Room:
- Conference: PostgresWorld Webinars: 2026
- Language:
- Track: Ops
- Difficulty: Medium