Presented by:

Nikolay Sivko

Coroot

Nikolay Sivko, co-founder and CEO of Coroot, aims to simplify troubleshooting in production for developers. He is passionate about Site Reliability Engineering (SRE) practices, observability, and open source. Previously, he was the head of the Engineering group at a large technology company and founded an observability tool development company in Russia, which he successfully acquired. Currently, he resides in Turkey, focusing on developing a startup with an international market orientation.

No video of the event yet, sorry!

In this presentation, we will explore the process of making distributed applications, such as database clusters in Kubernetes, observable. To illustrate this, we will intentionally introduce real failures into a Postgres High Availability (HA) cluster managed by the Postgres Operator for Kubernetes. We aim to understand how to detect different types of failures, evaluate whether the cluster components can handle each failure automatically, and determine the recovery time for each scenario.

Date:
Duration:
20 min
Room:
Conference:
PostgresWorld Webinars: 2026
Language:
Track:
Ops
Difficulty:
Medium