Availability Rules
Avoid taking too many PostgreSQL pods offline at once. With the default three-instance PostgreSQL deployment, take at most one PostgreSQL pod or node offline at a time. CloudNativePG uses pod disruption budgets and switchover-aware behavior to protect the cluster during routine maintenance. Keep at least one PostgreSQL instance ready. Running with one ready instance removes standby failover capacity, but it keeps PostgreSQL available. Taking every PostgreSQL instance offline is a planned outage for Metoro UI, API, authentication, configuration, and Temporal workflow persistence paths. If the primary is affected, expect CloudNativePG to switchover where possible. Single-instance deployments cannot switchover, so taking the only PostgreSQL pod offline is a PostgreSQL outage. For maintenance, work one pod or node at a time. Wait for PostgreSQL readiness to recover before continuing to the next pod or node.Before Maintenance
Check the CloudNativePG cluster:During Maintenance
Take one pod or one node offline at a time. After each step, confirm PostgreSQL availability before moving on:- At least one PostgreSQL instance stays ready.
- In the default three-instance deployment, at least two PostgreSQL instances should normally remain ready while one is under maintenance.
- If the primary is disrupted, CloudNativePG may promote a standby and move the write endpoint.
- Metoro UI, API, authentication, configuration writes, and Temporal persistence may have reduced capacity or brief reconnects, but should not become fully unavailable unless all PostgreSQL instances are offline.
After Maintenance
Confirm all expected PostgreSQL pods are ready:- At least one PostgreSQL instance stayed ready throughout the maintenance window, unless the work was planned as a PostgreSQL outage.
- The default three-instance deployment returns to three ready PostgreSQL pods.
- CloudNativePG reports a healthy cluster.
- Apiserver and Temporal recover automatically if PostgreSQL readiness briefly changed.
