Resolved -
System has been migrated to the new infrastructure.
Apr 26, 08:05 UTC
Monitoring -
OK, we are stable again, hopefully.
Sep 28, 01:37 UTC
Update -
The database croaked again! I'm restoring from a backup that is about 20 minutes old.
Sep 27, 21:40 UTC
Identified -
With a bit of help, the Kubernetes operator got the cluster working again, at least for now.
Sep 26, 01:25 UTC
Investigating -
The database cluster that was reset last night decided to blow itself up again. I think the extra query load and data from the new monitoring system are making the setup fragile.
Investigating.
Sep 26, 00:52 UTC