Outage

If you cannot SSH into the server during an outage, contact someone from redpencil (Aad, Niels, Felix, ...) so they can check what is going on.

If you can SSH into the server, check the CPU and RAM usage with htop. If either is at or near 100%, check which container consumes the most resources with docker stats --no-stream.
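A minimal sketch of this check, assuming a Linux host with procps (top, free) and the Docker CLI. top -b stands in for htop here because htop is interactive. The helper names host_usage, rank_by_cpu and top_containers are illustrative and not part of any existing tooling.

```shell
# Host-level view: load, CPU and memory (non-interactive stand-in for htop).
host_usage() {
  top -b -n 1 | head -n 15
  free -h
}

# Read "CPU% MEM% NAME" lines on stdin; print the N highest by CPU (default 5).
rank_by_cpu() {
  LC_ALL=C sort -rn | head -n "${1:-5}"
}

# One sample per running container, busiest first.
top_containers() {
  docker stats --no-stream --format '{{.CPUPerc}} {{.MemPerc}} {{.Name}}' | rank_by_cpu "${1:-5}"
}
```

After pasting these into the shell, host_usage prints the host summary and top_containers 3 lists the three containers using the most CPU.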

Collect as much data as you can (htop, docker stats, the latest logs of the container that consumes the most resources, ...), save it to a file, and then restart the docker compose stack.
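The collection step could be sketched as follows. This is only an illustration under assumptions: the file naming scheme, the 500-line log tail and the function names snapshot_file and collect_snapshot are hypothetical, and top -b is used in place of htop so the output can be written to a file.

```shell
# Print a timestamped file path (hypothetical naming scheme) in the given
# directory, defaulting to /tmp.
snapshot_file() {
  printf '%s/outage-%s.txt\n' "${1:-/tmp}" "$(date +%Y%m%d-%H%M%S)"
}

# Write host usage, container stats and recent logs of one container to a file.
# $1 = output file, $2 = name of the container that uses the most resources.
collect_snapshot() {
  {
    echo "== date ==";            date
    echo "== top ==";             top -b -n 1 | head -n 30
    echo "== memory ==";          free -h
    echo "== docker stats ==";    docker stats --no-stream
    echo "== logs: $2 ==";        docker logs --tail 500 --timestamps "$2"
  } > "$1" 2>&1
}
```

For example, f=$(snapshot_file "$HOME") followed by collect_snapshot "$f" with the container name saves everything to one file. One way to restart the stack afterwards is docker compose restart, run from the directory that contains the compose file (docker-compose restart on installs that still use Compose v1).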

Once the stack has restarted, check that everything is back to normal (CPU, RAM usage, ...).

Also check that the Kalliope sync has no failed jobs (https://app.gitbook.com/o/-MP9Yduzf5xu7wIebqPG/s/-M_5cpNQ4inMjiC5hhyJ/code/deployment-guide#ensure-that-jobs-are-not-blocked).

Notify someone from redpencil of the outage and send them the information you collected.
