[server-admins] MonIt
Troy A. Griffitts
scribe at crosswire.org
Wed Aug 6 11:09:26 EDT 2025
In an effort to keep things running if I get hit by a bus, I've
installed MonIt on the server. It is an ancient service which simply
allows simple regular checks and restart command if something fails.
I've added one for swordweb under:
/etc/monit.d/
Feel free to comment and/or add your own. Please consider that we might
take a service offline for 5-10 minutes while developing or doing
maintenance and the monitor SHOULD NOT trigger on those occasions, so
write your rules accordingly. As a template just use the swordweb
monitor configuration which checks each minute and requires 20 failures
over 30 checks to trigger a restart, which gives us ~20 minutes to do
maintenance.
Hope this helps with some of the site-down issues we've seen recently.
Hope everyone is well. I am sad to not have more time to join together
in work with you more frequently, these days.
May the Lord bless our service for Him,
Troy
More information about the server-admins
mailing list