[server-admins] MonIt

Troy A. Griffitts scribe at crosswire.org
Wed Aug 6 11:09:26 EDT 2025


In an effort to keep things running if I get hit by a bus, I've 
installed MonIt on the server.  It is an ancient service which simply 
runs regular checks and issues a restart command if something fails.  
I've added a check for swordweb under:

/etc/monit.d/

Feel free to comment and/or add your own.  Please consider that we might 
take a service offline for 5-10 minutes while developing or doing 
maintenance and the monitor SHOULD NOT trigger on those occasions, so 
write your rules accordingly.  As a template, just use the swordweb 
monitor configuration, which checks each minute and requires 20 failures 
over 30 checks to trigger a restart.  That gives us ~20 minutes to do 
maintenance.
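
For anyone who has not written a MonIt rule before, a rule of that shape 
might look roughly like the sketch below.  This is NOT a copy of the 
actual swordweb file; the check type, address, port, and start/stop 
commands are all assumptions for illustration, so look at the real file 
under /etc/monit.d/ before copying anything.  The part that matters is 
the "for 20 times within 30 cycles" clause: with a 60-second cycle, a 
restart needs 20 failed checks, so a service can be down for about 20 
minutes before MonIt steps in.

    # Hypothetical sketch only; not the real swordweb rule.
    # Assumes the poll interval is set globally in the main monitrc:
    #   set daemon 60
    check host swordweb with address localhost
        # Assumed commands; use whatever actually starts/stops the service.
        start program = "/usr/bin/systemctl start swordweb"
        stop program  = "/usr/bin/systemctl stop swordweb"
        # Assumed port and protocol.  Restart only after 20 failed
        # checks out of the last 30 cycles.
        if failed port 80 protocol http
            for 20 times within 30 cycles
        then restart

After adding or changing a file, "monit -t" should check the syntax and 
"monit reload" should make the running daemon pick up the change.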

Hope this helps with some of the site-down issues we've seen recently.

Hope everyone is well.  I am sad not to have time to work together 
with you more often these days.

May the Lord bless our service for Him,

Troy



