[server-admins] MonIt

DM Smith dmsmith at crosswire.org
Tue Aug 12 19:08:49 EDT 2025


Troy,
I think Monit is killing Jira on a regular basis. Jira is down at this moment, and I’ve restarted it many times in the last week or so.
I haven’t looked in the Jira logs, but the service status message says it was killed with signal 9 (SIGKILL).
DM

> On Aug 6, 2025, at 11:09 AM, Troy A. Griffitts <scribe at crosswire.org> wrote:
> 
> In an effort to keep things running if I get hit by a bus, I've installed MonIt on the server.  It is an ancient service that simply runs regular checks and a restart command if something fails.  I've added one for swordweb under:
> 
> /etc/monit.d/
> 
> Feel free to comment and/or add your own.  Please consider that we might take a service offline for 5-10 minutes while developing or doing maintenance and the monitor SHOULD NOT trigger on those occasions, so write your rules accordingly.  As a template just use the swordweb monitor configuration which checks each minute and requires 20 failures over 30 checks to trigger a restart, which gives us ~20 minutes to do maintenance.
> 
> Hope this helps with some of the site-down issues we've seen recently.
> 
> Hope everyone is well.  I am sad not to have more time to work together with you these days.
> 
> May the Lord bless our service for Him,
> 
> Troy
> 
> 
> _______________________________________________
> server-admins mailing list
> server-admins at crosswire.org
> http://www.crosswire.org/mailman/listinfo/server-admins
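
For reference, the rule described in the quoted message (a check each minute, with 20 failures over 30 checks before a restart) might look roughly like the sketch below. This is not the actual swordweb file from /etc/monit.d/. The pidfile path, the start and stop commands, and the host and port are all assumptions made for illustration.

```monit
# Hypothetical sketch only: pidfile, commands, host, and port are assumed,
# not copied from the real swordweb configuration.

# Poll interval in seconds; this normally lives in the main monitrc.
set daemon 60

check process swordweb with pidfile /var/run/swordweb.pid
    start program = "/usr/bin/systemctl start swordweb"
    stop program  = "/usr/bin/systemctl stop swordweb"
    # Restart only after 20 failed checks within the last 30 cycles,
    # which leaves roughly 20 minutes for planned maintenance.
    if failed host 127.0.0.1 port 8080 protocol http for 20 times within 30 cycles then restart
```

With a 60-second cycle, 20 failed checks take about 20 minutes to accumulate, so a 5-10 minute maintenance outage would not reach the restart threshold.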


