[server-admins] MonIt

DM Smith dmsmith at crosswire.org
Tue Aug 12 19:25:22 EDT 2025


Well it clearly isn’t Monit’s fault.

> On Aug 12, 2025, at 7:08 PM, DM Smith <dmsmith at crosswire.org> wrote:
> 
> Troy,
> I think this is killing Jira on a regular basis. It is down at this moment. I’ve restarted it many times in the last week or so.
> I haven’t looked in the Jira logs, but the service status message says it was killed with -9.
> DM
> 
>> On Aug 6, 2025, at 11:09 AM, Troy A. Griffitts <scribe at crosswire.org> wrote:
>> 
>> In an effort to keep things running if I get hit by a bus, I've installed MonIt on the server.  It is an ancient service which simply allows simple regular checks and restart command if something fails.  I've added one for swordweb under:
>> 
>> /etc/monit.d/
>> 
>> Feel free to comment and/or add your own.  Please consider that we might take a service offline for 5-10 minutes while developing or doing maintenance and the monitor SHOULD NOT trigger on those occasions, so write your rules accordingly.  As a template just use the swordweb monitor configuration which checks each minute and requires 20 failures over 30 checks to trigger a restart, which gives us ~20 minutes to do maintenance.
>> 
>> Hope this helps with some of the site-down issues we've seen recently.
>> 
>> Hope everyone is well.  I am sad to not have more time to join together in work with you more frequently, these days.
>> 
>> May the Lord bless our service for Him,
>> 
>> Troy
>> 
>> 
>> _______________________________________________
>> server-admins mailing list
>> server-admins at crosswire.org
>> http://www.crosswire.org/mailman/listinfo/server-admins
> 



More information about the server-admins mailing list