[server-admins] MonIt

Troy A. Griffitts scribe at crosswire.org
Wed Aug 13 08:59:01 EDT 2025


Did you find the culprit? Have a look in /var/log/messages for oom and see if we're running out of RAM or something.

On August 13, 2025 1:25:22 AM GMT+02:00, DM Smith <dmsmith at crosswire.org> wrote:
>Well it clearly isn’t Monit’s fault.
>
>> On Aug 12, 2025, at 7:08 PM, DM Smith <dmsmith at crosswire.org> wrote:
>> 
>> Troy,
>> I think this is killing Jira on a regular basis. It is down at this moment. I’ve restarted it many times in the last week or so.
>> I haven’t looked in the Jira logs, but the service status message says it was killed with -9.
>> DM
>> 
>>> On Aug 6, 2025, at 11:09 AM, Troy A. Griffitts <scribe at crosswire.org> wrote:
>>> 
>>> In an effort to keep things running if I get hit by a bus, I've installed MonIt on the server.  It is an ancient service which simply allows simple regular checks and restart command if something fails.  I've added one for swordweb under:
>>> 
>>> /etc/monit.d/
>>> 
>>> Feel free to comment and/or add your own.  Please consider that we might take a service offline for 5-10 minutes while developing or doing maintenance and the monitor SHOULD NOT trigger on those occasions, so write your rules accordingly.  As a template just use the swordweb monitor configuration which checks each minute and requires 20 failures over 30 checks to trigger a restart, which gives us ~20 minutes to do maintenance.
>>> 
>>> Hope this helps with some of the site-down issues we've seen recently.
>>> 
>>> Hope everyone is well.  I am sad to not have more time to join together in work with you more frequently, these days.
>>> 
>>> May the Lord bless our service for Him,
>>> 
>>> Troy
>>> 
>>> 
>>> _______________________________________________
>>> server-admins mailing list
>>> server-admins at crosswire.org
>>> http://www.crosswire.org/mailman/listinfo/server-admins
>> 
>
>_______________________________________________
>server-admins mailing list
>server-admins at crosswire.org
>http://www.crosswire.org/mailman/listinfo/server-admins

-- 
Sent from my Android device with K-9 Mail. Please excuse my brevity.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/server-admins/attachments/20250813/c7d4fb53/attachment.htm>


More information about the server-admins mailing list