[server-admins] MonIt
DM Smith
dmsmith at crosswire.org
Wed Sep 10 20:01:09 EDT 2025
This time I didn’t fall asleep. The culprit is the swordorbserver processes. There were 8-10 created every 10 seconds. I created a loop to spit out the count every 10 seconds. The last count before jira was killed was 1040.
top - 19:56:20 up 4 days, 4:46, 2 users, load average: 1.07, 1.06, 1.41
Tasks: 1504 total, 2 running, 1502 sleeping, 0 stopped, 0 zombie
%Cpu(s): 3.8 us, 3.1 sy, 0.0 ni, 92.5 id, 0.4 wa, 0.0 hi, 0.1 si, 0.0 st
MiB Mem : 78860.2 total, 505.7 free, 78156.8 used, 197.8 buff/cache
MiB Swap: 4883.0 total, 0.0 free, 4883.0 used. 56.6 avail Mem
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
2939079 jira 20 0 13.1g 1.3g 5804 S 0.0 1.7 3:42.35 java
2941639 swordweb 20 0 22.0g 578728 5796 S 18.5 0.7 1:35.40 java
2765973 tomcat 20 0 11.0g 229580 3744 S 0.3 0.3 1:07.09 java
1886 mysql 20 0 8814624 204424 1972 S 6.6 0.3 255:40.49 mariadbd
2930053 crosswi+ 20 0 25.5g 107332 6924 S 0.3 0.1 0:28.18 java
1530 vmrcre 20 0 18.9g 97908 0 S 0.3 0.1 70:37.28 java
2949492 swordweb 20 0 369652 92396 13660 S 0.0 0.1 0:00.43 swordorbserver
2949512 swordweb 20 0 370200 91440 13508 S 0.0 0.1 0:00.45 swordorbserver
2949528 swordweb 20 0 370200 91348 13420 S 0.0 0.1 0:00.44 swordorbserver
2949409 swordweb 20 0 370200 91336 13628 S 0.0 0.1 0:00.46 swordorbserver
> On Sep 10, 2025, at 7:34 PM, DM Smith <dmsmith at crosswire.org> wrote:
>
> I watched “top” sorted by RSS and Jira was at the top. The RSS slowly went up to 1.5G and then 1.6 and finally to 1.7. But processes went from 500 or so to over 1200, when I fell asleep watching it die a third time. Lisa alerted me that I had fallen asleep!
>
> I noticed that 3 of the top processes, all java, were killed. 2 restarted (monit?). The number of processes dropped to around 500 and have been creeping upward and it’s nearly 900 now.
>
> I continued to watch and swordweb died and restarted again.
>
> Hope this helps.
>
> DM
>
>> On Sep 10, 2025, at 4:47 PM, DM Smith <dmsmith at crosswire.org> wrote:
>>
>> And it died again after a few minutes. I restarted it. Not hopeful.
>>
>> — DM
>>
>>> On Sep 10, 2025, at 4:17 PM, DM Smith <dmsmith at crosswire.org> wrote:
>>>
>>> I don’t know how to triage or fix the underlying problem. I’ve restarted it.
>>>
>>> Looking at /var/log/messages, it is a machine OOM. Same as what Troy saw before. I’m guessing that the “OOM Reaper” is picking the biggest memory hogs and killing them. Those are all java processes.
>>>
>>> There are many, many (~200) sword observer processes each taking a paltry 90,000 KB. Perhaps these are the culprits? Is there a bound on the pool of sword observers?
>>>
>>> I also noted that:
>>> When the large java process is killed (which belongs to Jira) that many mariadb connections by jira are terminated.
>>> Jira needs to be updated from its current outdated version. Perhaps the newer version has a better memory footprint?
>>>
>>> — DM
>>>
>>>> On Sep 10, 2025, at 8:11 AM, Karl Kleinpaste <karl at kleinpaste.org> wrote:
>>>>
>>>> On 9/9/25 3:57 PM, DM Smith wrote:
>>>>> I restarted it.
>>>>
>>>> And it's dead again.
>>>>
>>>> Something is evidently more seriously wrong than a mere need to restart.
>>>> _______________________________________________
>>>> server-admins mailing list
>>>> server-admins at crosswire.org
>>>> https://crosswire.org/mailman/listinfo/server-admins
>>>
>>> _______________________________________________
>>> server-admins mailing list
>>> server-admins at crosswire.org
>>> https://crosswire.org/mailman/listinfo/server-admins
>>
>> _______________________________________________
>> server-admins mailing list
>> server-admins at crosswire.org
>> https://crosswire.org/mailman/listinfo/server-admins
>
> _______________________________________________
> server-admins mailing list
> server-admins at crosswire.org
> https://crosswire.org/mailman/listinfo/server-admins
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://crosswire.org/pipermail/server-admins/attachments/20250910/926d6e42/attachment-0001.htm>
More information about the server-admins
mailing list