[SERVER-4201] CLONE - Unable to shut down or kill -9 monogd Created: 03/Nov/11 Updated: 30/Mar/12 Resolved: 11/Nov/11 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | None |
| Affects Version/s: | 1.8.3, 2.0.0, 2.0.1 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Joachim Kainz | Assignee: | Unassigned |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Environment: |
Linux mbl-mdb01 2.6.32.10-90.fc12.x86_64 #1 SMP Tue Mar 23 09:47:08 UTC 2010 x86_64 x86_64 x86_64 GNU/Linux |
||
| Operating System: | Linux |
| Participants: |
| Description |
|
I created a replication set by adding two servers to an existing server with about 250 GB and having it replicate the data. After being in recovery state for a while we see the new server go into a state were CPU usage becomes very low, but the load-average goes to about 200 or more. At this point is it impossible to shut down or kill mongod. kill -9 has no effect. I also noticed that that I cannot cat any file in /proc/<pid> belonging to the mongod process. |
| Comments |
| Comment by Eliot Horowitz (Inactive) [ 11/Nov/11 ] |
|
Let us know if it comes back after upgrading or if you think its a mongo issue. |
| Comment by Joachim Kainz [ 03/Nov/11 ] |
|
Yes, I do. Just found out that the kernel on the machines where we are running mongo has not be patched in about 4 years. I am trying to get my datacenter guys to bring the kernel up-to-date. I let you know if it reoccurs after patching. I personally believe it will not reoccur. |
| Comment by Eliot Horowitz (Inactive) [ 03/Nov/11 ] |
|
Are you doing the kill as the mongod user or root? |
| Comment by Joachim Kainz [ 03/Nov/11 ] |
|
top - 06:24:19 up 14:23, 1 user, load average: 11.05, 8.97, 6.00 PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND |
| Comment by Joachim Kainz [ 03/Nov/11 ] |
|
$ iostat avg-cpu: %user %nice %system %iowait %steal %idle Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn |
| Comment by Joachim Kainz [ 03/Nov/11 ] |
|
$ iostat avg-cpu: %user %nice %system %iowait %steal %idle Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn |