[SERVER-2646] shard hangs on migrate Created: 01/Mar/11  Updated: 30/Mar/12  Resolved: 02/Mar/11

Status: Closed
Project: Core Server
Component/s: Sharding
Affects Version/s: 1.6.5
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Erez Zarum Assignee: Unassigned
Resolution: Cannot Reproduce Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

CentOS 5.5 64bit
64GB RAM
Hardware RAID10 12x146GB 10k (Adaptec 52445 with BBU)


Attachments: Zip Archive 01032011_xx-xxx.zip     Zip Archive 02032011xx-xxx.zip    
Operating System: Linux
Participants:

 Description   

While a shard is doing a migrate, the whole cluster hangs.
The server and the cluster simply hangs, unable to query, cursors gets timeout, all the clients can't access the cluster at all.

I have attached logs.



 Comments   
Comment by Erez Zarum [ 02/Mar/11 ]

new logs along with one application server mongos.log (the one who CursorTimeout)

Comment by Eliot Horowitz (Inactive) [ 02/Mar/11 ]

If you can attach mongos logs can dig into more.

Comment by Eliot Horowitz (Inactive) [ 02/Mar/11 ]

As discussed in mailing list, these logs don't show an issue by themselves.
mongos logs would help

Comment by Eliot Horowitz (Inactive) [ 01/Mar/11 ]

Can you send the errors you got?
From the logs you sent there is no queuing or blocking in either mongod, so I don't see anything that would lead to slow client activity.

Generated at Thu Feb 08 03:00:46 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.