[SERVER-22140] Shard node server crash Created: 12/Jan/16  Updated: 07/Apr/23  Resolved: 10/Feb/16

Status: Closed
Project: Core Server
Component/s: Replication, Sharding
Affects Version/s: 3.2.0
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: flyinflash Assignee: Eric Milkie
Resolution: Incomplete Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: File crash.log    
Operating System: ALL
Steps To Reproduce:
  • Set up a cluster with 1 app server, 3 config servers, and 5 shards;
  • run `db.runCommand( { removeShard: "rs5" } )` on the app server;
  • shard rs4 will crash within a couple of minutes.
Participants:

 Description   

The shard node crashed while the removal state was `draining ongoing`, after the `removeShard` command was run.
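
For context, the draining state is what repeated `removeShard` calls report while chunks are still migrating off the shard. The following is a minimal sketch of polling that reply, not the reporter's actual script. `runCommand` is a hypothetical injected function standing in for running the command on the app server, and `describeDrain` and `pollRemoveShard` are illustrative names. The reply fields used (`ok`, `state`, `remaining.chunks`, `errmsg`) are assumed to match the documented `removeShard` output.

```javascript
// Sketch only. `runCommand` is a hypothetical stand-in for issuing the command
// through the app server (mongos); it is not a driver or shell API.

// Turn one removeShard reply into a short status record.
function describeDrain(reply) {
  if (!reply || reply.ok !== 1) {
    const reason = (reply && reply.errmsg) || "unknown error";
    return { done: false, error: true, summary: "failed: " + reason };
  }
  switch (reply.state) {
    case "started":
      return { done: false, error: false, summary: "started" };
    case "ongoing": {
      // Assumes `remaining.chunks` is a plain number in this sketch.
      const chunks = reply.remaining ? reply.remaining.chunks : "unknown";
      return {
        done: false,
        error: false,
        summary: "ongoing: " + chunks + " chunks remaining",
      };
    }
    case "completed":
      return { done: true, error: false, summary: "completed" };
    default:
      return { done: false, error: true, summary: "failed: unexpected state" };
  }
}

// Re-issue removeShard until draining completes, an error occurs,
// or maxPolls is reached. Returns the list of status summaries.
// A real script would pause between polls instead of looping immediately.
function pollRemoveShard(runCommand, shardName, maxPolls) {
  const history = [];
  for (let i = 0; i < maxPolls; i++) {
    const status = describeDrain(runCommand({ removeShard: shardName }));
    history.push(status.summary);
    if (status.done || status.error) {
      break;
    }
  }
  return history;
}
```

Recording each summary with a timestamp in a real run would show how far draining had progressed when a shard node went down.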



 Comments   
Comment by Ramon Fernandez Marina [ 10/Feb/16 ]

flyinflash@gmail.com, since you were not able to gather the old logs we're going to close this ticket for now. If you experience this problem again please collect the logs requested by Eric above and attach them to this ticket so we can investigate further.

Thanks,
Ramón.

Comment by Eric Milkie [ 14/Jan/16 ]

Hi Shuge,
The logs may be lost, but if you can, please attach the replica set config for this shard, and the command line you're using to start the mongod's for this replica set. I may still be able to glean some information from those.
Thanks!
-Eric

Comment by flyinflash [ 14/Jan/16 ]

Sorry, I cannot supply more info. I restarted the node after it crashed, and all the old logs were overwritten and lost.

Comment by Eric Milkie [ 12/Jan/16 ]

Hi Shuge,
I need more information to determine the cause. Can you please supply the full system logs from each of the nodes in the rs4 replica set? In addition to the log entries near the time of the crash of one of the nodes, I also need the beginning of each of the logs when the mongod processes were first started. Finally, please attach the replica set config for this shard. You can obtain that by running "rs.conf()" in the shell when connecting directly to any node in the replica set rs4.

Comment by flyinflash [ 12/Jan/16 ]

BTW

OS Info: CentOS release 6.4 (Final)
