[SERVER-7465] Crash on the master while creating a replicaset Created: 25/Oct/12  Updated: 19/Mar/13  Resolved: 12/Mar/13

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: 2.2.0
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Ran Tavory Assignee: Michael Grundy
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

$ uname -a
Linux mongo02 2.6.38-8-virtual #42-Ubuntu SMP Mon Apr 11 04:06:34 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux


Operating System: ALL
Participants:

 Description   

I used a single mongodb server and then wanted to create a replicaset to improve durability.

Created a secondary and an arbiter.
then restarted the master in replSet mode.
> rs.initialize()
OK
> rs.reconfig(

{...}

)
OK

wait....
they start synching...

And then out of nowhere, the master node sees:

0xade6e1 0x5582d9 0x558862 0x7f3b3ce47c60 0x7f3b3c185bf6 0x580ee5 0x94e796 0x94227e 0x6b26b9 0xb5ba7d 0xb5d052 0x56fa52 0x5dbf11 0x7f3b3ce3ed8c 0x7f3b3c1e104d
/usr/bin/mongod(_ZN5mongo15printStackTraceERSo+0x21) [0xade6e1]
/usr/bin/mongod(_ZN5mongo10abruptQuitEi+0x399) [0x5582d9]
/usr/bin/mongod(_ZN5mongo24abruptQuitWithAddrSignalEiP7siginfoPv+0x262) [0x558862]
/lib/x86_64-linux-gnu/libpthread.so.0(+0xfc60) [0x7f3b3ce47c60]
/lib/x86_64-linux-gnu/libc.so.6(memcpy+0x296) [0x7f3b3c185bf6]
/usr/bin/mongod(_ZNK5mongo7BSONObj4copyEv+0x45) [0x580ee5]
/usr/bin/mongod(_ZN5mongo11ParsedQuery4initERKNS_7BSONObjE+0x516) [0x94e796]
/usr/bin/mongod(_ZN5mongo11ParsedQueryC1ERNS_12QueryMessageE+0x9e) [0x94227e]
/usr/bin/mongod(ZN5mongo8runQueryERNS_7MessageERNS_12QueryMessageERNS_5CurOpES1+0x39) [0x6b26b9]
/usr/bin/mongod() [0xb5ba7d]
/usr/bin/mongod(_ZN5mongo16assembleResponseERNS_7MessageERNS_10DbResponseERKNS_11HostAndPortE+0x3a2) [0xb5d052]
/usr/bin/mongod(_ZN5mongo16MyMessageHandler7processERNS_7MessageEPNS_21AbstractMessagingPortEPNS_9LastErrorE+0x82) [0x56fa52]
/usr/bin/mongod(_ZN5mongo3pms9threadRunEPNS_13MessagingPortE+0x411) [0x5dbf11]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x6d8c) [0x7f3b3ce3ed8c]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7f3b3c1e104d]

This happend a few times during the initial sync process. The sync isn't over, it's still running...

On another set of mongodb servers I did exactly the same, but there it went smoothly and the replicaset was created successfully.

I'm still running the faulting server in hope that it would finish its replication to the secondary (it'll take a few hours) but anyway, the crash itself is bad enough to report...



 Comments   
Comment by Ran Tavory [ 11/Mar/13 ]

Hi Mike, sorry I don't work there anymore...


/Ran
http://tavory.com

Comment by Michael Grundy [ 11/Mar/13 ]

Hi -

Have you seen any recurrences of this issue? I noticed you were running version 2.2.0 when you initially posted this. We've fixed some critical bugs in 2.2.0 since then and recommend you update to the latest stable release of 2.2, which is 2.2.3, as soon as possible.

If you are still seeing this issue with 2.2.3, I'd recommend posting on one of our free support channels. Please include larger chunks of the log before and after the stack trace. Filing tickets under the SERVER project is not the best way to get visibility for issues you encounter, as we use this project for tracking bugs and feature requests. If you have a support contract with 10gen, then filing commercial support tickets is your best option. If not, the 10gen development and support teams actively monitor posts to stackoverflow (http://stackoverflow.com/questions/tagged/mongodb) and to the mongo google group (https://groups.google.com/forum/?fromgroups#!forum/mongodb-user)

Mike

Generated at Thu Feb 08 03:14:37 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.