[SERVER-6029] Replication failing - replSet syncTail: 0 assertion db/pdfile.cpp:398 Created: 07/Jun/12  Updated: 15/Aug/12  Resolved: 11/Jun/12

Status: Closed
Project: Core Server
Component/s: Replication
Affects Version/s: 2.0.5
Fix Version/s: None

Type: Bug Priority: Critical - P2
Reporter: Colin Howe Assignee: Kristina Chodorow (Inactive)
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Operating System: ALL
Participants:

 Description   

We've just restored a member from a backup and are trying to get it to catch up by playing the oplog. It is now spewing this into the logs:

Thu Jun  7 08:58:22 [conn37]   Assertion failure sz <= 0x7fffffff db/pdfile.cpp 398
0x57a8a6 0x5853eb 0x8a8ef6 0x8a0962 0x8a1d03 0x8a1fda 0x8a2835 0x898619 0x898ea3 0x942dcd 0x889107 0x88dcd9 0xaa0bc8 0x6389f7 0x7fa77063ed8c 0x7fa76fbe904d 
 /usr/bin/mongod(_ZN5mongo12sayDbContextEPKc+0x96) [0x57a8a6]
 /usr/bin/mongod(_ZN5mongo8assertedEPKcS1_j+0xfb) [0x5853eb]
 /usr/bin/mongod(_ZN5mongo13MongoDataFile4openEPKcib+0x406) [0x8a8ef6]
 /usr/bin/mongod(_ZN5mongo8Database7getFileEiib+0x102) [0x8a0962]
 /usr/bin/mongod(_ZN5mongo8Database12openAllFilesEv+0x33) [0x8a1d03]
 /usr/bin/mongod(_ZN5mongo8DatabaseC1EPKcRbRKSs+0x1ca) [0x8a1fda]
 /usr/bin/mongod(_ZN5mongo14DatabaseHolder11getOrCreateERKSsS2_Rb+0x605) [0x8a2835]
 /usr/bin/mongod(_ZN5mongo6Client7Context11_finishInitEb+0x2e9) [0x898619]
 /usr/bin/mongod(_ZN5mongo6Client7ContextC1ERKSsSsPNS_9mongolockEb+0x73) [0x898ea3]
 /usr/bin/mongod(_ZN5mongo8runQueryERNS_7MessageERNS_12QueryMessageERNS_5CurOpES1_+0xadd) [0x942dcd]
 /usr/bin/mongod() [0x889107]
 /usr/bin/mongod(_ZN5mongo16assembleResponseERNS_7MessageERNS_10DbResponseERKNS_11HostAndPortE+0x559) [0x88dcd9]
 /usr/bin/mongod(_ZN5mongo16MyMessageHandler7processERNS_7MessageEPNS_21AbstractMessagingPortEPNS_9LastErrorE+0x78) [0xaa0bc8]
 /usr/bin/mongod(_ZN5mongo3pms9threadRunEPNS_13MessagingPortE+0x287) [0x6389f7]
 /lib/x86_64-linux-gnu/libpthread.so.0(+0x6d8c) [0x7fa77063ed8c]
 /lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7fa76fbe904d]
Thu Jun  7 08:58:22 [conn37] assertion 0 assertion db/pdfile.cpp:398 ns:conversocial.content query:{ $query: { by_source: false, status: "R", source: 33046, created_date: { $lte: new Date(1339019999517), $gt: new Date(1336255200517) } }, $orderby: { created_date: -1, _id: -1 }, $hint: { source: 1, status: 1, created_date: -1, _id: -1 } }

Please advise.



 Comments   
Comment by Kristina Chodorow (Inactive) [ 11/Jun/12 ]

Unfortunately, at the moment there is no way for MongoDB to check this other than going through every document in every data file. You might want to watch these issues: SERVER-2039, SERVER-2350.

Comment by Colin Howe [ 09/Jun/12 ]

It's possible the backup didn't copy across properly - we copied the entire backup again and it worked fine.

However, it would be useful if the logs said this instead of just spewing assertions.

Comment by Kristina Chodorow (Inactive) [ 08/Jun/12 ]

That means that you didn't get a clean backup, how are you creating it?

Generated at Thu Feb 08 03:10:34 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.