[SERVER-15853] SIGSEGV in replFetcher code Created: 28/Oct/14  Updated: 11/Jul/16  Resolved: 29/Oct/14

Status: Closed
Project: Core Server
Component/s: Replication
Affects Version/s: None
Fix Version/s: 2.7.8

Type: Bug Priority: Major - P3
Reporter: Davide Italiano Assignee: Scott Hernandez (Inactive)
Resolution: Done Votes: 0
Labels: 28qa
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Related
Operating System: ALL
Participants:

 Description   

Doesn't happen in 2.6.

Upstream version:

commit 0e9cb3b20498b9f167afaff7a5c4a4d1da7e06a2
Author: Dan Pasette <dan@10gen.com>
Date:   Wed Oct 22 14:10:37 2014 -0400
 
    BUMP 2.7.8

Relevant bits in the log:

2014-10-28T23:19:30.836+0000 I INDEXING [rsSync] allocating new ns file /home/ec2-user//mongo/repl2/thisdb1.ns, filling with zeroes...
2014-10-28T23:19:30.951+0000 I STORAGE  [FileAllocator] allocating new datafile /home/ec2-user//mongo/repl2/thisdb1.0, filling with zeroes...
2014-10-28T23:19:30.953+0000 I STORAGE  [FileAllocator] done allocating datafile /home/ec2-user//mongo/repl2/thisdb1.0, size: 64MB,  took 0.002 secs
2014-10-28T23:19:30.955+0000 I INDEXING [rsSync] build index on: thisdb1.mycoll1 properties: { v: 1, key: { _id: 1 }, name: "_id_", ns: "thisdb1.mycoll1" }
2014-10-28T23:19:30.955+0000 I INDEXING [rsSync]         building index using bulk method
2014-10-28T23:19:30.955+0000 I INDEXING [rsSync] build index done.  scanned 19 total records. 0 secs
2014-10-28T23:19:30.955+0000 I REPLSETS [rsSync] initial sync cloning db: thisdb10
2014-10-28T23:19:30.959+0000 I INDEXING [rsSync] allocating new ns file /home/ec2-user//mongo/repl2/thisdb10.ns, filling with zeroes...
2014-10-28T23:19:31.081+0000 I STORAGE  [FileAllocator] allocating new datafile /home/ec2-user//mongo/repl2/thisdb10.0, filling with zeroes...
2014-10-28T23:19:31.083+0000 I STORAGE  [FileAllocator] done allocating datafile /home/ec2-user//mongo/repl2/thisdb10.0, size: 64MB,  took 0.001 secs
2014-10-28T23:19:31.085+0000 I INDEXING [rsSync] build index on: thisdb10.mycoll10 properties: { v: 1, key: { _id: 1 }, name: "_id_", ns: "thisdb10.mycoll10" }
2014-10-28T23:19:31.085+0000 I INDEXING [rsSync]         building index using bulk method
2014-10-28T23:19:31.086+0000 I INDEXING [rsSync] build index done.  scanned 87 total records. 0 secs
2014-10-28T23:19:31.086+0000 I REPLSETS [rsSync] initial sync cloning db: thisdb15
2014-10-28T23:19:31.089+0000 I INDEXING [rsSync] allocating new ns file /home/ec2-user//mongo/repl2/thisdb15.ns, filling with zeroes...
2014-10-28T23:19:31.224+0000 I STORAGE  [FileAllocator] allocating new datafile /home/ec2-user//mongo/repl2/thisdb15.0, filling with zeroes...
2014-10-28T23:19:31.226+0000 I STORAGE  [FileAllocator] done allocating datafile /home/ec2-user//mongo/repl2/thisdb15.0, size: 64MB,  took 0.001 secs
2014-10-28T23:19:31.229+0000 I INDEXING [rsSync] build index on: thisdb15.mycoll15 properties: { v: 1, key: { _id: 1 }, name: "_id_", ns: "thisdb15.mycoll15" }
2014-10-28T23:19:31.229+0000 I INDEXING [rsSync]         building index using bulk method
2014-10-28T23:19:31.229+0000 I INDEXING [rsSync] build index done.  scanned 70 total records. 0 secs
2014-10-28T23:19:31.229+0000 I REPLSETS [rsSync] initial sync data copy, starting syncup
2014-10-28T23:19:31.229+0000 I REPLSETS [rsSync] oplog sync 1 of 3
2014-10-28T23:19:44.087+0000 F -        [repl prefetch worker] Invalid access at address: 0
2014-10-28T23:19:44.091+0000 F -        [repl prefetch worker] Got signal: 11 (Segmentation fault).
 
 0xf23939 0xf234f2 0xf2381e 0x7fb51092f5b0 0x90a844 0xb71552 0xca8c86 0xec7b1b 0xf70464 0x7fb510927f18 0x7fb50fa39b9d
----- BEGIN BACKTRACE -----
{"backtrace":[{"b":"400000","o":"B23939"},{"b":"400000","o":"B234F2"},{"b":"400000","o":"B2381E"},{"b":"7FB510920000","o":"F5B0"},{"b":"400000","o":"50A844"},{"b":"400000","o":"771552"},{"b":"400000","o":"8A8C86"},{"b":"400000","o":"AC7B1B"},{"b":"400000","o":"B70464"},{"b":"7FB510920000","o":"7F18"},{"b":"7FB50F957000","o":"E2B9D"}],"processInfo":{ "mongodbVersion" : "2.7.8", "gitVersion" : "0e9cb3b20498b9f167afaff7a5c4a4d1da7e06a2", "uname" : { "sysname" : "Linux", "release" : "3.10.42-52.145.amzn1.x86_64", "version" : "#1 SMP Tue Jun 10 23:46:43 UTC 2014", "machine" : "x86_64" }, "somap" : [ { "elfType" : 2, "b" : "400000", "buildId" : "FE60E5821707057A6DA7704117413DA4FD0406B6" }, { "b" : "7FFFC8AFE000", "elfType" : 3, "buildId" : "422398C00613A4323A5E0C656ED6BE4BCFF5FD15" }, { "b" : "7FB510920000", "path" : "/lib64/libpthread.so.0", "elfType" : 3, "buildId" : "A8E8015F84374807F6920DF95D16BBAAAC02AB41" }, { "b" : "7FB510718000", "path" : "/lib64/librt.so.1", "elfType" : 3, "buildId" : "CC3700E5EACA3A520D1FD1BA3B469F2EFB1B8F11" }, { "b" : "7FB510514000", "path" : "/lib64/libdl.so.2", "elfType" : 3, "buildId" : "3D5EB706F159BCCB9A3B3E5F528B512E2C500F90" }, { "b" : "7FB510210000", "path" : "/usr/lib64/libstdc++.so.6", "elfType" : 3, "buildId" : "DD6383EEAC49E9BAA9E3D1080AE932F42CF8A385" }, { "b" : "7FB50FF12000", "path" : "/lib64/libm.so.6", "elfType" : 3, "buildId" : "226E54A756FDC288551092B4FDE818C8D6D6FC04" }, { "b" : "7FB50FCFC000", "path" : "/lib64/libgcc_s.so.1", "elfType" : 3, "buildId" : "C52958E393BDF8E8D090F36DE0F4E620D8736FBF" }, { "b" : "7FB50F957000", "path" : "/lib64/libc.so.6", "elfType" : 3, "buildId" : "AD1B9E9119DEDF5A95178C0EA240EEBDE71147B6" }, { "b" : "7FB510B3C000", "path" : "/lib64/ld-linux-x86-64.so.2", "elfType" : 3, "buildId" : "37F70DE2ABEA184F24518EB5CC538FF0F54E081F" } ] }}
 mongod(_ZN5mongo15printStackTraceERSo+0x29) [0xf23939]
 mongod(+0xB234F2) [0xf234f2]
 mongod(+0xB2381E) [0xf2381e]
 libpthread.so.0(+0xF5B0) [0x7fb51092f5b0]
 mongod(_ZN5mongo8Database13getCollectionEPNS_16OperationContextERKNS_10StringDataE+0x24) [0x90a844]
 mongod(_ZN5mongo4repl28prefetchPagesForReplicatedOpEPNS_16OperationContextEPNS_8DatabaseERKNS_7BSONObjE+0x152) [0xb71552]
 mongod(_ZN5mongo4repl8SyncTail10prefetchOpERKNS_7BSONObjE+0xB6) [0xca8c86]
 mongod(_ZN5mongo10threadpool6Worker4loopEv+0x2FB) [0xec7b1b]
 mongod(+0xB70464) [0xf70464]
 libpthread.so.0(+0x7F18) [0x7fb510927f18]
 libc.so.6(clone+0x6D) [0x7fb50fa39b9d]
-----  END BACKTRACE  -----



 Comments   
Comment by Davide Italiano [ 29/Oct/14 ]

Fixed in master.

Comment by Davide Italiano [ 29/Oct/14 ]

milkie working on a repro.

Comment by Eric Milkie [ 29/Oct/14 ]

There isn't enough context to diagnose; are there reproducer steps or can you explain further what was special about what you were doing when the crash occurred?

Generated at Thu Feb 08 03:39:11 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.