- Type: Question
- Resolution: Community Answered
- Priority: Major - P3
- Affects Version/s: None
- Component/s: None
- Labels: None
From the log below, the initial sync attempt failed multiple times with the same MaxTimeMSExpired error, and the server finally shut down. Could you help me find the reason for this?
2020-07-27T21:47:45.305+0000 E - [replication-11] Assertion: BSONObjectTooLarge: BSONObj size: 36469414 (0x22C7AA6) is invalid. Size must be between 0 and 16793600(16MB) First element: failedInitialSyncAttempts: 9 src/mongo/bson/bsonobj.cpp 98
2020-07-27T21:47:45.305+0000 I INITSYNC [replication-11] Error creating initial sync progress object: BSONObjectTooLarge: BSONObj size: 36469414 (0x22C7AA6) is invalid. Size must be between 0 and 16793600(16MB) First element: failedInitialSyncAttempts: 9
2020-07-27T21:47:45.305+0000 I INITSYNC [replication-11] Initial Sync Attempt Statistics: { failedInitialSyncAttempts: 9, maxFailedInitialSyncAttempts: 10, initialSyncStart: new Date(1595879408073), initialSyncAttempts: [Unknown macro: { durationMillis}, { durationMillis: 1449358, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 82011, status: "NetworkInterfaceExceededTimeLimit: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: Request 48...", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 1268589, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 100113, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 109455, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 132139, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 120688, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 81710, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: 
caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" } ], fetchedMissingDocs: 0, appliedOps: 0, initialSyncOplogStart: Timestamp(1595884194, 9), initialSyncOplogFetchingStart: Timestamp(1595884060, 1) }
2020-07-27T21:47:45.305+0000 E INITSYNC [replication-11] Initial sync attempt failed – attempts left: 0 cause: MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit
2020-07-27T21:47:45.305+0000 F INITSYNC [replication-11] The maximum number of retries have been exhausted for initial sync.
2020-07-27T21:47:45.305+0000 I STORAGE [replication-11] Finishing collection drop for local.temp_oplog_buffer (11ccdcf7-894f-4b8e-9323-2f354c26cbc4).
2020-07-27T21:47:45.307+0000 E REPL [replication-11] Initial sync failed, shutting down now. Restart the server to attempt a new initial sync.
2020-07-27T21:47:45.307+0000 F - [replication-11] Fatal assertion 40088 MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit at src/mongo/db/repl/replication_coordinator_impl.cpp 745
2020-07-27T21:47:45.307+0000 F - [replication-11] ***aborting after fassert() failure
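
To check whether every attempt really failed with the same error, the `status` fields in the "Initial Sync Attempt Statistics" line can be tallied by error code. The following is only a minimal sketch: the function name `tally_status_codes` is made up for illustration, and `sample` is an abbreviated excerpt of three attempts from the log above, not the full line.

```python
import re
from collections import Counter

# Captures the error code that starts each `status: "..."` field,
# e.g. MaxTimeMSExpired or NetworkInterfaceExceededTimeLimit.
STATUS_CODE = re.compile(r'status: "(\w+):')


def tally_status_codes(log_text: str) -> Counter:
    """Count how often each error code appears in the attempt statistics."""
    return Counter(STATUS_CODE.findall(log_text))


# Abbreviated excerpt of the statistics line (messages shortened for illustration).
sample = (
    '{ durationMillis: 1449358, status: "MaxTimeMSExpired: error fetching oplog during initial sync", '
    'syncSource: "infra1.sd.tusimple.ai:27017" }, '
    '{ durationMillis: 82011, status: "NetworkInterfaceExceededTimeLimit: error fetching oplog during initial sync", '
    'syncSource: "infra1.sd.tusimple.ai:27017" }, '
    '{ durationMillis: 1268589, status: "MaxTimeMSExpired: error fetching oplog during initial sync", '
    'syncSource: "infra1.sd.tusimple.ai:27017" }'
)

counts = tally_status_codes(sample)
for code, n in counts.most_common():
    print(code, n)
```

Running the same function over the full mongod log text (read from a file) would show how the failures are distributed across error codes.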