Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-50067

Initial sync attempt failed multiple times and throw MaxTimeMSExpired error

    • Type: Icon: Question Question
    • Resolution: Community Answered
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • Labels:
      None

      From the log below, the initial sync attempt failed multiple times by the same MaxTimeMSExpired error and shut down finally. Could you help me to find the reason for this?

      2020-07-27T21:47:45.305+0000 E - [replication-11] Assertion: BSONObjectTooLarge: BSONObj size: 36469414 (0x22C7AA6) is invalid. Size must be between 0 and 16793600(16MB) First element: failedInitialSyncAttempts: 9 src/mongo/bson/bsonobj.cpp 98
      2020-07-27T21:47:45.305+0000 I INITSYNC [replication-11] Error creating initial sync progress object: BSONObjectTooLarge: BSONObj size: 36469414 (0x22C7AA6) is invalid. Size must be between 0 and 16793600(16MB) First element: failedInitialSyncAttempts: 9
      2020-07-27T21:47:45.305+0000 I INITSYNC [replication-11] Initial Sync Attempt Statistics: { failedInitialSyncAttempts: 9, maxFailedInitialSyncAttempts: 10, initialSyncStart: new Date(1595879408073), initialSyncAttempts: [

      Unknown macro: { durationMillis}

      , { durationMillis: 1449358, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 82011, status: "NetworkInterfaceExceededTimeLimit: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: Request 48...", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 1268589, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 100113, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 109455, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 132139, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 120688, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" }, { durationMillis: 81710, status: "MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit", syncSource: "infra1.sd.tusimple.ai:27017" } ], fetchedMissingDocs: 0, appliedOps: 0, initialSyncOplogStart: Timestamp(1595884194, 9), initialSyncOplogFetchingStart: Timestamp(1595884060, 1) }
      2020-07-27T21:47:45.305+0000 E INITSYNC [replication-11] Initial sync attempt failed – attempts left: 0 cause: MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit
      2020-07-27T21:47:45.305+0000 F INITSYNC [replication-11] The maximum number of retries have been exhausted for initial sync.
      2020-07-27T21:47:45.305+0000 I STORAGE [replication-11] Finishing collection drop for local.temp_oplog_buffer (11ccdcf7-894f-4b8e-9323-2f354c26cbc4).
      2020-07-27T21:47:45.307+0000 E REPL [replication-11] Initial sync failed, shutting down now. Restart the server to attempt a new initial sync.
      2020-07-27T21:47:45.307+0000 F - [replication-11] Fatal assertion 40088 MaxTimeMSExpired: error fetching oplog during initial sync :: caused by :: error in fetcher batch callback :: caused by :: operation exceeded time limit at src/mongo/db/repl/replication_coordinator_impl.cpp 745
      2020-07-27T21:47:45.307+0000 F - [replication-11]

      ***aborting after fassert() failure

            Assignee:
            dmitry.agranat@mongodb.com Dmitry Agranat
            Reporter:
            zhuoran.dong@tusimple.ai Zhuoran Dong
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: