Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-60803

Primary node turns to ROLLBACK state permanently

    • Type: Icon: Bug Bug
    • Resolution: Duplicate
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: 4.2.15
    • Component/s: None
    • Labels:
      None
    • ALL
      1. The primary node is down with some data unable to sync to secondary nodes.
      2. Some new data writes to the new primary node and sync to the rest of the replica set.
      3. Restart the former primary node.

      We have a MongoDB cluster host on-premise on AWS, containing 1 primary node and 2 secondary nodes, on 3 r5 EC2 instances. Due to some heavy workloads, the primary node's memory utilization reached 100% and then the instance crashed.

      After rebooting the instance, we restart the MongoDB, one of the secondary nodes became the primary as expected. Then the former primary node turned into ROLLBACK state. We noticed the docs on https://docs.mongodb.com/manual/core/replica-set-rollbacks/ that this is because secondaries can not keep up with the throughput of operations on the former primary. However, it stuck at the state after several rollback files were created under the rollback folder, and after that, we did not notice any new rollback activities on the log.

      In the end, we stopped MongoDB, cleared all data on the node, and started again to sync data from the replica set.

            Assignee:
            Unassigned Unassigned
            Reporter:
            zijun.tian@tusimple.ai Zijun Tian
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Created:
              Updated:
              Resolved: