Two primary nodes found in one replica set.

XMLWordPrintableJSON

    • Type: Bug
    • Resolution: Done
    • Priority: Major - P3
    • None
    • Affects Version/s: 4.2.17
    • Component/s: None
    • None
    • ALL
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      We have a replica set of 5 nodes.

      From the log, the primary node was handling some aggregation which caused the CPU utilization to 100% in a short time. And after 10 seconds of no-respond from primary, an election started and a new primary was elected. The result sent to the previous primary node, and the log showed the previous primary stepped down and changed the state to secondary. However, the state did not change due to an unknown reason, and when we use `rs.status()` command on any node in the cluster, we can find two primary nodes at the same time (although other 3 secondary nodes sync to the new primary)

      As a result, some users using PyMongo to connect to the cluster met with connection issues while some users did not. I guess it's because some users connected to the wrong primary node (the previous one)

      We tried to remove the previous primary and added it back, there would still be two primary nodes. We had to reboot the previous primary and added it back to the cluster, this time it turned to be rollback state, after several minutes, it became a secondary.

            Assignee:
            Edwin Zhou
            Reporter:
            Zijun Tian
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated:
              Resolved: