  1. Core Server
  2. SERVER-50318

Only restart scheduled heartbeats

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major - P3
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.8.0, 4.9.0, 4.4.4
    • Component/s: Replication
    • Labels: None
    • Backwards Compatibility: Fully Compatible
    • Operating System: ALL
    • Backport Requested: v4.4
    • Sprint: Repl 2020-09-07, Repl 2020-09-21, Repl 2020-10-05, Repl 2020-10-19, Repl 2020-11-02, Repl 2020-11-16
    • Linked BF Score: 19

      Description

      After SERVER-29030, a node cancels its own heartbeat requests when it receives a heartbeat request that announces a new primary. Because the node does not update its own knowledge of the primary when it receives a heartbeat request, later requests announcing the same primary still look new to it, so it appears possible for the node to keep scheduling and cancelling its heartbeat requests indefinitely. As a result, a node in initial sync may be unable to find a sync source, because it has not successfully received 2N heartbeats from the other nodes, and the node eventually shuts down.
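
      The loop described above can be illustrated with a toy simulation. This is only a sketch under stated assumptions, not the server's replication code: `ToyNode`, `simulate`, the tick-based timing, and the `update_primary_on_request` switch are all hypothetical names invented for illustration. The switch exists only to contrast the reported behaviour with a node that does remember the announced primary; it is not a description of the fix that was applied.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyNode:
    """Toy model of one node's outgoing heartbeat to a single peer."""

    round_trip: int                   # ticks a heartbeat needs to complete
    update_primary_on_request: bool   # hypothetical switch, not a server option
    known_primary: Optional[str] = None
    elapsed: int = 0                  # ticks the current heartbeat has been pending
    completed: int = 0                # heartbeats that finished successfully
    cancelled: int = 0                # heartbeats cancelled and restarted

    def on_heartbeat_request(self, announced_primary: str) -> None:
        # A request that names a primary we do not know about cancels and
        # restarts our own pending heartbeat.
        if announced_primary != self.known_primary:
            self.cancelled += 1
            self.elapsed = 0
            if self.update_primary_on_request:
                self.known_primary = announced_primary

    def tick(self) -> None:
        # Advance the pending heartbeat by one tick; finish it when due.
        self.elapsed += 1
        if self.elapsed >= self.round_trip:
            self.completed += 1
            self.elapsed = 0


def simulate(update_primary_on_request: bool, ticks: int = 20,
             round_trip: int = 3, request_interval: int = 2) -> ToyNode:
    """Run the toy node while a peer keeps announcing the same primary."""
    node = ToyNode(round_trip=round_trip,
                   update_primary_on_request=update_primary_on_request)
    for t in range(ticks):
        if t % request_interval == 0:
            node.on_heartbeat_request("nodeB")
        node.tick()
    return node


if __name__ == "__main__":
    stuck = simulate(update_primary_on_request=False)
    print("without update:", stuck.completed, "completed,", stuck.cancelled, "cancelled")
    steady = simulate(update_primary_on_request=True)
    print("with update:", steady.completed, "completed,", steady.cancelled, "cancelled")
```

      In this toy setup, incoming requests arrive more often than a heartbeat can complete, so the node that never records the announced primary cancels on every request and completes no heartbeats, while the node that records it cancels once and then proceeds normally.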

              People

              Assignee: Xuerui Fa (xuerui.fa)
              Reporter: Xuerui Fa (xuerui.fa)