Core Server / SERVER-57612

Investigate whether we can perform a fake disk write in a dry-run election

    Details

    • Type: Task
    • Status: Open
    • Priority: Major - P3
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: Backlog
    • Component/s: None

      Description

      In a real election, a voting node attempts to store its last vote document on disk. If disk operations are failing on that node, the write does not complete and the real election can never succeed. Dry-run elections perform no disk operation, so they can still succeed. We can therefore get into a state where a node repeatedly votes 'yes' in the dry-run election but times out (due to the disk operation) in the real election. Because every real election is run in a new term, this can cause the term to escalate quickly to a very high number.
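The escalation described above can be illustrated with a toy model. This is only a sketch under simplifying assumptions, not server code: there is a single candidate that needs one extra vote, the candidate bumps its term by one each time the dry run passes, and the names `dry_run_vote`, `real_vote`, and `term_after_attempts` are hypothetical.

```python
def dry_run_vote(voter_disk_ok: bool) -> bool:
    # Today's behaviour as described in this ticket: the dry-run vote
    # never touches the disk, so disk health is ignored.
    return True


def real_vote(voter_disk_ok: bool) -> bool:
    # The real vote has to persist the last vote document. In this model
    # a failing disk means the vote times out, which counts as "no vote".
    return voter_disk_ok


def term_after_attempts(voter_disk_ok: bool, attempts: int, start_term: int = 1) -> int:
    """Return the candidate's term after `attempts` election timeouts."""
    term = start_term
    for _ in range(attempts):
        if not dry_run_vote(voter_disk_ok):
            continue  # dry run lost: the term stays where it is
        term += 1  # dry run won: the real election runs in a new term
        if real_vote(voter_disk_ok):
            break  # real election won: the candidate becomes primary
    return term
```

With a healthy voter the term advances once and the election completes. With a voter whose disk is failing, the term advances on every attempt: `term_after_attempts(False, 100)` returns 101.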

      As suggested, we should investigate whether we can perform a 'fake' disk write in the dry-run election, so that a node whose disk operations are failing also fails the dry-run election.
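One possible shape for such a 'fake' write is sketched below. It is an illustration only and makes several assumptions: the probe is a small throwaway file that is written, synced, and deleted; the helper names `disk_write_probe` and `grant_dry_run_vote` are hypothetical; and the server's actual storage layer is not modelled. The sketch only detects writes that fail with an error. A disk that hangs, as in the timeout described above, would additionally need a deadline around the probe.

```python
import os
import tempfile


def disk_write_probe(directory: str) -> bool:
    """Write and fsync a small throwaway file; return False on any OS error."""
    try:
        fd, path = tempfile.mkstemp(dir=directory, prefix="dryrun-probe-")
    except OSError:
        return False  # could not even create the probe file
    try:
        os.write(fd, b"probe")
        os.fsync(fd)  # force the bytes to stable storage
        return True
    except OSError:
        return False
    finally:
        os.close(fd)
        try:
            os.unlink(path)  # the probe file is never kept
        except OSError:
            pass


def grant_dry_run_vote(would_grant: bool, data_dir: str) -> bool:
    # Hypothetical policy: vote 'yes' in the dry run only if the normal
    # voting rules allow it AND the disk probe succeeds.
    return would_grant and disk_write_probe(data_dir)
```

With a check like this, a node whose probe fails would answer 'no' in the dry run, so the candidate would not go on to start a real election that the same node cannot complete.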

            People

            Assignee:
            backlog-server-repl Backlog - Replication Team
            Reporter:
            ali.mir Ali Mir