Core Server / SERVER-42454

Wait for node to find appropriate sync source before disabling failpoint in do_not_advance_commit_point_beyond_last_applied_term.js

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major - P3
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.0.13, 4.2.1, 4.3.1
    • Component/s: Replication
    • Labels:
      None
    • Backwards Compatibility:
      Fully Compatible
    • Operating System:
      ALL
    • Backport Requested:
      v4.2, v4.0
    • Sprint:
      Repl 2019-08-12
    • Linked BF Score:
      16

      Description

      Currently, the stopReplProducerOnDocument failpoint silently fails and ignores the batch of downloaded operations. In most cases, we expect that the document that triggers the failpoint is never applied. However, because we continue to receive responses from the fetcher while the failpoint is enabled, a fetcher response may already be in the middle of being processed when we disable the failpoint. This can cause a document to be applied when we don't expect it to be.

      In do_not_advance_commit_point_beyond_last_applied_term.js, the term 3 document can end up being replicated onto node E, which we don't expect. This causes node B to sync from node E, which our test doesn't account for. We should instead wait for node E to sync from node B before disabling the stopReplProducerOnDocument failpoint.
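
      The proposed ordering (wait for the sync source, then disable the failpoint) could be sketched roughly as follows. This is only an illustration, not the actual patch: waitForSyncSource, waitThenDisableFailPoint, getStatus and disableFailPoint are hypothetical names, getStatus stands in for running replSetGetStatus against node E, and the assumption that the sync source is reported as syncSourceHost or syncingTo depends on the server version.

```javascript
// Hypothetical helper: poll until the node reports the expected sync source.
// `getStatus` is an assumed stand-in for running replSetGetStatus on the node
// and returning its reply as a plain object.
function waitForSyncSource(getStatus, expectedHost, maxAttempts = 50) {
    for (let attempt = 1; attempt <= maxAttempts; attempt++) {
        const status = getStatus();
        // Assumption: the reply names the sync source in `syncSourceHost`
        // or, on older versions, `syncingTo`.
        const source = status.syncSourceHost || status.syncingTo || "";
        if (source === expectedHost) {
            return attempt;  // number of polls it took
        }
        // A real test would sleep between polls (or use a retry helper)
        // instead of looping immediately.
    }
    throw new Error(`node never chose ${expectedHost} as its sync source`);
}

// Hypothetical ordering for the test: only disable the failpoint once the
// node (node E) is confirmed to be syncing from the expected source (node B).
function waitThenDisableFailPoint(getStatus, expectedHost, disableFailPoint) {
    waitForSyncSource(getStatus, expectedHost);
    disableFailPoint();
}
```

      In the real test, getStatus would presumably wrap a replSetGetStatus call on node E and disableFailPoint would turn stopReplProducerOnDocument off on that node; the point of the sketch is only that the failpoint is not touched until the expected sync source is observed.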

              People

              Assignee:
              jason.chan Jason Chan
              Reporter:
              jason.chan Jason Chan