Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-73916

Improve ReshardingTest fixture error reporting when reshardCollection has already failed before any failpoints are waited on

    • Sharding NYC
    • Fully Compatible
    • v6.0, v5.0
    • Sharding NYC 2023-02-20
    • 49

      To avoid hanging or crashing the mongo shell process, the ReshardingTest fixture goes through some lengths to interrupt the reshardCollection command on mongos and join the background thread in the mongo shell which was running the reshardCollection command. However, after the changes from 0d5fd57 as part of SERVER-61985, the reshardCollection command may be blocked on the reshardingPauseCoordinatorBeforeCompletion failpoint without having waited on the interceding reshardingPauseCoordinatorBeforeBlockingWrites and reshardingPauseCoordinatorBeforeDecisionPersisted failpoints. This means the _commandDoneSignal won't be decremented to break the ReshardingTest fixture out of _waitForFailPoint(). Instead the mongo shell reports a scenario where the reshardCollection will have definitively failed as an assert.soon() failpoint of not reaching the expected failpoint within the allotted time.

            Assignee:
            max.hirschhorn@mongodb.com Max Hirschhorn
            Reporter:
            max.hirschhorn@mongodb.com Max Hirschhorn
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: