Race condition in the TimeoutAbortsAwaitReplication leading the test to hang until timing out

XMLWordPrintableJSON

    • Type: Bug
    • Resolution: Fixed
    • Priority: Major - P3
    • 6.0.0-rc0
    • Affects Version/s: None
    • Component/s: None
    • None
    • Fully Compatible
    • ALL
    • Hide

      run the shard_split_donor_service_test unit test and change the 
      shardSplitTimeoutMS in the `TimeoutAbortsAwaitReplication` to 100

      Show
      run the shard_split_donor_service_test unit test and change the  shardSplitTimeoutMS in the `TimeoutAbortsAwaitReplication` to 100
    • Server Serverless 2022-03-07
    • 51
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      It was found that there is a race condition on the windows environment that leads the test to hang forever waiting for the pauseShardSplitAfterBlocking fail point to be reached and instead the timeout happen and we go directly to the .onError step which hangs the test forever and the service is waiting for forget shard split command to be kicked in from the test which is currently blocking on the failpoint to be triggered. 

            Assignee:
            Mathis Bessa (Inactive)
            Reporter:
            Mathis Bessa (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: