We need this machinery to assert the fixes that will be made in RSM to handle mongos->mongod outage in help ticket are resilient to delays.
However this test may inject quite a lot of flakiness, the mitigation plan should be developed on the go. Most likely, the delay cap should start low and be increased after the shard is up.