Type: Task
Resolution: Gone away
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Replication
Labels: None
Sprint: Repl 2020-02-10
Confidence Status: None
Work Order: 3
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

When the server responds with a State Change Error triggered by the failCommand failpoint, it should also increment topologyVersion and respond to waiting isMaster requests. The Drivers team uses failCommand extensively in spec tests for retryable writes and reads. Without this change, it takes the client ~10 seconds (maxAwaitTimeMS) to rediscover the server's state.

For example:

client configures a failCommand with NotMaster
client runs a retryable write against Primary P
client observes a NotMaster error and sets P to Unknown
client runs the retry attempt which blocks until P is rediscovered
P's Monitor is blocked for 10 seconds waiting for an awaitable isMaster response
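
To make the first step concrete, here is a minimal sketch of the command document a test might send to configure the failpoint. The helper name `fail_command_not_master` and the choice of `insert` as the failing command are illustrative assumptions, not taken from this ticket; 10107 is the server's error code for NotMaster.

```python
NOT_MASTER = 10107  # server error code for NotMaster


def fail_command_not_master(command_names, times=1):
    """Build a configureFailPoint document (hypothetical helper).

    The failpoint makes the next `times` matching commands fail
    with a NotMaster error instead of executing.
    """
    return {
        "configureFailPoint": "failCommand",
        "mode": {"times": times},
        "data": {
            "failCommands": list(command_names),
            "errorCode": NOT_MASTER,
        },
    }


# Fail the next insert once, as in the retryable write scenario above.
cmd = fail_command_not_master(["insert"])
```

With PyMongo, a document like this could presumably be sent with `client.admin.command(cmd)` against a server started with test commands enabled; that setup is assumed here rather than described in the ticket.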

After this change, the 10-second hang should be removed:

client configures a failCommand with NotMaster
client runs a retryable write against Primary P
client observes a NotMaster error and sets P to Unknown
client runs the retry attempt which blocks until P is rediscovered
P's Monitor immediately receives an awaitable isMaster response and sets P to Primary
client retry attempt succeeds ASAP
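
The difference between the two sequences can be illustrated with a toy model. This is not the server's implementation: `ToyServer`, its methods, and `time_to_rediscover` are hypothetical names, and topologyVersion is reduced to a plain integer counter as a simplification. The sketch only shows why bumping the version and waking waiters lets a blocked monitor return well before maxAwaitTimeMS elapses.

```python
import threading
import time


class ToyServer:
    """Toy stand-in for a server answering awaitable isMaster requests."""

    def __init__(self):
        self._cond = threading.Condition()
        self.topology_version = 0  # simplification: a bare counter

    def await_is_master(self, known_version, max_await_s):
        # Block until the version moves past what the client has seen,
        # or until the await timeout expires.
        with self._cond:
            self._cond.wait_for(
                lambda: self.topology_version > known_version,
                timeout=max_await_s,
            )
            return {"ismaster": True, "topologyVersion": self.topology_version}

    def fail_command_state_change_error(self):
        # Proposed behaviour: returning a state change error from the
        # failpoint also bumps the version and wakes waiting isMasters.
        with self._cond:
            self.topology_version += 1
            self._cond.notify_all()
        return {"ok": 0, "code": 10107, "codeName": "NotMaster"}


def time_to_rediscover(server, error_delay_s=0.05, max_await_s=10.0):
    """Return (elapsed seconds, isMaster reply) for a monitor blocked on the server."""
    result = {}

    def monitor():
        result["reply"] = server.await_is_master(0, max_await_s)

    start = time.monotonic()
    thread = threading.Thread(target=monitor)
    thread.start()
    time.sleep(error_delay_s)  # the client's write hits the failpoint
    server.fail_command_state_change_error()
    thread.join()
    return time.monotonic() - start, result["reply"]
```

In this model the monitor returns shortly after the error is produced. If `fail_command_state_change_error` skipped the version bump and the `notify_all` call, the monitor would instead block for the full `max_await_s`, which corresponds to the 10-second hang described above.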

Assignee: Jason Chan
Reporter: Shane Harvey
Participants: Jason Chan, Shane Harvey, Tess Avitabile
Votes: 0
Watchers: 6

Created: Jan 21 2020 10:27:04 PM UTC
Updated: Oct 27 2023 08:42:25 PM UTC
Resolved: Jan 31 2020 02:45:26 PM UTC
Confidence Status Last Update: 28/Jan/20 7:23 PM
