- Type: Task
- Resolution: Unresolved
- Priority: Unknown
- None
- Component/s: Retryability
- Needed
Summary
Unified tests for Retryable Reads & Writes of the form "[command] succeeds after retryable handshake [error type]" intermittently fail the assertion on `ConnectionCheckoutStarted` events (expected 4x, got 3x) in multiple drivers (Node - NODE-6832, Python - PYTHON-5323, and Rust confirmed). We should investigate how to make these tests reliable.
Motivation
Who is the affected end user?
Driver devs
How does this affect the end user?
Flaky tests are distracting and can mask true failures.
How likely is it that this problem or use case will occur?
Very; e.g., weekly in Node
If the problem does occur, what are the consequences and how severe are they?
Time is wasted
Is this issue urgent?
No
Is this ticket required by a downstream team?
No
Is this ticket only for tests?
Yes
Acceptance Criteria
- Update the retryable reads and writes tests for retryable handshake errors to eliminate flakiness (as verified by the affected drivers: no occurrences in 1000 iterations after the fix)
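One way a driver could check the "no occurrences in 1000 iterations" criterion is a small repeat-run harness. The following is a minimal sketch only, not any driver's actual test runner: `count_failures` and `make_stand_in_test` are hypothetical names, and the stand-in test merely imitates the reported symptom (3 `ConnectionCheckoutStarted` events observed where 4 were expected) so the harness can be run without a server.

```python
from typing import Callable


def count_failures(test: Callable[[], None], iterations: int = 1000) -> int:
    """Run `test` `iterations` times and return how many runs raised AssertionError."""
    failures = 0
    for _ in range(iterations):
        try:
            test()
        except AssertionError:
            failures += 1
    return failures


def make_stand_in_test(fail_every: int) -> Callable[[], None]:
    """Hypothetical stand-in for a real unified test: fails on every `fail_every`-th call."""
    calls = 0

    def stand_in() -> None:
        nonlocal calls
        calls += 1
        # Imitates the reported symptom: 3 events observed where 4 were expected.
        observed = 3 if calls % fail_every == 0 else 4
        assert observed == 4, (
            f"expected 4 ConnectionCheckoutStarted events, got {observed}"
        )

    return stand_in


if __name__ == "__main__":
    failures = count_failures(make_stand_in_test(fail_every=250))
    print(f"{failures} failures in 1000 iterations")
```

In practice the stand-in would be replaced by a callable that runs one of the affected unified tests against a real deployment; the criterion is met when the returned count is 0.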
- related to
  - NODE-6832 Flaky: <command> succeeds after retryable handshake network error [5] (Backlog)
  - PYTHON-5323 Test Failure (PyPy) - test.test_retryable_writes_unified.TestUnifiedHandshakeError.test_collection_findOneAndReplace_succeeds_after_retryable_handshake_server_error__ShutdownInProgress_ (Backlog)