Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 7.1.0-rc0, 7.0.0-rc8, 6.0.9
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Service Arch
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v7.0, v6.0
Sprint:
Service Arch 2023-07-10
Linked BF Score:
34
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

$search commands are now retried once in mongod when they fail due to network errors. The infrastructure they run on is obligated to tell the ConnectionPool about the networking error, so that the ConnectionPool can react to information about the un-reachability of the host/issues with sockets to that host.

Since ~~SERVER-77195~~, the ConnectionPool will react to most such reported network errors by closing current-generation connections to the remote as the driver CMAP spec specifies, and open a new generation of connections for future requests.

When using NetworkInterfaceTL to run the RPC/$search, it is guaranteed that the ConnectionPool receives the notification of the NetworkError before the $search/RPC can be retried, which ensures that the retry uses a connection from the new generation. However, with the PinnedConnectionTaskExecutor, there is a race condition where there is the potential for the retry to happen before the ConnectionPool is notified of the error.

This isn't strictly a correctness issue, but our testing assumes that the retry will get a healthy connection, which might not be true if the request sneaks in before the pool closes current-gen connections, because it also fails pending requests as a part of that process. So we should fix either the behavior or the test.

Assignee:: George Wangensteen (Inactive)
Reporter:: George Wangensteen (Inactive)
Participants:: George Wangensteen, Githook User
Votes:: 0 Vote for this issue
Watchers:: 5 Start watching this issue

Created:: Jun 29 2023 08:45:02 PM UTC
Updated:: Oct 29 2023 09:19:23 PM UTC
Resolved:: Jul 07 2023 01:23:36 PM UTC
Confidence Status Last Update:: 06/Jul/23 1:52 PM

Details

Description

Attachments

Forms

Activity

People

Dates