Type: Bug
Resolution: Fixed
Priority: Critical - P2
Fix Version/s: 3.6.5, 3.7.7
Affects Version/s: 3.6.4
Component/s: Networking
Labels: None
Backwards Compatibility: Fully Compatible
Operating System: ALL
Backport Requested: v3.6
Sprint: Platforms 2018-04-23, Platforms 2018-05-07
Linked BF Score: 0
Confidence Status: None
Work Order: 0
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

The AsyncRequestsSender holds a lock during construction and while scheduling work.
That lock prevents a response callback from running if the response arrives while scheduling is still in progress.
Scheduling can take a long time (up to 20 seconds per shard) when a read preference cannot be satisfied, because targeting makes a blocking call into the ReplicaSetMonitor.

The bad sequence of events is:

1. A scatter-gather request to two shards is dispatched.
2. The first host succeeds in targeting and runs.
3. Targeting for the second host cannot satisfy its read preference, so scheduling blocks while holding the lock.
4. The first request succeeds, but its callback blocks in _handleResponse waiting for that lock.
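
The sequence above can be reproduced with a small toy model. This is a Python sketch, not MongoDB code: `ToySender`, `schedule_all`, `handle_response`, and `run_demo` are hypothetical names, and `time.sleep` stands in for the blocking call into the ReplicaSetMonitor. It only shows that a callback needing the scheduling lock waits for roughly as long as targeting blocks.

```python
import threading
import time


class ToySender:
    """Hypothetical stand-in for a sender that schedules work under a lock."""

    def __init__(self, targeting_delay):
        self._mutex = threading.Lock()
        self._targeting_delay = targeting_delay
        self.callback_wait = None  # seconds the callback waited for the lock

    def schedule_all(self, first_response_ready):
        # Scheduling holds the lock for its whole duration.
        with self._mutex:
            # Shard 1 is targeted at once, so its response can arrive now.
            first_response_ready.set()
            # Shard 2 cannot satisfy its read preference. The sleep models
            # the blocking targeting call (an assumption of this sketch).
            time.sleep(self._targeting_delay)

    def handle_response(self):
        # Runs on a worker thread and needs the same lock as scheduling.
        start = time.monotonic()
        with self._mutex:
            self.callback_wait = time.monotonic() - start


def run_demo(targeting_delay=0.3):
    sender = ToySender(targeting_delay)
    ready = threading.Event()

    def worker():
        ready.wait()              # the response for shard 1 has arrived
        sender.handle_response()  # blocks until scheduling releases the lock

    thread = threading.Thread(target=worker)
    thread.start()
    sender.schedule_all(ready)
    thread.join()
    return sender.callback_wait


if __name__ == "__main__":
    print(f"callback waited {run_demo():.2f}s for the scheduling lock")
```

The callback itself does almost no work here; nearly all of its elapsed time is spent waiting on the lock held by the scheduler.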

If enough of these requests are in flight, they can saturate all background networking workers, leaving the mongos completely unresponsive until targeting succeeds.
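
The saturation effect can be sketched the same way. The following Python model is illustrative only: `saturate`, `blocked_callback`, and the single shared `sender_mutex` are assumptions of the sketch (a real process would have one lock per sender), and a fixed-size `ThreadPoolExecutor` stands in for the background networking workers. Once every worker is parked on a lock held by a stuck scheduler, an unrelated task cannot run until the lock is released.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


def saturate(pool_size=2):
    sender_mutex = threading.Lock()    # held by a scheduler stuck in targeting
    started = threading.Semaphore(0)   # counts callbacks that occupy a worker
    unrelated_ran = threading.Event()

    def blocked_callback():
        started.release()
        with sender_mutex:             # blocks until targeting finishes
            pass

    pool = ThreadPoolExecutor(max_workers=pool_size)
    sender_mutex.acquire()             # scheduler begins its long targeting call
    futures = [pool.submit(blocked_callback) for _ in range(pool_size)]
    for _ in range(pool_size):
        started.acquire()              # every worker is now busy with a callback

    # An unrelated task is queued but no worker is free to pick it up.
    unrelated = pool.submit(unrelated_ran.set)
    ran_while_stuck = unrelated_ran.wait(timeout=0.2)

    sender_mutex.release()             # targeting finally succeeds
    for future in futures:
        future.result()
    unrelated.result()
    pool.shutdown()
    return ran_while_stuck, unrelated_ran.is_set()


if __name__ == "__main__":
    stuck, after = saturate()
    print("unrelated task ran while workers were blocked:", stuck)
    print("unrelated task ran after the lock was released:", after)
```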

Related to: SERVER-35167 AsyncResultsMerger can block networking threads in callbacks

Closed

Assignee: Mira Carey
Reporter: Mira Carey
Participants: Githook User, Mira Carey
Votes: 0
Watchers: 8

Created: Apr 19 2018 10:14:21 PM UTC
Updated: Oct 29 2023 10:32:33 PM UTC
Resolved: Apr 24 2018 04:12:51 PM UTC
