Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 2.5.0
Component/s: Sharding
Labels:
None

Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

In particular failure modes, retries to a failed config server can take several seconds and block queries to secondary and tertiary config servers. When possible, we should be smarter about reading from other config servers when a server is unavailable. This especially impacts authenticated clusters, since authentication data is not cached in mongos, so new authenticated connections are initially slow to respond.

Example:
1. First config server goes down and is unresponsive to the network, but does not reject packets.
2. A new authenticated connection is created to mongos.
3. Mongos tries to read from the first config server, and before the read tries to reconnect. This eventually fails, but not until the several second timeout.
4. Mongos successfully reads from the second config server, but the response time is bad.
5. This continues to happen for future new connections, each new connection waits for the full timeout, despite the fact that the server is still unavailable.

duplicates

SERVER-11332 Authentication requests delayed if first config server is unresponsive

Closed

Assignee:: Unassigned
Reporter:: Greg Studer (Inactive)
Participants:: Greg Studer, Justin Patrin
Votes:: 3 Vote for this issue
Watchers:: 8 Start watching this issue

Created:: Jun 12 2013 07:25:33 PM UTC
Updated:: Dec 10 2014 11:04:48 PM UTC
Resolved:: Mar 07 2014 07:03:45 PM UTC

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates