Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: 2.4.0-rc1
Affects Version/s: None
Component/s: Replication
Labels:
None

Operating System:
ALL
Steps To Reproduce:

Hide

1) Setup a replica set with 5 nodes A, B, C, D, E, primary is A
2) Artificially slaveDelay nodes B and C by 40s, and node D by 80s
3) Artificially introduce network latency such D's ping time to B and C is low but varies, and the latency from D to E is high (so E will never be chosen as a sync source)
4) Symmetrically blackhole connections from A to D, to make sure D doesn't choose the primary as a sync source initially (could be done differently, this was just easiest).
5) Start single-threaded javascript insert load on node A

The idea here is that there are three lagging nodes in the replica set, B, C, and D. Because E is 30s ahead of B and C, D will keep trying to change sync sources. However, since B and C are closer in ping time than E, E will never be chosen and the sync source changes between C and D depending on the stochastic ping delay.

This kind of problem might occur in the wild if three lagging nodes were in a separate data center, for example. Once a node falls too far behind the others, the sync swapping would push the node into permanent recovery mode (until one of the other nodes catches up).

Show
1) Setup a replica set with 5 nodes A, B, C, D, E, primary is A 2) Artificially slaveDelay nodes B and C by 40s, and node D by 80s 3) Artificially introduce network latency such D's ping time to B and C is low but varies, and the latency from D to E is high (so E will never be chosen as a sync source) 4) Symmetrically blackhole connections from A to D, to make sure D doesn't choose the primary as a sync source initially (could be done differently, this was just easiest). 5) Start single-threaded javascript insert load on node A The idea here is that there are three lagging nodes in the replica set, B, C, and D. Because E is 30s ahead of B and C, D will keep trying to change sync sources. However, since B and C are closer in ping time than E, E will never be chosen and the sync source changes between C and D depending on the stochastic ping delay. This kind of problem might occur in the wild if three lagging nodes were in a separate data center, for example. Once a node falls too far behind the others, the sync swapping would push the node into permanent recovery mode (until one of the other nodes catches up).
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Given particular replica set configurations and network delays, it's possible for a node to repeatedly change the host it syncs from after every replicated batch. This causes the node to fall behind, and eventually the node ends up in recovery mode.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

sync_fast_switch.js
3 kB
Jan 18 2013 06:31:32 PM UTC
sync_fast_switch.js
3 kB
Jan 18 2013 10:22:03 PM UTC
helpers.js
5 kB
Jan 18 2013 10:22:03 PM UTC
currentTest.txt
1.97 MB
Jan 18 2013 06:31:32 PM UTC
currentTest_fallbehind.txt
897 kB
Jan 18 2013 06:31:32 PM UTC

is related to

SERVER-5208 Replica set periodic reevaluation of sync targets

Closed

Assignee:: Kristina Chodorow (Inactive)
Reporter:: Greg Studer (Inactive)
Participants:: auto, Greg Studer, Kristina Chodorow
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Jan 18 2013 06:24:47 PM UTC
Updated:: Jul 11 2016 05:58:24 PM UTC
Resolved:: Feb 16 2013 12:47:08 PM UTC

Details

Description

Attachments

Attachments

Issue Links

Forms

Activity

People

Dates