Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 3.6.3, 3.7.2
Affects Version/s: None
Component/s: Sharding
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v3.6
Sprint:
Sharding 2018-01-15, Sharding 2018-01-29, Sharding 2018-02-12
Linked BF Score:
0
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

During the critical section, the source shard sends _configsvrCommitChunkMigration to the CSRS primary, but this can fail if the primary recently stepped down, causing the source shard to try to log "moveChunk.validating" on the CSRS primary to update its optime before refreshing metadata, and if this also fails, the source shard will fassert.

From this comment, it seems that this is desired behavior, but it's a problem for the continuous stepdowns concurrency suite with the balancer enabled, since background migrations can crash servers and fail the test when the cluster is torn down.

Example failure: https://evergreen.mongodb.com/task/mongodb_mongo_master_enterprise_rhel_62_64_bit_concurrency_sharded_with_stepdowns_and_balancer_WT_patch_b8f64cc3fde6d041f3e90b1cb2e153b0b15f6c47_5a20e4efe3c33173de00c68d_17_12_01_05_13_55

Assignee:: Jack Mulrow
Reporter:: Jack Mulrow
Participants:: Andy Schwerin, Githook User, Jack Mulrow, Kaloian Manassiev
Votes:: 0 Vote for this issue
Watchers:: 6 Start watching this issue

Created:: Jan 08 2018 11:18:39 PM UTC
Updated:: Oct 30 2023 11:09:30 PM UTC
Resolved:: Jan 30 2018 07:43:32 PM UTC
Confidence Status Last Update:: 24/Jan/18 7:24 PM

Details

Description

Attachments

Forms

Activity

People

Dates