Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.0.5, 4.1.4
Affects Version/s: 3.5.11
Component/s: Sharding
Labels:
- sharding-wfbf-day

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v4.0, v3.6
Sprint:
Sharding 2018-10-08
Case:
Linked BF Score:
25
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The _shardingOnTransitionToPrimaryHook callback is invoked when a node becomes a primary. If that node is part of a sharded cluster, it will execute the "ShardingStateRecovery" step, which reads from disk the optime of the last write that the node performed against the config server (where such a write is the chunk migration commit).

The _shardingOnTransitionToPrimaryHook step is executed after the replMutex has been unlocked and because of this, it is possible that the node can actually lose the majority quorum and never become primary. Since the "ShardingStateRecovery" step performs majority reads it will fail in this case, which in turn will crash replication step-up with assert 40107.

Since this is an expected situation, the sharding code should handle it appropriately.

Assignee:: Kaloian Manassiev
Reporter:: Randolph Tan
Participants:: Githook User, Kaloian Manassiev, Randolph Tan
Votes:: 0 Vote for this issue
Watchers:: 6 Start watching this issue

Created:: Aug 17 2017 02:59:13 PM UTC
Updated:: Oct 30 2023 11:14:13 PM UTC
Resolved:: Sep 26 2018 07:57:44 AM UTC
Confidence Status Last Update:: 24/Sep/18 2:33 PM

Details

Description

Attachments

Forms

Activity

People

Dates