Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.2.0-rc3, 4.3.1
Affects Version/s: None
Component/s: None
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:
v4.2
Sprint:
Service Arch 2019-07-01
Linked BF Score:
16
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

We currently start sharding before replication (see SERVER-41005). As a result, sharding can miss certain writes on startup: writes to admin.system.version that are still in the oplog and haven't yet been recovered.

Untangling all the dependencies between sharding and replication is going to be quite difficult, and in the meantime shard_aware_init has more failures than we'd like (see BF-12759). That test specifically checks that corrupting our version (via a manual update to admin.system.version) causes mongod to crash on startup. The problem is that, because we start sharding before replication (and also do a complicated dance of restarting in standalone mode to corrupt the document), we can perform the update when the document we want to modify isn't present (it's still in the oplog and we're in standalone mode). The update then changes nothing, and mongod fails to crash on startup.

So let's fix up that test by waiting for the oplog to be flushed before shutting down the node (when in replica set mode).
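
To make the race concrete, here is a minimal toy model in Python. It is only a sketch of the ordering problem described above, not MongoDB code: `ToyNode`, its methods, the `"version-doc"` key and the `valid` field are all hypothetical names invented for illustration. It assumes that a replicated write sits in the oplog until flushed, that standalone mode only sees applied documents, and that sharding initialization inspects the document before replication recovery runs.

```python
class ToyNode:
    """Toy stand-in for a node's on-disk state; not real MongoDB behaviour."""

    def __init__(self):
        self.oplog = []       # writes logged but not yet applied
        self.collection = {}  # applied documents, keyed by id

    def replicated_write(self, doc_id, doc):
        # In replica set mode the write is recorded in the oplog first.
        self.oplog.append((doc_id, doc))

    def flush_oplog(self):
        # Apply every pending oplog entry to the collection.
        for doc_id, doc in self.oplog:
            self.collection[doc_id] = doc
        self.oplog.clear()

    def standalone_update(self, doc_id, changes):
        # Standalone mode does not replay the oplog, so an unapplied
        # document is invisible and the update matches nothing.
        if doc_id not in self.collection:
            return 0
        self.collection[doc_id].update(changes)
        return 1

    def start_as_shard(self):
        # Sharding initialization runs first and sees only applied state.
        doc = self.collection.get("version-doc")
        if doc is not None and not doc["valid"]:
            raise RuntimeError("corrupt version document, refusing to start")
        # Replication recovery runs afterwards.
        self.flush_oplog()


def run(flush_before_shutdown):
    """Return (documents matched by the corrupting update, crashed on restart)."""
    node = ToyNode()
    node.replicated_write("version-doc", {"valid": True})
    if flush_before_shutdown:
        node.flush_oplog()  # the proposed test fix
    matched = node.standalone_update("version-doc", {"valid": False})
    try:
        node.start_as_shard()
        crashed = False
    except RuntimeError:
        crashed = True
    return matched, crashed
```

In this model, `run(False)` reproduces the flaky outcome (the update matches nothing and the restart does not crash), while `run(True)` gives the outcome the test expects (the update matches and the restart crashes).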

is related to

SERVER-41005 Sharding initialization should not occur before replication recovery

Closed

Assignee: Mira Carey
Reporter: Mira Carey
Participants: Githook User, Mira Carey
Votes: 0
Watchers: 1

Created: Jun 17 2019 07:49:43 PM UTC
Updated: Oct 29 2023 10:19:47 PM UTC
Resolved: Jun 19 2019 07:22:49 PM UTC
Confidence Status Last Update: 17/Jun/19 8:34 PM
