Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 5.2.0, 4.4.17, 5.0.11
Affects Version/s: 5.0.0, 4.2.13, 4.4.5, 4.0.24
Component/s: Sharding
Labels:
None

Backwards Compatibility:
Fully Compatible
Backport Requested:

v5.3, v5.0, v4.4, v4.2
Sprint:
Sharding EMEA 2021-09-20, Sharding EMEA 2021-10-04, Sharding EMEA 2021-10-18, Sharding EMEA 2021-11-01
Case:
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

In current MongoDB code, the critical section will be entered regardless of whether the session migration has completed. This makes sense from a correctness standpoint – we must block writes before ensuring that all sessions associated with those writes have been migrated over. However, there exists the question of whether it's possible to coordinate the choice to enter the critical section with specific checkpoints in the critical section. Without such checkpointing, a migration could still take a long time completing the session migration while blocking writes.

An example would be ensuring that all entries that originally existed in the on-disk session catalog when migration started have been copied over before entering the critical section.

The cost of waiting on session migration checkpointing before entering the critical section is that it would allow more writes to come in, causing transferMods to take longer as well.

This ticket is to investigate the possibility of a middle ground with checkpointing session migrations while not prolonging the length of the migration unnecessarily. Other solutions to this problem are also welcome.

causes

SERVER-60958 Avoid server hang in chunk migration when step-down event occurs

Closed

SERVER-72254 Chunk Migration should fail immediately when session migration fails.

Backlog

related to

SERVER-31347 Remove SessionCatalogMigrationDestination::waitUntilReadyToCommit call in MigrationDestinationManager

Closed

SERVER-56307 The chunk migration "convergence algorithm" is very primitive

Closed

Assignee:: Allison Easton
Reporter:: Blake Oler
Participants:: Allison Easton, Blake Oler, Githook User
Votes:: 0 Vote for this issue
Watchers:: 20 Start watching this issue

Created:: Apr 19 2021 07:13:14 PM UTC
Updated:: Oct 29 2023 09:54:47 PM UTC
Resolved:: Oct 18 2021 12:45:05 PM UTC
Confidence Status Last Update:: 29/Sep/21 6:34 AM

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates