Type: Bug
Resolution: Fixed
Priority: Critical - P2
Fix Version/s: 6.0.6, 7.0.0-rc1, 6.3.2
Affects Version/s: 6.1.1, 6.0.4, 7.0.0-rc0, 6.0.5, 6.2.1, 6.3.0-rc3
Component/s: None
Labels: None
Backwards Compatibility: Fully Compatible
Operating System: ALL
Backport Requested: v7.0
Sprint: Sharding EMEA 2023-05-01
Case:
Confidence Status: None
Work Order: 0
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

Issue
If there is at least one huge chunk on a shard being drained, the balancer may loop indefinitely through the following sequence:

1. The migration proceeds for 6 hours before being aborted.
2. The same migration is rescheduled.

Technical description
When draining a shard, the balancer schedules migrations with the forceJumbo flag set to true (so they proceed regardless of the number of documents to clone) and passes the whole chunk entry as the argument (so the whole chunk must be migrated in one shot).

This differs from the usual balancing behavior. Since the removal of the auto-splitter in 6.0.3, the balancer normally issues moveRange commands that specify only the min bound, so that the donor shard autonomously decides at which key to split the chunk according to the configured chunk size.
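
The difference between the two request shapes can be sketched as follows. This is only an illustration: the field names follow the user-facing moveRange command (toShard, min, max, forceJumbo), the requests the balancer sends internally may differ, and the helper `build_move_range`, the namespace, the shard name and the key values are all hypothetical.

```python
from typing import Any, Dict, Optional


def build_move_range(
    namespace: str,
    to_shard: str,
    min_key: Dict[str, Any],
    max_key: Optional[Dict[str, Any]] = None,
    force_jumbo: bool = False,
) -> Dict[str, Any]:
    """Build a moveRange-style command document (hypothetical helper)."""
    cmd: Dict[str, Any] = {
        "moveRange": namespace,
        "toShard": to_shard,
        "min": min_key,
    }
    # With both bounds set, the whole [min, max) range must move in one shot.
    if max_key is not None:
        cmd["max"] = max_key
    # forceJumbo lets the migration proceed regardless of the range size.
    if force_jumbo:
        cmd["forceJumbo"] = True
    return cmd


# Draining (behavior described in this ticket): whole chunk plus forceJumbo.
draining = build_move_range(
    "test.coll", "shard1", {"x": 0}, {"x": 1000}, force_jumbo=True
)

# Usual balancing: only the min bound, so the donor shard picks the split
# point according to the configured chunk size.
balancing = build_move_range("test.coll", "shard1", {"x": 0})
```

With only `min` set, a huge chunk can be moved in pieces no larger than the configured chunk size. With both bounds set, it has to be cloned in a single migration, which is the case that can run for hours.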

Is caused by: SERVER-71787 Balancer needs to attach forceJumbo to `moveRange` command (Closed)

Assignee: Pierlauro Sciarelli
Reporter: Pierlauro Sciarelli
Participants: Githook User, Pierlauro Sciarelli
Votes: 0
Watchers: 10

Created: Apr 26 2023 02:41:13 PM UTC
Updated: Oct 29 2023 09:22:28 PM UTC
Resolved: Apr 27 2023 02:36:22 PM UTC
Confidence Status Last Update: Apr 26 2023 02:44 PM
