Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Sharding
Labels:
- cs-product-sync
- resharding-improvements

Assigned Teams:

Cluster Scalability
Operating System:
ALL
Sprint:
Cluster Scalability 2024-07-08, Cluster Scalability 2024-07-22, Cluster Scalability 2024-08-19
Case:
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Resharding like replication applies oplog entries in batches using multiple parallel threads. Oplog entries that touch the same document are batched together and applied in the same thread. So oplog application in resharding (and replication) preserves the (timestamp, _id) order; however, it doesn't preserve the overall write order. Consider a collection with a unique index {a: 1}, we insert the document {_id: 1, a: "foo"} and then delete {_id: 1, a: "foo"} and then insert {_id: 2, a: "foo"}. Resharding would apply the oplog entries in two threads:

Thread 1: insert {_id: 1, a: "foo"}, delete {_id: 1, a: "foo"}
Thread 2: insert {_id: 2 a: "foo"}

So if Thread 2 runs completely before Thread 1 if Thread 2 interleaves with Thread 1, then oplog application would end up with a DuplicateKey error. It should just ignore this DuplicateKey error just like what replication oplog application does today.

is duplicated by

SERVER-99668 Also add oplogBatchApplierTaskCount to reshardCollection command

Closed

related to

SERVER-90669 moveCollection can hit a duplicate key error during applying phase

Closed

SERVER-99668 Also add oplogBatchApplierTaskCount to reshardCollection command

Closed

SERVER-92043 Add reshardingOplogBatchTaskCount as a command parameter to reshardCollection, moveCollection and unshardCollection

Closed

1.	Record all indexes which are unique:true indexes	SERVER-92081	Open	Grant Xiao (Inactive)
2.	Build all indexes as unique:false indexes	SERVER-92082	Backlog	Grant Xiao (Inactive)
3.	Run collMod during resharding critical section	SERVER-92083	Backlog	Grant Xiao (Inactive)
4.	Add performance test for impact of collMod to make indexes unique	SERVER-92124	Backlog	Grant Xiao (Inactive)
5.	Improve performance of collmod/ use separate approach to meet critical section requirements	SERVER-93223	Backlog	Unassigned

Assignee:: Unassigned
Reporter:: Cheahuychou Mao
Participants:: Cheahuychou Mao, Max Hirschhorn
Votes:: 0 Vote for this issue
Watchers:: 16 Start watching this issue

Created:: Jan 16 2024 10:13:48 PM UTC
Updated:: Mar 14 2025 07:24:41 PM UTC
Confidence Status Last Update:: 03/Jul/24 5:45 PM

Details

Description

Attachments

Issue Links

Sub-Tasks

Activity

People

Dates