[SERVER-81942] ShardingDDLCoordinator should retry on LockTimeout errors
Created: 06/Oct/23
Updated: 06/Nov/23
Resolved: 20/Oct/23

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: 5.0.21, 7.2.0-rc0, 7.0.2, 6.0.11
Fix Version/s: 7.1.1, 7.2.0-rc0, 6.0.12, 5.0.23, 7.0.4

Type: Bug
Priority: Major - P3
Reporter: Tommaso Tocci
Assignee: Tommaso Tocci
Resolution: Fixed
Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: File lock_timeout_bug_repro.js    
Issue Links:
Backports
Assigned Teams:
Sharding EMEA
Backwards Compatibility: Fully Compatible
Operating System: ALL
Backport Requested:
v7.1, v7.0, v6.0, v5.0
Sprint: Sharding EMEA 2023-10-16, Sharding EMEA 2023-10-30
Participants:

 Description   

Sharding DDL Coordinators are not retrying on LockTimeout errors.

I propose making them retry on every error in the `Interruption` error category.
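
To illustrate the proposal, here is a minimal sketch of a retry loop that retries on a whole error category rather than on individual error codes. It is not the server's implementation (the actual fix is in the C++ commits linked in the comments below). `ErrorCategory`, `CoordinatorError`, `run_with_retry`, and the `max_attempts` bound are hypothetical names chosen for this example, and it assumes that LockTimeout belongs to the `Interruption` category, as the proposal implies.

```python
import enum


class ErrorCategory(enum.Enum):
    # Hypothetical categories, for illustration only.
    INTERRUPTION = "Interruption"
    OTHER = "Other"


class CoordinatorError(Exception):
    """Hypothetical error carrying an error code name and its category."""

    def __init__(self, code, category):
        super().__init__(code)
        self.code = code
        self.category = category


def run_with_retry(step, max_attempts=5):
    """Run step(), retrying only when it fails with an Interruption-category error.

    max_attempts is an assumption that keeps the sketch bounded.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return step()
        except CoordinatorError as err:
            retriable = err.category is ErrorCategory.INTERRUPTION
            if not retriable or attempt == max_attempts:
                raise  # non-retriable error, or retries exhausted
            # Otherwise fall through and try the step again.
```

With a check like this, a step that fails with LockTimeout is retried, while an error outside the category still propagates on the first attempt.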



 Comments   
Comment by Githook User [ 06/Nov/23 ]

Author: Tommaso Tocci <tommaso.tocci@mongodb.com> (toto-dev)

Message: SERVER-81942 ShardingDDLCoordinator should retry on LockTimeout errors

(cherry picked from commit 0391da8da8ce1d0c1a7a72be7fc9e9fad8793f93)
Branch: v7.1
https://github.com/mongodb/mongo/commit/86aead28bb6f90afeef9f38cead55c796248e72f

Comment by Githook User [ 06/Nov/23 ]

Author: Tommaso Tocci <tommaso.tocci@mongodb.com> (toto-dev)

Message: SERVER-81942 ShardingDDLCoordinator should retry on LockTimeout errors

(cherry picked from commit 0391da8da8ce1d0c1a7a72be7fc9e9fad8793f93)
Branch: v5.0
https://github.com/mongodb/mongo/commit/3dfecca4b753aa5eb37d4cace3ce09efaa4a1b7d

Comment by Githook User [ 06/Nov/23 ]

Author: Tommaso Tocci <tommaso.tocci@mongodb.com> (toto-dev)

Message: SERVER-81942 ShardingDDLCoordinator should retry on LockTimeout errors

(cherry picked from commit 0391da8da8ce1d0c1a7a72be7fc9e9fad8793f93)
Branch: v7.0
https://github.com/mongodb/mongo/commit/2a403ec36083e661cb0fbdeb45a0608be634213d

Comment by Githook User [ 06/Nov/23 ]

Author: Tommaso Tocci <tommaso.tocci@mongodb.com> (toto-dev)

Message: SERVER-81942 ShardingDDLCoordinator should retry on LockTimeout errors

(cherry picked from commit 0391da8da8ce1d0c1a7a72be7fc9e9fad8793f93)
Branch: v6.0
https://github.com/mongodb/mongo/commit/e251ebea8e06c63fd400bcdf3903363bf8c1f5aa

Comment by Githook User [ 20/Oct/23 ]

Author: Tommaso Tocci <tommaso.tocci@mongodb.com> (toto-dev)

Message: SERVER-81942 ShardingDDLCoordinator should retry on LockTimeout errors
Branch: master
https://github.com/mongodb/mongo/commit/0391da8da8ce1d0c1a7a72be7fc9e9fad8793f93

Comment by Marcos José Grillo Ramirez [ 06/Oct/23 ]

I've attached an easy repro.
