Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
- servicearch-wfbf-day

Assigned Teams:

Service Arch
Operating System:
ALL
Linked BF Score:
29
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The PrimaryOnlyService stores a list of operation contexts running on its associated Client threads. When the host running the service steps down, and PrimaryOnlyService::onStepDown is called, each operation context in the list is killed here.

However, if another thread is currently managing the step-up process when stepDown is called, it's possible another thread is in the middle of running PrimaryOnlyService::_rebuildInstances. In this thread, a new operation context associated with the POS is created here, and registered with the POS (i.e. inserted into it's _opCtxs member) by the hooks in the PrimaryOnlyServiceClientObserver here. If this operation context goes out of scope while another thread runs onStepDown/tries to kill it, there will be a race between the killing thread reading the operationContext's _baton member here and the thread in which it has fallen out of scope writing the value of _baton here in the chain of calls starting with the opCtx's destructor.

To fix this, we could consider:
running the PrimaryOnlyServiceClientObserver's cleanup hooks, which will remove the opCtx from the POS's list, before allowing the opCtx destructor to modify any of it's state (i.e. switch the call to opCtx->getBaton->detach() with the line invoking the hooks here).

related to

SERVER-52849 PrimaryOnlyService _rebuildServices accesses _scopedExecutor without locking the mutex

Closed

Assignee:: [DO NOT USE] Backlog - Service Architecture
Reporter:: George Wangensteen (Inactive)
Participants:: [DO NOT USE] Backlog - Service Architecture, George Wangensteen
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Dec 21 2020 03:19:20 PM UTC
Updated:: Dec 06 2022 01:41:08 AM UTC
Resolved:: Mar 16 2021 07:17:10 PM UTC

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates