Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 6.1.0-rc0
Affects Version/s: None
Component/s: Internal Code
Labels: None

Backwards Compatibility: Fully Compatible
Operating System: ALL
Sprint: Service Arch 2022-05-30
Confidence Status: None
Work Order: 3
CAR Domain/s: None

Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

When a mongos shuts down, it attempts to join with client threads only when ASAN is enabled, and even then it does so with a timeout before exiting the process. Before this happens, it calls shutdownAndJoin on the TaskExecutorPool. Client threads may therefore still be running while the ThreadPoolTaskExecutor is in a call to join(). As part of completing, join() signals all of the unsignaled events. If join() completes just before a client thread tries to signal an event, the client thread signals that event a second time and triggers an invariant(). I believe this is a bug in TaskExecutor rather than a misuse of it.

One way to solve this would be to make signalEvent() a no-op when the TaskExecutor is in shutdown. This guarantees that every event is signaled exactly once: either it is signaled before shutdown, or it is signaled as part of shutdown, and all subsequent calls to signalEvent() do nothing.
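
A minimal sketch of the first option, to make the "signaled exactly once" guarantee concrete. This is not the ThreadPoolTaskExecutor code: SketchExecutor, Event, and signalCount are hypothetical names invented for illustration, and the sketch assumes that marking the executor as shut down and signaling the remaining events happen under the same mutex that signalEvent() takes.

```cpp
#include <cassert>
#include <memory>
#include <mutex>
#include <vector>

// Hypothetical stand-in for an executor event (not MongoDB's real type).
struct Event {
    bool signaled = false;
    int signalCount = 0;  // illustration only: counts how often the event was signaled
};

// Hypothetical, heavily simplified executor that only models event signaling.
class SketchExecutor {
public:
    std::shared_ptr<Event> makeEvent() {
        std::lock_guard<std::mutex> lk(_mutex);
        auto ev = std::make_shared<Event>();
        _events.push_back(ev);
        return ev;
    }

    // Proposed behavior: once shutdown has started, signaling is a no-op,
    // because shutdown itself signals every event that is still unsignaled.
    void signalEvent(const std::shared_ptr<Event>& ev) {
        std::lock_guard<std::mutex> lk(_mutex);
        if (_inShutdown) {
            return;
        }
        assert(!ev->signaled);  // plays the role of the invariant() in the report
        signalLocked(*ev);
    }

    // Marks the executor as shut down and signals all unsignaled events,
    // both under the same lock, so no client signal can slip in between.
    void shutdownAndJoin() {
        std::lock_guard<std::mutex> lk(_mutex);
        _inShutdown = true;
        for (auto& ev : _events) {
            if (!ev->signaled) {
                signalLocked(*ev);
            }
        }
        _events.clear();
    }

private:
    static void signalLocked(Event& ev) {
        ev.signaled = true;
        ++ev.signalCount;
    }

    std::mutex _mutex;
    bool _inShutdown = false;
    std::vector<std::shared_ptr<Event>> _events;
};
```

In this sketch, a client thread that calls signalEvent() after shutdownAndJoin() returns finds the shutdown flag set and does nothing, so the late call no longer trips the assertion.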

Another way of solving this would be to change the order of shutdown, so that we join with all client threads before shutting down the TaskExecutor. Right now, we don't even attempt to join with client threads unless we're running under ASAN, and even then we do so with a timeout, so this would be a significant change.
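
A rough sketch of the second option's ordering, under stated assumptions. The helper shutdownInOrder, the stopRequested flag, and the shutdownExecutor callback are all hypothetical; mongos has no such function, and real client threads would need their own mechanism for noticing shutdown. The sketch also joins without a timeout, which is exactly the behavior change described above.

```cpp
#include <atomic>
#include <functional>
#include <thread>
#include <vector>

// Hypothetical helper: ask client threads to stop, wait for all of them,
// and only then run the executor shutdown. After the joins complete, no
// client thread can still be signaling events.
inline void shutdownInOrder(std::atomic<bool>& stopRequested,
                            std::vector<std::thread>& clients,
                            const std::function<void()>& shutdownExecutor) {
    stopRequested.store(true);   // clients are assumed to poll this flag
    for (auto& t : clients) {
        if (t.joinable()) {
            t.join();            // unbounded wait: no timeout in this sketch
        }
    }
    shutdownExecutor();          // e.g. the pool's shutdownAndJoin in the real code
}
```

The cost of this ordering is that a stuck client thread would block process exit, which is presumably why the current code only joins under ASAN and with a timeout.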

I believe this problem is the cause of:
~~SERVER-25497~~

AC: Choose one of the two proposed solutions (or a potential third?).

is related to

SERVER-25497 Fix sharded query path to handle shutdown of the mongos process

Closed

Assignee: Amirsaman Memaripour
Reporter: Ian Boros
Participants: Amirsaman Memaripour, Andy Schwerin, Githook User, Ian Boros, Lauren Lewis, Matthew Tretin, Max Hirschhorn, Ruoxin Xu
Votes: 0
Watchers: 13

Created: Feb 27 2018 06:30:42 PM UTC
Updated: Oct 29 2023 10:34:23 PM UTC
Resolved: May 18 2022 09:13:51 PM UTC
Confidence Status Last Update: 18/May/22 4:31 PM
