[SERVER-36742] mongo shell shutdown: signal handler race? Created: 17/Aug/18  Updated: 29/Oct/23  Resolved: 20/Aug/18

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: 4.1.3

Type: Bug
Priority: Major - P3
Reporter: Billy Donahue
Assignee: Billy Donahue
Resolution: Fixed
Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Duplicate
Backwards Compatibility: Fully Compatible
Operating System: ALL
Sprint: Platforms 2018-08-27
Participants:
Linked BF Score: 25

 Description   

The logs of the BFGs attached to https://jira.mongodb.org/browse/BF-10120 show a unit test, js_test:user_cache_doc_source, succeeding.

It launches a mongo shell (with startParallelShell) and a mongod server. The server shuts down cleanly, but the mongo shell does not: it dies from the SIGTERM. (pid 15354 is the spawned 'mongo' process.)

[js_test:user_cache_doc_source] 2018-08-03T18:23:06.819+0000 terminating the following processes started by jstests/auth/user_cache_doc_source.js: 15354
[js_test:user_cache_doc_source] 2018-08-03T18:23:06.821+0000 2018-08-03T18:23:06.821+0000 I - [main] Process with pid 15354 exited with error code -15
[js_test:user_cache_doc_source] 2018-08-03T18:23:06.821+0000 one more more child processes exited with an error during jstests/auth/user_cache_doc_source.js

 
https://logkeeper.mongodb.org/lobster/build/85c97cc526eeaeffc622a55464085e27/test/5b63b673c2ab6838a8031998#bookmarks=0%2C49%2C110%2C127
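
For readers unfamiliar with the "error code -15" in the log above: the sketch below assumes the common convention that a negative exit code -N means the process was terminated by signal N, which matches this ticket's reading of -15 as SIGTERM. It is a Node.js illustration, not mongo shell or test harness code, and describeExitCode is a hypothetical helper name.

```javascript
const os = require("os");

// Hypothetical helper: turn an exit code like the one in the log into a
// readable description. Assumes negative codes encode "killed by signal N".
function describeExitCode(code) {
    if (code >= 0) {
        return `exited normally with status ${code}`;
    }
    const signum = -code;
    // Look up the signal name in the platform's signal table.
    const entry = Object.entries(os.constants.signals)
                      .find(([, num]) => num === signum);
    const name = entry ? entry[0] : `signal ${signum}`;
    return `terminated by ${name}`;
}

console.log(describeExitCode(-15));
```

Under that assumption, the spawned shell (pid 15354) did not exit on its own; it was still running when the harness terminated it.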

 

Hypothesis: The test refactoring of SERVER-36384 affected this test, and maybe the lifespan of these spawned processes is very short now, exposing a job control bug.



 Comments   
Comment by Githook User [ 20/Aug/18 ]

Author: Billy Donahue <billy.donahue@mongodb.com> (BillyDonahue)

Message: SERVER-36742 : wait on the spawned parallel shell
Branch: master
https://github.com/mongodb/mongo/commit/a057d5cf181b3cba808a2de30cd4110deb9bd6c9

Comment by Billy Donahue [ 17/Aug/18 ]

The problem might be that this test does not store or invoke the function object returned from startParallelShell.
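
A minimal sketch of the store-and-invoke pattern this comment describes. It assumes that calling the function returned from startParallelShell waits for the spawned shell to finish, which is what the commit message "wait on the spawned parallel shell" suggests. fakeStartParallelShell and runTest are hypothetical stand-ins so the sketch runs outside the mongo shell; the stand-in spawns no process and simply runs the child's work when joined.

```javascript
// Records the order of events so the effect of joining is visible.
const events = [];

// Hypothetical stand-in for startParallelShell (not the real implementation).
// It returns a join function, mirroring the function object mentioned above.
function fakeStartParallelShell(body) {
    events.push("spawned");
    let finished = false;
    return function awaitShell() {
        if (!finished) {
            body();  // simplification: the simulated child runs at join time
            finished = true;
            events.push("joined");
        }
        return 0;  // simulated exit code of the child
    };
}

function runTest(launch) {
    // Store the returned function instead of discarding it...
    const awaitShell = launch(() => events.push("child work done"));
    events.push("main test body");
    // ...and invoke it before the test returns, so no child process is left
    // running for the harness to terminate.
    return awaitShell();
}

const exitCode = runTest(fakeStartParallelShell);
console.log(events.join(" -> "), "exit code:", exitCode);
```

In an actual jstest the same shape would presumably be to assign the result of startParallelShell to a variable and call it at the end of the test. The details of the real change are in the linked commit.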
