[SERVER-26928] Check for shutdown in pauseRsBgSyncProducer failpoint Created: 07/Nov/16  Updated: 08/Dec/16  Resolved: 08/Nov/16

Status: Closed
Project: Core Server
Component/s: Replication
Affects Version/s: None
Fix Version/s: 3.4.0-rc3

Type: Bug Priority: Major - P3
Reporter: Scott Hernandez (Inactive) Assignee: Judah Schvimer
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Backports
Depends
Related
is related to SERVER-27280 repl failpoints that block until they... Closed
Backwards Compatibility: Fully Compatible
Operating System: ALL
Backport Requested:
v3.2
Sprint: Repl 2016-11-21
Participants:
Linked BF Score: 0

 Description   

During shutdown if this failpoint is enabled it may deadlock the thread shutting down.

[2016/11/07 05:13:54.593] Thread 39 (Thread 0x7f530dc42700 (LWP 9183)):
[2016/11/07 05:13:54.593] #0  0x00007f5313c792fd in pthread_join () from /lib64/libpthread.so.0
[2016/11/07 05:13:54.593] #1  0x00007f53187c7d07 in std::thread::join() ()
[2016/11/07 05:13:54.593] #2  0x00007f53176d98f6 in mongo::repl::BackgroundSync::join(mongo::OperationContext*) ()
[2016/11/07 05:13:54.593] #3  0x00007f53177a5307 in mongo::repl::ReplicationCoordinatorExternalStateImpl::_stopDataReplication_inlock(mongo::OperationContext*, std::unique_lock<std::mutex>*) ()
[2016/11/07 05:13:54.594] #4  0x00007f53177a6b3d in mongo::repl::ReplicationCoordinatorExternalStateImpl::shutdown(mongo::OperationContext*) ()
[2016/11/07 05:13:54.594] #5  0x00007f53177b86aa in mongo::repl::ReplicationCoordinatorImpl::shutdown(mongo::OperationContext*) ()
[2016/11/07 05:13:54.594] #6  0x00007f5316f9d9d7 in shutdownTask() ()
[2016/11/07 05:13:54.594] #7  0x00007f5317cd3ea2 in mongo::(anonymous namespace)::runTasks(std::stack<std::function<void ()>, std::deque<std::function<void ()>, std::allocator<std::function<void ()> > > >) [clone .constprop.62] ()
[2016/11/07 05:13:54.594] #8  0x00007f5316f2dede in mongo::shutdown(mongo::ExitCode) ()
[2016/11/07 05:13:54.594] #9  0x00007f5317d54a9e in mongo::(anonymous namespace)::signalProcessingThread() ()
...
[2016/11/07 05:13:54.644] Thread 3 (Thread 0x7f52f4d02700 (LWP 9397)):
[2016/11/07 05:13:54.644] #0  0x00007f5317cd53c8 in mongo::FailPoint::shouldFailCloseBlock() ()
[2016/11/07 05:13:54.644] #1  0x00007f53176dc153 in mongo::repl::BackgroundSync::_produce(mongo::OperationContext*) ()
[2016/11/07 05:13:54.644] #2  0x00007f53176de1fd in mongo::repl::BackgroundSync::_runProducer() ()
[2016/11/07 05:13:54.644] #3  0x00007f53176de42a in mongo::repl::BackgroundSync::_run() ()
...



 Comments   
Comment by Githook User [ 08/Nov/16 ]

Author:

{u'username': u'judahschvimer', u'name': u'Judah Schvimer', u'email': u'judah@mongodb.com'}

Message: SERVER-26928 Check for shutdown in pauseRsBgSyncProducer failpoint
Branch: master
https://github.com/mongodb/mongo/commit/4b423738f24c1ccd628e1fcc2b9ee5fd84bbfe4c

Comment by Spencer Brody (Inactive) [ 07/Nov/16 ]

Judah, can you take this over?

Generated at Thu Feb 08 04:13:38 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.