Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: 1.3.1
Affects Version/s: None
Component/s: None
Labels:
None

CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

I'm trying to debug some issues that occur when we run our standard js tests in parallel. While looking at the code, I noticed the following:

/* called every 4 seconds. millis is amount of idle time passed since the last call – could be zero */
void ClientCursor::idleTimeReport(unsigned millis) {
recursive_boostlock lock(ccmutex);
for ( CCByLoc::iterator i = byLoc.begin(); i != byLoc.end(); ) {
CCByLoc::iterator j = i;
i++;
if( j->second->shouldTimeout( millis ) )

{ log(1) << "killing old cursor " << j->second->cursorid << ' ' << j->second->ns << " idle:" << j->second->idleTime() << "ms\n"; delete j->second; }

}
}

idleTimeReport() gets called with only a read lock in place, so it can happen in parallel with a getMore() request. getMore() grabs the client cursor mutex while it's finding a client cursor, but then it releases the client cursor mutex and continues to use the client cursor object it has found. I believe it's possible for idleTimeReport() to delete a getMore()'s client cursor after getMore() has looked up the client cursor but before getMore() has finished accessing the client cursor's attributes.

I'm sorry I haven't written a test case - that's hard to do for a rare race condition like this.

Assignee:: Dwight Merriman
Reporter:: Aaron Staple (Inactive)
Participants:: Aaron Staple, auto, Dwight Merriman
Votes:: 0 Vote for this issue
Watchers:: 0 Start watching this issue

Created:: Jan 04 2010 05:16:23 PM UTC
Updated:: Jul 12 2016 12:28:41 AM UTC
Resolved:: Jan 06 2010 01:55:59 PM UTC

Details

Description

Attachments

Activity

People

Dates