[SERVER-20578] stale_clustered.js in noPassthroughWithMongod_WT fails with "waiting for state indicator state for 300000ms" Created: 14/Sep/15 Updated: 07/Oct/15 Resolved: 25/Sep/15 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Sharding |
| Affects Version/s: | None |
| Fix Version/s: | 3.1.9 |
| Type: | Bug | Priority: | Critical - P2 |
| Reporter: | Charlie Swanson | Assignee: | Kaloian Manassiev |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Backwards Compatibility: | Fully Compatible |
| Operating System: | ALL |
| Sprint: | Sharding A (10/09/15) |
| Participants: |
| Description |
|
stale_clustered.js in noPassthroughWithMongod_WT fails with "waiting for state indicator state for 300000ms". Failures observed on ASIO SSL Windows 2008R2, OS X 10.8, and SSL OS X 10.8 variants. See example task, example logs. Excerpt:
|
| Comments |
| Comment by Kaloian Manassiev [ 25/Sep/15 ] | |||||||
|
This is a different failure in the server selection logic it seems. I have opened | |||||||
| Comment by J Rassi [ 25/Sep/15 ] | |||||||
|
Re-opening, as this test is still failing in master. Failures from the past 24 hours:
The test appears to still fail on the same line (line 83), but now with a "node is recovering" message instead of a "waiting for state" message:
Kal, please investigate. | |||||||
| Comment by Githook User [ 24/Sep/15 ] | |||||||
|
Author: {u'username': u'kaloianm', u'name': u'Kaloian Manassiev', u'email': u'kaloian.manassiev@mongodb.com'}Message: Also reduce the ShardingTest oplog size in order to make tests run faster. In addition, this reverts commit eee325e63005939199f6081b1899f1c2863b0530. | |||||||
| Comment by David Storch [ 23/Sep/15 ] | |||||||
|
kaloian.manassiev, as part of renabling this test, please remove the useClusterClientCursor setParameter at the beginning: Rassi and I think that this was added unnecessarily in b1982bb7fb610. | |||||||
| Comment by Githook User [ 23/Sep/15 ] | |||||||
|
Author: {u'username': u'jrassi', u'name': u'Jason Rassi', u'email': u'rassi@10gen.com'}Message: | |||||||
| Comment by J Rassi [ 23/Sep/15 ] | |||||||
|
Three failures observed on OS X in the past 48 hours, one failure observed on SSL OS X 10.8 in the past 48 hours. Bumping to P2. kaloian.manassiev, are you the right assignee for this ticket? If so, please work on this today, or point me towards someone else more appropriate. | |||||||
| Comment by Charlie Swanson [ 16/Sep/15 ] | |||||||
|
Lowering to P4 since I haven't seen this very often. Still don't know why it's happening though. | |||||||
| Comment by Charlie Swanson [ 14/Sep/15 ] | |||||||
|
spencer, any idea what might be happening? Or who might be able to answer that? |