[SERVER-20805] Sharding tests that move the same chunk multiple times should use '_waitForDelete: true' Created: 05/Oct/15 Updated: 15/Oct/15 Resolved: 07/Oct/15 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Sharding |
| Affects Version/s: | None |
| Fix Version/s: | 3.2.0-rc0 |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Daniel Pasette (Inactive) | Assignee: | Kaloian Manassiev |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Backwards Compatibility: | Fully Compatible |
| Operating System: | ALL |
| Sprint: | Sharding A (10/09/15) |
| Participants: |
| Description |
|
A shard may not be a donor or recipient of a chunk if there are any outstanding deletes of previous migration. That way, tests, which move chunks back and forth multiple times may occasionally fail. Such tests should be made to pass _waitForDelete: true, so that the move chunk operation will wait for the range to be deleted. Only seen this once so far.
|
| Comments |
| Comment by Githook User [ 07/Oct/15 ] |
|
Author: {u'username': u'kaloianm', u'name': u'Kaloian Manassiev', u'email': u'kaloian.manassiev@mongodb.com'}Message: These are tests, which make multiple moveChunk calls. Adding |
| Comment by Andy Schwerin [ 05/Oct/15 ] |
|
The proximate cause of this failure is that a moveChunk command failed because a prior one hadn't finished cleaning up. In particular, this moveChunk failed, but we didn't assert.commandWorked its result, so we end up seeing the problem when we confirm that the document has changed/moved appropriately, later on. All of the commands in tests should really have their results checked. That said, this would only have exposed the error more clearly, not fixed it. We need to wait for cleanup to complete on the prior moveChunk before starting the subsequent one,presumably by setting _waitForDelete: 1 on all the moveChunk operations in this test. |
| Comment by Daniel Pasette (Inactive) [ 05/Oct/15 ] |
|
Assigning to Andy for triage and distribution. |