Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Done
Priority: Major - P3
Fix Version/s: 2.4.2, 2.5.0
Affects Version/s: None
Component/s: Sharding
Labels:
None

CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Certain failure modes of the SyncClusterConnection to the config server are safe failures (we are sure nothing has been written) - it is incorrect to fail by aborting mongod here since our state is fully known (the migration failed). Rollback and continue instead.

Original description:

If the config server is offline for a short amount of time during the critical section causing a write error but we're able to reconnect later, we should try to A) verify the write failure on all servers and B) locally rollback the new version, aborting the migration. Currently we fail hard, and rely on the mongod restart for the config reload.

Note this is different from the case when the config servers go down and stay down, since in that case we are unable to read the state of the metadata and so have to terminate.

Assignee:: Alberto Lerner (Inactive)
Reporter:: Greg Studer (Inactive)
Participants:: Alberto Lerner, auto, Greg Studer, Shaun Verch
Votes:: 4 Vote for this issue
Watchers:: 8 Start watching this issue

Created:: Oct 05 2012 01:26:02 PM UTC
Updated:: Jul 11 2016 05:59:21 PM UTC
Resolved:: Apr 02 2013 04:23:43 PM UTC

Details

Description

Attachments

Activity

People

Dates

PagerDuty