Type: Improvement
Resolution: Done
Priority: Major - P3
Fix Version/s: 2.6.6, 2.7.8
Affects Version/s: 2.4.8
Component/s: Replication, Stability
Labels:
None

Backwards Compatibility:
Minor Change
Backport Completed:

2.6.6
Case:
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When a new, clean node is added to a replica set and the source for its initial sync has corrupted data, the new node appears to sync whatever it can, silently ignoring every record it cannot fetch.

While this "best effort" behavior is understandable, it can lead to significant data inconsistency within the replica set. Data access errors during initial sync should not be ignored.

Example behavior (PRIMARY is the node with corruption in the dummy.acl namespace; SECONDARY is the newly synced secondary):

X:SECONDARY> rs.slaveOk()
X:SECONDARY> use dummy
switched to db dummy
X:SECONDARY> db.acl.count()
101
X:SECONDARY> exit
bye
AD-MAC10G:ff alexander$ mongo
MongoDB shell version: 2.4.8
connecting to: test
X:PRIMARY> use dummy
switched to db dummy
X:PRIMARY> db.acl.count()
10002
X:PRIMARY> exit
bye
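
The difference between the reported behavior and the requested one can be sketched in a small Python simulation. This is only an illustration under stated assumptions, not the server's implementation: `RecordReadError`, `read_record`, `clone_best_effort`, and `clone_fail_fast` are hypothetical names, and a `None` entry in a list stands in for a corrupted record on the sync source. How the server actually walks the source collection is not shown in this ticket.

```python
class RecordReadError(Exception):
    """Hypothetical error: a record on the sync source cannot be read."""


def read_record(source, position):
    """Return the record at `position`, or raise if it is corrupted.

    Assumption for this sketch: a corrupted record is modeled as None.
    """
    record = source[position]
    if record is None:
        raise RecordReadError(f"cannot read record at position {position}")
    return record


def clone_best_effort(source):
    """Copy every readable record and silently drop the unreadable ones.

    The new copy ends up smaller than the source, and nothing reports it.
    """
    copied = []
    for position in range(len(source)):
        try:
            copied.append(read_record(source, position))
        except RecordReadError:
            continue  # the error is swallowed; this is the problem case
    return copied


def clone_fail_fast(source):
    """Copy every record, letting the first read error abort the clone."""
    return [read_record(source, position) for position in range(len(source))]
```

With a source such as `["a", None, "b"]`, `clone_best_effort` returns a shorter list without any signal, while `clone_fail_fast` raises `RecordReadError`, so the caller knows the copy is incomplete instead of discovering a count mismatch later.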

is duplicated by

SERVER-17903 When corruption detected, server continues to run and sync secondaries

Closed

is related to

SERVER-1558 Documents should write checksum on write, verify checksum on read

Closed

related to

SERVER-17903 When corruption detected, server continues to run and sync secondaries

Closed

Assignee: Eric Milkie
Reporter: Alexander Komyagin (Inactive)
Participants: Alexander Komyagin, Eric Milkie, Githook User
Votes: 2
Watchers: 9

Created: Dec 12 2013 03:09:24 AM UTC
Updated: Sep 21 2017 04:51:25 PM UTC
Resolved: Oct 17 2014 04:06:37 PM UTC
