Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: WT2.9.1, 3.2.12, 3.4.2, 3.5.1
Affects Version/s: WT2.8.0
Component/s: None
Labels:
None
Environment:
Reproduced on both Amazon Linux and MacOS.

Total Hours with Assigned Team:
85,397.754
Sprint:
Storage 2016-11-21, Storage 2016-12-12
Story Points:
None

We believe we have run into a potential snapshot corruption issue in 2.8.0, when running compaction concurrently with writes. Minimal repro case is attached below, so would appreciate comments if there’s perhaps a problem with what we’re doing. What appears to happen is that default snapshot created during compaction has a table and its index out of sync with each other (records that exist in one are not in the other, and vice versa). Calling session.verify() does not report any problems, however explicitly comparing records in index to the table uncovers large number of mismatches.

We have not yet been able to fully confirm that only snapshot is corrupted, or if the state of BTree is affected as well. But early indication is that only snapshot is impacted, because explicitly calling checkpoint() after compact() gets the table and index into consistent state again.

Here’s the minimal repro case (using java APIs, but I don't think that's relevant here). The case goes like this:

· Open connection and create a table with one index
· Verify contents of the table against content of the index. (Note, there are no mutations at this time, so expect both to match perfectly).
· Start one writer thread (90/10 ratio of adds to deletes).
· Sleep 10 seconds
· Run compaction on both table and index
· Sleep 10 more seconds
· Exit

When run first time get expected “0 corrupted records out of 0”. When run the second time (note a snapshot now exists, created by compaction) get the following error “6598 corrupted records out of 1138935”

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

CorruptionExample.java
3 kB
Oct 12 2016 07:53:03 PM UTC

links to

https://github.com/wiredtiger/wiredtiger/pull/3160

Assignee:: Keith Bostic (Inactive)
Reporter:: Joe
Votes:: 0 Vote for this issue
Watchers:: 7 Start watching this issue

Created:: Oct 12 2016 07:53:03 PM UTC
Updated:: Oct 12 2017 11:12:15 PM UTC
Resolved:: Dec 06 2016 03:06:42 AM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates