Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: WT2.7.0
Affects Version/s: None
Component/s: None
Labels:
None

Sprint:
None
Story Points:
None

michael.cahill, following up on this GitHub conversation

I was asking the question badly then, but I think there's a real problem.

I have a test case (attached) which does this:

Create a single table, populate it with 1000 key/value pairs.
Close and re-open the database, so we can fast-delete pages.
Truncate a chunk of the key/value pairs inside a transaction.
With the truncation uncommitted, checkpoint the database.
Crash.
Open the database
Check that all the keys are there.

The output I get is:

truncate 290 to 500
key 294, WT_NOTFOUND: item not found
ret: -31803 (WT_NOTFOUND: item not found)
Assertion failed: (__r == 0), function main, file t2.c, line 45.
Abort trap (core dumped)

What's going on is I'm truncating key/value pairs 290 to 500, and the first key on a new page boundary is key 294, so that page is fast-deleted, and after the crash, that's the first key we don't see.

The problem is not that we don't write the backing leaf page correctly (I think we do), but the internal page has a cell type of WT_CELL_DEL and when we read it, we assume all of the keys on the page have been deleted, which isn't correct because the deleting transaction never committed.

I think we need to fix the code in reconciliation to not write WT_CELL_DEL cells unless the delete is globally visible, but I'd need to stare at the code some more to be sure.

And, we need to think about named checkpoints, specifically in the page-reading code, where we recently made the change to short-circuit any deleted page or look-aside table handling if we're reading from a checkpoint handle – I think it's OK, but someone needs to review in the context of this ticket.

And, maybe there are logging implications?

Let me know if I'm just missing something!

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

WT2115.tar.gz
1 kB
Sep 18 2015 09:26:04 PM UTC

is depended on by

SERVER-20408 WiredTiger changes for MongoDB 3.1.9

Closed

SERVER-20479 WiredTiger changes for MongoDB 3.0.7

Closed

Assignee:: Keith Bostic (Inactive)
Reporter:: Keith Bostic (Inactive)
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Sep 18 2015 09:22:44 PM UTC
Updated:: Oct 12 2017 11:16:13 PM UTC
Resolved:: Sep 29 2015 12:36:13 AM UTC

Details

Description

Attachments

Attachments

Issue Links

Forms

Activity

People

Dates