Uploaded image for project: 'WiredTiger'
  1. WiredTiger
  2. WT-3590

Keep data consistent if writes fail during a clean shutdown

    • Type: Icon: Bug Bug
    • Resolution: Fixed
    • Priority: Icon: Critical - P2 Critical - P2
    • 3.4.10, 3.6.0-rc0, WT3.0.0, 3.2.20
    • Affects Version/s: None
    • Component/s: None
    • None
    • Storage 2017-10-02
    • v3.4, v3.2

      Write failures during a clean shutdown can leave WiredTiger tables inconsistent with each other on restart.

      WT_CONNECTION::close attempts to continue after most errors so that it can free as many resources as possible. However, continuing after some writes fail during clean shutdown could incorrectly go on to mark the database as clean so that recovery would be skipped next time the database is opened. If some tables were successfully flushed before the error occurred, this could lead to inconsistencies in the data.

      For example, if WiredTiger successfully closes a MongoDB _id index (making it durable), but writes fail before the corresponding collection becomes durable, then on restart, when replication replays the oplog entries, it can successfully insert into the collection but then unexpectedly find the _id entries from before the shutdown, causing an assertion failure.

      WiredTiger should do a full checkpoint as part of a clean shutdown to avoid trees becoming inconsistent in this situation.

            Assignee:
            sue.loverso@mongodb.com Susan LoVerso
            Reporter:
            michael.cahill@mongodb.com Michael Cahill (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            14 Start watching this issue

              Created:
              Updated:
              Resolved: