Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: WT2.9.2, 3.2.13, 3.4.4, 3.5.6
Affects Version/s: WT2.9.1
Component/s: None
Labels:
None

Assigned Teams:

Storage Engines
Total Hours with Assigned Team:
20,873.849
Epic Link:
SPM-771
Sprint:
Storage 2017-03-27
Story Points:
None
Case:

MongoDB customers have reported occasional slow operations when collection drops are executed concurrently with a checkpoint. MongoDB drop operations hold an exclusive database lock, so anything that causes a drop to be slow can block other operations.

There is machinery in the WiredTiger storage engine API implementation to deal with drops that cannot complete immediately. WiredTiger returns EBUSY and the storage engine implementation keeps a list of tables and will retry the drops later. This relies on a checkpoint_wait=false configuration to WT_SESSION::drop, which is intended to make the drop fail immediately instead of blocking due to a checkpoint.

However, if a table is clean at the start of a checkpoint then dropping it during the checkpoint with checkpoint_wait=false will block. The drop will spin on the special lock handle checkpoint is holding rather than failing immediately. The drop thread is holding the table lock at that point, blocking new cursors from being opened.

My preferred solution here is to move the handle gathering stage of checkpoints until after scrubbing, and after it has started its transaction because then it can safely skip clean handles. That is non-trivial because metadata-changing operations can complete after the transaction starts but before handles are locked (e.g., creates, bulk load completion).

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

Initial.png
337 kB
Jun 13 2017 12:47:37 PM UTC
metrics.png
378 kB
Jun 13 2017 12:48:19 PM UTC
recovered.png
727 kB
Jul 17 2017 11:29:11 AM UTC
screenshot-1.png
169 kB
Apr 21 2017 09:54:46 AM UTC
slightly later.png
586 kB
Jun 13 2017 12:47:59 PM UTC
Starting.png
459 kB
Jul 17 2017 11:25:00 AM UTC

is duplicated by

SERVER-29419 Dropping unused indexes adversely affects query performance

Closed

SERVER-29811 extremely slow reads from secondaries after drop of unused indexes

Closed

links to

https://github.com/wiredtiger/wiredtiger/pull/3319

Assignee:: Michael Cahill
Reporter:: Michael Cahill
Votes:: 0 Vote for this issue
Watchers:: 12 Start watching this issue

Created:: Mar 06 2017 04:52:32 AM UTC
Updated:: Mar 04 2024 07:29:51 AM UTC
Resolved:: Mar 13 2017 06:14:09 AM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates