Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: WT10.0.1, 4.4.7, 5.0.0-rc0
Affects Version/s: None
Component/s: None
Labels:
- SERW
- dh50proj

Total Hours with Assigned Team:
47,009.97
Sprint:
Storage - Ra 2021-04-05, Storage - Ra 2021-04-19, Storage - Ra 2021-05-03
Story Points:
8

Backport Requested:

v4.4

The issue was initially reported as an unusually long load phase in py-tpcc workloads. The issue appears intermittently as a few inserts in the load phase will get stuck for more than a few hours. Related PERF and HELP tickets have more information on the history of the issue.

bruce.lucas has come up with a standalone reproducer, attached to the ticket.

Issue:
When inserting into a unique index, there is potential to get stuck repeatedly searching history store, ie calling __wt_hs_find_upd. We see a very high history store table reads missed statistic in these runs, which convey that these searches through history store are not returning anything. A callgraph that reflects this situation:

Observed behaviour with the repro script:

Observed behaviour in 4.4 is that insert rate is erratic, and a couple of the threads typically seem to get stuck apparently indefinitely with a high rate of missed history store reads with stacks like the above.

Acceptance criterion:

With the repro script: Expected behaviour (same as observed in 4.2) is that insert rate should be steady and each thread should complete at about the same time
py-tpcc load phase should not get stuck

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

441.png
Mar 04 2021 12:30:20 AM UTC
114 kB
Sulabh Mahajan
442.png
Mar 04 2021 12:30:18 AM UTC
140 kB
Sulabh Mahajan
hs.png
Mar 04 2021 12:21:51 AM UTC
198 kB
Sulabh Mahajan
image.png
Sep 07 2021 07:53:50 PM UTC
198 kB
Dianna Hohensee
image-2021-03-31-17-11-35-375.png
Mar 31 2021 06:11:36 AM UTC
29 kB
Luke Pearson
repro.sh
Mar 04 2021 12:25:57 AM UTC
1 kB
Sulabh Mahajan

causes

SERVER-58936 Unique index constraints may not be enforced

Closed

WT-8070 Remove discrepancy between prefix_key and prefix_search

Closed

depends on

WT-7912 Fix prefix search near optimisation to handle scenarios where the key range is split across pages.

Closed

is depended on by

WT-7912 Fix prefix search near optimisation to handle scenarios where the key range is split across pages.

Closed

SERVER-56509 Wrap unique index insertion _keyExists call in a WT cursor reconfigure.

Closed

is duplicated by

WT-6664 Cache eviction causes high latency during py-tpcc.

Closed

is related to

SERVER-67350 stalling during concurrent insertMany operations when unique index exists

Closed

WT-7653 Inconsistent update performance on unique indexes

Closed

WT-8091 Create cpp test performing unique indexes insertions

Closed

related to

WT-6664 Cache eviction causes high latency during py-tpcc.

Closed

WT-7912 Fix prefix search near optimisation to handle scenarios where the key range is split across pages.

Closed

(1 is duplicated by, 3 is related to, 2 related to)

Assignee:: Luke Pearson
Reporter:: Sulabh Mahajan
Votes:: 0 Vote for this issue
Watchers:: 22 Start watching this issue

Created:: Mar 04 2021 12:21:00 AM UTC
Updated:: Oct 29 2023 04:42:20 PM UTC
Resolved:: May 03 2021 07:35:06 AM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates