- Type: Bug
- Resolution: Fixed
- Priority: Major - P3
- Affects Version/s: None
- Component/s: Schema Management, Storage Engines, Storage Engines - Foundations
- Labels: (copied to CRM)
- Story Points: 8
- Sprint: StorEng - 2025-03-14, StorEng - 2025-03-28, StorEng - 2025-04-25
- Fix Version/s: v8.1, v8.0, v7.0, v6.0
Summary
Fix the issue where an unnecessary schema lock is taken for actively used file:-prefixed dhandles because their corresponding table:-prefixed dhandles are expired by the sweep server. This leads to schema lock contention, especially during checkpoint prepare, and degrades performance.
Description
When opening a file:-prefixed dhandle, the table:-prefixed dhandle is used to determine whether the corresponding file:-prefixed dhandle is a simple table. However, the sweep server also sweeps table:-prefixed dhandles, leading to their premature expiration.
Since file:-prefixed dhandles have code paths that reset their "Time of Death", they remain active. However, there is no "Time of Death" reset mechanism for table:-prefixed dhandles during schema operations, so they expire. Moreover, even when there are no schema operations, the corresponding table:-prefixed dhandles are still expired by the sweep server.
This results in:
- Unnecessary reopening of table:-prefixed dhandles, which requires taking the schema lock.
- Schema lock contention during checkpoint prepare, since checkpoint prepare also needs the schema lock.
- Performance degradation due to increased blocking between application threads and the checkpoint thread.
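To make the expiration mechanism concrete, here is a minimal conceptual sketch in Python. It is not WiredTiger's actual C implementation; the DhandleModel class, its fields, and the sweep function are simplified stand-ins that only model the "Time of Death" bookkeeping described above.

import time

# Conceptual model only: each dhandle records when it was last marked idle,
# and the sweep server expires handles that have been idle for too long.
class DhandleModel:
    def __init__(self, name):
        self.name = name            # e.g. 'file:test1.wt' or 'table:test1'
        self.time_of_death = 0      # 0 means "in use / not yet marked idle"

    def mark_idle(self, now):
        # The sweep server stamps a "time of death" on handles it finds idle.
        if self.time_of_death == 0:
            self.time_of_death = now

    def touch(self):
        # file:-prefixed dhandles have code paths that clear the stamp while
        # they are actively used; table:-prefixed dhandles currently do not,
        # which is why they expire even though the table is busy.
        self.time_of_death = 0

def sweep(dhandles, close_idle_time, now=None):
    # Return the handles the sweep server would close (and that later have to
    # be reopened under the schema lock) because they exceeded close_idle_time.
    now = now if now is not None else time.time()
    return [d for d in dhandles
            if d.time_of_death and now - d.time_of_death > close_idle_time]

In this model, touch() is only ever called for file:-prefixed handles, so their table:-prefixed counterparts accumulate idle time and are swept, which then forces a reopen under the schema lock as described above.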
Reproducer
In the Python test below, I create 1,000 dhandles and then, for 100 iterations, spawn one thread per dhandle to perform inserts, ensuring that all dhandles remain actively used.
import threading

# Methods from a wttest-based Python test; self.uri, self.conn and self.session
# are provided by the test harness.
def insert(self, i, start, rows):
    session = self.conn.open_session()
    uri = self.uri + str(i)
    cursor = session.open_cursor(uri)
    session.begin_transaction()
    for k in range(start, rows):
        cursor.set_key(k)
        cursor.set_value(str(k))
        cursor.insert()
    session.commit_transaction()
    cursor.close()
    session.close()

def test_dhandles(self):
    dhandles = 1000
    # Assumed table format matching the integer keys and string values above.
    format = 'key_format=i,value_format=S'
    for i in range(1, dhandles):
        uri = self.uri + str(i)
        self.session.create(uri, format)
    for _ in range(1, 100):
        threads = []
        for i in range(1, dhandles):
            thread = threading.Thread(target=self.insert, args=(i, 0, 100))
            thread.start()
            threads.append(thread)
        for thread in threads:
            thread.join()
To exercise the sweep server, I modify the connection configuration accordingly:
file_manager=(close_handle_minimum=0,close_idle_time=60,close_scan_interval=30)
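For reference, a minimal sketch of how such a connection configuration could be applied when opening WiredTiger from Python; in the wttest harness the same string would normally be supplied through the test's connection configuration (e.g. conn_config), and 'WT_HOME' is just a placeholder directory name.

from wiredtiger import wiredtiger_open

# Open a connection with an aggressive sweep configuration so idle dhandles
# are closed quickly; the file_manager settings mirror the line above.
conn = wiredtiger_open(
    'WT_HOME',
    'create,'
    'file_manager=(close_handle_minimum=0,close_idle_time=60,close_scan_interval=30)')
session = conn.open_session()
# ... run the reproducer workload here ...
conn.close()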
Scope:
- Decide between the potential solutions described in WT-13663.
- Fix the issue by applying the chosen solution.