Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: WT2.9.2, 3.2.13, 3.4.3, 3.5.4
Affects Version/s: None
Component/s: None
Labels: None
Sprint: Storage 2017-03-06
Story Points: None

Our Jenkins test detected a hang while shutting down a run of ../bench/wtperf/stress//shared-cache-stress.wtperf. The stack trace showed a thread still adjusting the cache pool:

Thread 22 (Thread 0x7f2e0cff9700 (LWP 129430)):
#0  0x000000000041b97f in __cache_pool_adjust (session=0x7f2e280174f0, highest=3487, bump_threshold=0, forward=true, adjustedp=0x7f2e0cff8eff)
    at ../src/conn/conn_cache_pool.c:573
#1  0x000000000041b4c6 in __cache_pool_balance (session=0x7f2e280174f0, forward=true) at ../src/conn/conn_cache_pool.c:446
#2  0x000000000041c035 in __wt_cache_pool_server (arg=0x7f2e280174f0) at ../src/conn/conn_cache_pool.c:758
#3  0x0000003e6ce07555 in start_thread () from /lib64/libpthread.so.0
#4  0x0000003e6cb02ded in clone () from /lib64/libc.so.6

In the debugger it became clear that the in-use proportion of the pool was never being adjusted, so the __cache_pool_balance call never completed. As a result, destroy could not acquire the cache pool spin lock and hung.
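
The failure mode described above can be illustrated with a small toy model. This is a sketch under stated assumptions, not WiredTiger's code and not necessarily the fix that was applied: `toy_cache`, `toy_adjust`, and `toy_balance` are hypothetical names, and the model assumes the balance loop keeps running while pool usage is above a target. If an adjustment pass changes nothing, usage can never fall, so the loop needs some way to notice the lack of progress and stop.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for one pool participant; not a WiredTiger type. */
struct toy_cache {
    uint64_t size;     /* bytes currently assigned to this cache */
    uint64_t reserved; /* floor below which the cache cannot shrink */
};

/*
 * One adjustment pass: reclaim up to `chunk` bytes from every cache that
 * sits above its reserve. Returns true if any cache was changed.
 */
static bool
toy_adjust(struct toy_cache *caches, size_t n, uint64_t chunk, uint64_t *in_use)
{
    bool adjusted = false;

    for (size_t i = 0; i < n; i++) {
        assert(caches[i].size >= caches[i].reserved);
        uint64_t spare = caches[i].size - caches[i].reserved;
        uint64_t take = spare < chunk ? spare : chunk;

        if (take == 0)
            continue; /* nothing to reclaim from this cache */
        caches[i].size -= take;
        *in_use -= take;
        adjusted = true;
    }
    return adjusted;
}

/*
 * Run adjustment passes until usage drops to `target`. Returns the number
 * of passes made. Without the early exit, a pass that changes nothing
 * would leave *in_use above target forever and the loop would spin.
 */
static unsigned
toy_balance(struct toy_cache *caches, size_t n, uint64_t chunk,
    uint64_t target, uint64_t *in_use)
{
    unsigned passes = 0;

    assert(chunk > 0);
    while (*in_use > target) {
        passes++;
        if (!toy_adjust(caches, n, chunk, in_use))
            break; /* no progress is possible: give up instead of hanging */
    }
    return passes;
}
```

In this model, a pool whose caches all sit at their reserve makes exactly one pass and returns. If such a loop ran while holding a lock, as the report suggests for the cache pool spin lock, returning promptly is what would let a waiting shutdown path acquire it.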

Assignee: Alexander Gorrod
Reporter: Alexander Gorrod
Votes: 0
Watchers: 1

Created: Feb 16 2017 11:15:07 PM UTC
Updated: Oct 12 2017 11:14:45 PM UTC
Resolved: Feb 17 2017 09:54:05 PM UTC
