Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Won't Do
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Sharding NYC
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

This comes from discussion with lingzhi.deng about possible relation of PM-2248 thread liveness monitoring and SERVER-42308 that proposes to make it possible to trigger failpoints based on previously reached failpoints.

This idea is slightly different - when in debug mode, collect the exact sequence of visited checkpoints defined by the instrumentation of thread liveness monitoring and log them if the test fails. This way we will have the exact sequence of what happened in the code before the test failed.

The plan of PM-2248 was to dump stacks for instrumented threads in the failed tests anyway, but with timestamps. Timestamps are not sufficiently accurate and cannot be relied up to reason on extremely narrow races between multiple threads. This is just an incremental improvement to the planned feature, not much extra effort.

Lingzhi said: "Yes, that would be helpful. It is an idea similar to undoDB but we keep track of the last x sequences ourselves. But I still think it is valuable to be able to examine possible interleaving in unittests instead of integration tests. I know STM team has a project to integrate some thread/network fuzzing into our tests. So maybe that's good enough. In that case, having the checkpoint sequence logged will be helpful."

Assignee:: [DO NOT USE] Backlog - Sharding NYC
Reporter:: Andrew Shuvalov (Inactive)
Participants:: [DO NOT USE] Backlog - Sharding NYC, Andrew Shuvalov, Connie Chen, Ratika Gandhi
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Sep 24 2021 03:21:16 PM UTC
Updated:: Dec 06 2022 12:54:47 AM UTC
Resolved:: Sep 23 2022 06:06:57 PM UTC

Details

Description

Attachments

Activity

People

Dates