Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-100750

testSchedulingTwiceOnTimerCancelsFirstOne is flaky on very slow machines

    • Type: Icon: Bug Bug
    • Resolution: Fixed
    • Priority: Icon: Major - P3 Major - P3
    • 8.1.0-rc0
    • Affects Version/s: None
    • Component/s: None
    • None
    • Networking & Observability
    • Fully Compatible
    • ALL
    • Egress gRPC 2025-02-14
    • 200
    • None
    • 3
    • None
    • None
    • None
    • None
    • None
    • None

      The testSchedulingTwiceOnTimerCancelsFirstOne test sets an alarm for 1 second in the future and attempts to overwrite it with another timer. The cancellation that is involved with this goes through gRPC, and it involves scheduling work on the gRPC event engine. It's possible that on super slow machines (e.g. with TSAN or other sanitizers enabled), that 1 second isn't enough time for cancellation to catch the timer. We should extend this alarm to be indefinite to avoid this race.

            Assignee:
            patrick.freed@mongodb.com Patrick Freed
            Reporter:
            patrick.freed@mongodb.com Patrick Freed
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated:
              Resolved: