[SERVER-57093] Heartbeat thread in backup_utils.js timed out trying to connect Created: 20/May/21  Updated: 06/Dec/22  Resolved: 21/May/21

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Improvement Priority: Major - P3
Reporter: Gregory Wlodarek Assignee: Backlog - Storage Execution Team
Resolution: Won't Do Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Depends
Assigned Teams:
Storage Execution
Participants:
Linked BF Score: 31

 Description   

There was a build failure where one of the heartbeat threads timed out trying to connect to the mongod, causing the test to fail. Heartbeat threads are used to keep idle cursors alive.

This happened on a UBSAN+Debug variant. I think the 5-second limit to establish a connection may be the cause here.

There are two things we can try doing here:

  • Don't run backup/restore tests on UBSAN+Debug due to the variants slowness
  • The heartbeat thread should retry connecting to the mongod

Generated at Thu Feb 08 05:40:57 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.