Details
-
Improvement
-
Status: Closed
-
Major - P3
-
Resolution: Fixed
-
None
Description
I see that sometimes we exhaust all 10 retries within less than a minute
Oct 2, 2018 2:45:57 pm Status changed from running to quarantined by mci.
Oct 2, 2018 2:45:57 pm New agent deploy failed
Oct 2, 2018 2:45:52 pm New agent deploy failed
Oct 2, 2018 2:45:46 pm New agent deploy failed
Oct 2, 2018 2:45:45 pm New agent deploy failed
Oct 2, 2018 2:45:41 pm New agent deploy failed
Oct 2, 2018 2:45:38 pm New agent deploy failed
Oct 2, 2018 2:45:31 pm New agent deploy failed
Oct 2, 2018 2:45:27 pm New agent deploy failed
Oct 2, 2018 2:45:22 pm New agent deploy failed
Oct 2, 2018 2:45:16 pm New agent deploy failed
Oct 2, 2018 2:45:12 pm Status changed from quarantined to running by ******.
Can we a delay between the attempts?
It's possible for hosts to have short intermittent networking problems, having longer retries will help prevent some of them from getting quarantined and requiring manual intervention.
At the same time, we recognize that some vendors are more prone to those types of problems and that needs to be addressed separately and we're working on that.
Attachments
Issue Links
- is duplicated by
-
EVG-5274 Agent deploy jobs should requeue themselves
-
- Closed
-