  1. Evergreen
  2. EVG-5433

Add a delay before deploying an agent to a server if previous agent deployment failed.


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major - P3
    • Resolution: Fixed
    • None
    • v2018.11.01
    • app

    Description

      Sometimes we exhaust all 10 retries in less than a minute:

      Oct 2, 2018 2:45:57 pm Status changed from running to quarantined by mci.
      Oct 2, 2018 2:45:57 pm New agent deploy failed
      Oct 2, 2018 2:45:52 pm New agent deploy failed
      Oct 2, 2018 2:45:46 pm New agent deploy failed
      Oct 2, 2018 2:45:45 pm New agent deploy failed
      Oct 2, 2018 2:45:41 pm New agent deploy failed
      Oct 2, 2018 2:45:38 pm New agent deploy failed
      Oct 2, 2018 2:45:31 pm New agent deploy failed
      Oct 2, 2018 2:45:27 pm New agent deploy failed
      Oct 2, 2018 2:45:22 pm New agent deploy failed
      Oct 2, 2018 2:45:16 pm New agent deploy failed
      Oct 2, 2018 2:45:12 pm Status changed from quarantined to running by ******.

      Can we add a delay between the attempts?

      Hosts can have short, intermittent networking problems. Spacing the retries over a longer period would help prevent some of those hosts from being quarantined and requiring manual intervention.

      At the same time, we recognize that some vendors are more prone to these problems. That needs to be addressed separately, and we're working on it.
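One way to picture the requested delay is a capped exponential backoff between deploy attempts. The sketch below is only an illustration of that idea, not Evergreen's implementation. The function names and the 30-second base and 10-minute cap are assumptions made for the example; only the limit of 10 attempts comes from this ticket.

```python
from datetime import datetime, timedelta
from typing import Optional

# Hypothetical tuning values; the ticket does not specify any delays.
BASE_DELAY = timedelta(seconds=30)
MAX_DELAY = timedelta(minutes=10)
# The ticket mentions 10 retries before the host is quarantined.
MAX_ATTEMPTS = 10


def backoff_delay(failed_attempts: int) -> timedelta:
    """Return how long to wait after `failed_attempts` consecutive failures."""
    if failed_attempts <= 0:
        return timedelta(0)
    # Double the delay for each additional failure; bound the exponent so the
    # multiplication cannot overflow, then apply the cap.
    exponent = min(failed_attempts - 1, 10)
    return min(BASE_DELAY * (2 ** exponent), MAX_DELAY)


def should_deploy(now: datetime,
                  last_failure: Optional[datetime],
                  failed_attempts: int) -> bool:
    """Decide whether a new agent deploy attempt may start at `now`."""
    if failed_attempts >= MAX_ATTEMPTS:
        return False  # out of retries; the host would be quarantined instead
    if last_failure is None:
        return True  # no previous failure recorded, so deploy immediately
    return now >= last_failure + backoff_delay(failed_attempts)
```

With these illustrative values, the nine waits between ten attempts add up to about 55 minutes, compared with the 41 seconds between the first and last failures in the log above.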

            People

              john.liu@mongodb.com John Liu
              zakhar.kleyman@mongodb.com Zakhar Kleyman
              Votes: 0
              Watchers: 3
