Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-75531

EVG tasks wrongly marked as system unresponsive

    • Fully Compatible
    • ALL

      Recently I saw an increased number of tasks marked as SYSTEM UNRESPONSIVE.

      It seems that in some cases those tasks were executing tests that actually failed but the task is wrongly labeled and the real error is swallowed.

      For instance recently the serverless suite keep failing consistently with SYSTEM UNRESPONSIVE.

      I've analyzed this task and I noticed that job3 was executing change_collection_server_stats.js (jobs logs) actually failed with:

      [js_test:change_collection_server_stats] assert.soon failed (timeout 600000ms): () => {
      [js_test:change_collection_server_stats]     // All change collection entries are removed but one.
      [js_test:change_collection_server_stats]     return changeCollection.count() === 1;
      [js_test:change_collection_server_stats] } 
      

      But this error is not reported in the task logs nor in the evergreen UI.

            Assignee:
            daniel.moody@mongodb.com Daniel Moody
            Reporter:
            tommaso.tocci@mongodb.com Tommaso Tocci
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

              Created:
              Updated:
              Resolved: