Uploaded image for project: 'Documentation'
  1. Documentation
  2. DOCS-10425

Ops manager: Backup job is too busy alert, clarifications needed

    XMLWordPrintable

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major - P3
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Ops Manager
    • Labels:
      None

      Description

      The current docs about the alert "Backup job is too busy" may not be clear enough:

      https://docs.opsmanager.mongodb.com/current/reference/alerts/#Backup-job-is-busy-for...

      Backup job is busy for...
      Available only as a global alert.

      Sends an alert when a backup job has taken longer than the time specified. This could occur if you have an overloaded Backup Daemon or blockstore. Check the corresponding job log for error messages. Contact MongoDB Support if you need help interpreting the error message.

      Steve Briskin clarified it with this comment:

      The alert is meant to alert admins when a job is more active than expected. They are then responsible for investigating if this is normal (e.g. temporary spike in activity or known hardware degredation) or not (e.g. unexpected increase in activity or hardware is underprovisioned).
      The alert computes the amount of time the daemon spent working on applyOps and snapshot jobs over the last 24 hours and compares to their alert threshold. This alert is always based on a 24 hour period and is independent from the snapshot schedule.

      I'd suggest to start a discussion on how to improve the alert description in the Docs.

      As a side note the alert text is "Backup job is too busy" and not "Backup job is busy for..." in the current Ops Manager versions. This may be a leftover from previous Ops Manager versions.

      Thanks,
      Emilio

        Attachments

          Activity

            People

            Assignee:
            tony.sansone Anthony Sansone
            Reporter:
            emilio.scalise Emilio Scalise
            Participants:
            Last commenter:
            Anthony Sansone Anthony Sansone
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

              Dates

              Created:
              Updated:
              Resolved:
              Days since reply:
              2 years, 48 weeks, 6 days ago
              Date of 1st Reply: