Monguard: Log Ingestion in Atlas MMS

    • Type: Task
    • Resolution: Unresolved
    • Priority: Major - P3
    • Affects Version/s: None
    • Component/s: None
    • Monguard

      Atlas MMS uses certain logs from the Monguard process to provide alerting and monitoring capabilities. These logs support AFM, analytics, DW queries, and alerts based on specific patterns across Atlas (e.g., “all Fatal Assertion X across M70+ in us-east-1”).

      These logs are monitored by the Agent-embedded Filebeat spooler. It applies each rule’s query and projection locally and sends only the matching lines to MMS via dedicated log ingestion APIs. MMS persists the matches in nds.logIngestion.logs (TTL 7 days), exports them nightly to DW, and exposes them via a keyset-paginated Private API.
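
      As a rough illustration of the "query + projection" step described above, the sketch below filters JSON log lines against a rule and keeps only the projected fields. It is not the Agent's or Filebeat's actual implementation. The rule shape (an equality match on dotted paths plus a list of projected paths), the RULE contents, the attr.faultCode field, and all function names are assumptions made for this example.

```python
import json

# Hypothetical rule: the real rule format is not given in this ticket.
RULE = {
    # Equality match on dotted paths (assumed semantics).
    "query": {"s": "F", "attr.shouldIngest": True},
    # Dotted paths to keep in the forwarded document (assumed semantics).
    "projection": ["t", "s", "id", "msg", "attr.faultCode"],
}

_MISSING = object()  # sentinel for "path not present"


def get_path(doc, dotted):
    """Return the value at a dotted path, or _MISSING if any segment is absent."""
    cur = doc
    for key in dotted.split("."):
        if not isinstance(cur, dict) or key not in cur:
            return _MISSING
        cur = cur[key]
    return cur


def matches(doc, query):
    """True if every queried path exists and equals the expected value."""
    return all(get_path(doc, path) == expected for path, expected in query.items())


def project(doc, paths):
    """Keep only the requested paths, skipping those that are absent."""
    out = {}
    for path in paths:
        value = get_path(doc, path)
        if value is not _MISSING:
            out[path] = value
    return out


def filter_lines(lines, rule):
    """Yield projected documents for lines that parse as JSON objects and match."""
    for line in lines:
        try:
            doc = json.loads(line)
        except json.JSONDecodeError:
            continue  # non-JSON lines can never match a rule in this sketch
        if isinstance(doc, dict) and matches(doc, rule["query"]):
            yield project(doc, rule["projection"])
```

      In this sketch only the projected subset of a matching line would be sent on, which mirrors the point above that non-matching lines never leave the host.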

      Atlas Log Ingestion


      Tasks

      Based on the mongotune pattern (scope + design + rule forms + AFM integration), these are the concrete steps Monguard should own or co-own:

      1. Define a Monguard log schema for ingestible events
      2. Add ingestion/alert flags to Monguard logs (mirroring mongotune)
      3. Ensure all Monguard logs used for ingestion follow the server JSON structure
      4. Write and publish a Monguard log format spec
      5. Identify the initial set of log ingestion rules to request
        • Start with a small, high-value set (e.g. all faults with shouldIngest=true, plus one or two “monguard crash”/“unrecoverable error” rules).
      6. File Log Ingestion Rule Request Forms + HELP tickets
        • Ask ACAD/Fleet Rollout to:
          • Create rules in DEV, QA, and PROD consistently (same pattern as “Mongotune observability (dry run)/(wet run)” rules).
      7. Test end-to-end in lower environments
      8. Integrate with AFM and downstream alerting
      9. Capacity / safety work
      10. Runbook & docs
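
      The first three tasks above (schema, ingestion flag, server JSON structure) could be prototyped along the following lines. This is a minimal sketch, not an agreed format. It assumes the top-level keys of the MongoDB server's structured log lines (t, s, c, id, ctx, msg, attr) and assumes the shouldIngest flag lives under attr. The component name, ctx value, log id, and the faultCode attribute are hypothetical placeholders.

```python
import json
from datetime import datetime, timezone


def format_monguard_log(severity, component, log_id, msg, attr,
                        should_ingest=False, now=None):
    """Build one JSON log line shaped like a server structured log entry.

    Placing shouldIngest inside "attr" is an assumption for this sketch;
    the real location would be fixed by the Monguard log format spec.
    """
    now = now or datetime.now(timezone.utc)
    return json.dumps({
        "t": {"$date": now.isoformat(timespec="milliseconds")},
        "s": severity,        # e.g. "F" fatal, "E" error, "I" info
        "c": component,       # hypothetical component name
        "id": log_id,         # hypothetical stable log id
        "ctx": "monguard",    # hypothetical context value
        "msg": msg,
        "attr": {**attr, "shouldIngest": should_ingest},
    })


# Usage: a fault that an ingestion rule could select on shouldIngest=true.
line = format_monguard_log(
    "F", "MONGUARD", 9000001, "Unrecoverable error",
    {"faultCode": 7}, should_ingest=True,
)
```

      Emitting the flag on every line, including an explicit false, keeps rule queries to a simple equality check rather than an existence test.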


            Assignee:
            Unassigned
            Reporter:
            Anand Paithankar