[DOCS-5821] Need documentation to address issues seen in Cloud Manager and monitoring agent communication. Created: 10/Jul/15  Updated: 11/Jan/17  Resolved: 27/Jul/16

Status: Closed
Project: Documentation
Component/s: Cloud Manager, Ops Manager
Affects Version/s: None
Fix Version/s: 01112017-cleanup

Type: Task Priority: Major - P3
Reporter: Joshua Maag Assignee: Unassigned
Resolution: Won't Fix Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Participants:
Days since reply: 7 years, 29 weeks ago
Epic Link: docs-monitoring

 Description   

We regularly see an issue with users understanding the agents, how many are required for installation and how a monitoring agent communicates. I have put together a frequent response to this, but we should have better documentation to help the customers find this.

Here is my canned response:

Hi [customer name],

Thanks for contacting the Cloud Manager support team. Looking at the error, it looks like the monitoring agent is unable to communicate with the nodes. With Cloud Manager, you only need to have one Monitoring Agent and Backup agent per group. If you look at the [agent list|

Unknown macro: {url of agents for group}

], you'll see there is an active and standby for both. You can read more about the needed agents in this blog post.

In order to check connectivity, can you connect to the server the active monitoring agent is on, [server active monitoring agent is on]. Once connected to the shell, try connecting to the nodes that are unreachable with a mongo command like this:

mongo [host name]:[port]

You can also try pinging the servers from [unreachable server]. If either fail, then the agent is unable to resolve the host name of the instance. Please try this on each of the nodes and let me know.



 Comments   
Comment by Emily Hall [ 27/Jul/16 ]

Closed for housekeeping on 7/27/2016 by Emily Hall.
If you require additional support, please open a new ticket for prioritization.
Thanks,
Emily

Comment by Allison Reinheimer Moore [ 13/Jul/15 ]

To clarify, is this confusion generally found among folks using Cloud Manager *without* Automation?

Generated at Thu Feb 08 07:51:04 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.