How to monitor 100 cloud VM's
-
There is work being done on this right now to get it ready, but in theory, you might be able to move to this yet today.
-
Scott I got a friend that is managing 100+ servers, we need to be able to see how the health is looking. Host is https://virmach.com/
-
@scottalanmiller Comodo ONE can't handle it from the aspect of giving him visibility to all them at one time.
-
@krisleslie said in How to monitor 100 cloud VM's:
@scottalanmiller Comodo ONE can't handle it from the aspect of giving him visibility to all them at one time.
How could anything? 100 CPU (assuming single socket servers), 100 machines of RAM, 100 machines of disk.. that wouldn't all fit in one screen and be large enough to be of real use.
-
@krisleslie said in How to monitor 100 cloud VM's:
@scottalanmiller Comodo ONE can't handle it from the aspect of giving him visibility to all them at one time.
What do you want all at once? What would that view look like?
-
@dashrender said in How to monitor 100 cloud VM's:
@krisleslie said in How to monitor 100 cloud VM's:
@scottalanmiller Comodo ONE can't handle it from the aspect of giving him visibility to all them at one time.
How could anything? 100 CPU (assuming single socket servers), 100 machines of RAM, 100 machines of disk.. that wouldn't all fit in one screen and be large enough to be of real use.
That's what I'm wondering about the view. What would we want to look at?
-
@dashrender Even if not on one, just enough to be able to break from having to manually check 100 machines.
-
Zabbix is another option.
-
@krisleslie said in How to monitor 100 cloud VM's:
@dashrender Even if not on one, just enough to be able to break from having to manually check 100 machines.
Yeah, SS can do that. It would be a list with like 25 per screen. And since SS is based here, if he had ideas on how he'd want to view things, that can be modified with new views to accomodate that kind of visibility.
-
I would think it would be better to set something like a notice threshold (per VM) and then only show things that exceed the threshold in a single view. Otherwise you could create your own groups - say all PBX VMs, all SQL VMs, etc to be shown as a group.
-
@dashrender said in How to monitor 100 cloud VM's:
I would think it would be better to set something like a notice threshold (per VM) and then only show things that exceed the threshold in a single view. Otherwise you could create your own groups - say all PBX VMs, all SQL VMs, etc to be shown as a group.
Tagging, in the SS context. Tag a server and see those groups as views. Or use custom filters to sort. Like by customer or whatever.
-
CPU usage
RAM usage
Network connectivity -
@krisleslie said in How to monitor 100 cloud VM's:
CPU usage
RAM usage
Network connectivityThat's all there right now.
-
I like the direction your going it would be totally cool to see 25 at a time. Its digestable.
-
@scottalanmiller What type of resources are we talking about for SS for the agents?
-
another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.
-
@fuznutz04 said in How to monitor 100 cloud VM's:
@scottalanmiller What type of resources are we talking about for SS for the agents?
Relatively little. It uses Salt Minion alone as the local agent right now. It's not zero, but it is low and doesn't need to be a priority.
-
@dashrender said in How to monitor 100 cloud VM's:
another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.
Yup, a CPU Overview kind of screen would be nice.
-
@krisleslie said in How to monitor 100 cloud VM's:
I like the direction your going it would be totally cool to see 25 at a time. Its digestable.
Um, what do these 1000 windows 2012 servers do???
And when they break, why?
Also, why?
-
I'm doing these checks with PRTG locally, at even 75 servers the main screen is crazy to look at.