ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    How to monitor 100 cloud VM's

    IT Discussion
    10
    30
    1.8k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • black3dynamiteB
      black3dynamite
      last edited by

      Zabbix is another option.

      1 Reply Last reply Reply Quote 3
      • scottalanmillerS
        scottalanmiller @krisleslie
        last edited by

        @krisleslie said in How to monitor 100 cloud VM's:

        @dashrender Even if not on one, just enough to be able to break from having to manually check 100 machines.

        Yeah, SS can do that. It would be a list with like 25 per screen. And since SS is based here, if he had ideas on how he'd want to view things, that can be modified with new views to accomodate that kind of visibility.

        1 Reply Last reply Reply Quote 0
        • DashrenderD
          Dashrender
          last edited by

          I would think it would be better to set something like a notice threshold (per VM) and then only show things that exceed the threshold in a single view. Otherwise you could create your own groups - say all PBX VMs, all SQL VMs, etc to be shown as a group.

          scottalanmillerS 1 Reply Last reply Reply Quote 2
          • scottalanmillerS
            scottalanmiller @Dashrender
            last edited by

            @dashrender said in How to monitor 100 cloud VM's:

            I would think it would be better to set something like a notice threshold (per VM) and then only show things that exceed the threshold in a single view. Otherwise you could create your own groups - say all PBX VMs, all SQL VMs, etc to be shown as a group.

            Tagging, in the SS context. Tag a server and see those groups as views. Or use custom filters to sort. Like by customer or whatever.

            1 Reply Last reply Reply Quote 0
            • K
              krisleslie
              last edited by

              CPU usage
              RAM usage
              Network connectivity

              scottalanmillerS 1 Reply Last reply Reply Quote 0
              • scottalanmillerS
                scottalanmiller @krisleslie
                last edited by

                @krisleslie said in How to monitor 100 cloud VM's:

                CPU usage
                RAM usage
                Network connectivity

                That's all there right now.

                1 Reply Last reply Reply Quote 0
                • K
                  krisleslie
                  last edited by

                  I like the direction your going it would be totally cool to see 25 at a time. Its digestable.

                  bigbearB 1 Reply Last reply Reply Quote 0
                  • AdamFA
                    AdamF
                    last edited by

                    @scottalanmiller What type of resources are we talking about for SS for the agents?

                    scottalanmillerS 1 Reply Last reply Reply Quote 0
                    • DashrenderD
                      Dashrender
                      last edited by

                      another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.

                      scottalanmillerS 1 Reply Last reply Reply Quote 0
                      • scottalanmillerS
                        scottalanmiller @AdamF
                        last edited by

                        @fuznutz04 said in How to monitor 100 cloud VM's:

                        @scottalanmiller What type of resources are we talking about for SS for the agents?

                        Relatively little. It uses Salt Minion alone as the local agent right now. It's not zero, but it is low and doesn't need to be a priority.

                        1 Reply Last reply Reply Quote 0
                        • scottalanmillerS
                          scottalanmiller @Dashrender
                          last edited by

                          @dashrender said in How to monitor 100 cloud VM's:

                          another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.

                          Yup, a CPU Overview kind of screen would be nice.

                          stacksofplatesS 1 Reply Last reply Reply Quote 1
                          • bigbearB
                            bigbear @krisleslie
                            last edited by

                            @krisleslie said in How to monitor 100 cloud VM's:

                            I like the direction your going it would be totally cool to see 25 at a time. Its digestable.

                            Um, what do these 1000 windows 2012 servers do???

                            And when they break, why?

                            Also, why?

                            1 Reply Last reply Reply Quote 1
                            • J
                              Jimmy9008
                              last edited by

                              I'm doing these checks with PRTG locally, at even 75 servers the main screen is crazy to look at.

                              1 Reply Last reply Reply Quote 1
                              • J
                                Jimmy9008
                                last edited by

                                The main screen is actually not that bad. This is a decent view, green means everything is within the agreed limits. Each item has five or so checks underneath. Disk, RAM, Ping, Event Log, Uptime etc... You can drill down to them by clicking on them.

                                @krisleslie would that do what you need? I'd guess that screen will still be usable at 100 VMs as any issues would change from green to red, flagging it to you?

                                0_1510827746165_1.PNG

                                1 Reply Last reply Reply Quote 0
                                • stacksofplatesS
                                  stacksofplates @scottalanmiller
                                  last edited by

                                  @scottalanmiller said in How to monitor 100 cloud VM's:

                                  @dashrender said in How to monitor 100 cloud VM's:

                                  another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.

                                  Yup, a CPU Overview kind of screen would be nice.

                                  I have a load screen in Grafana that shows just the load of every system. It's really handy.

                                  stacksofplatesS 1 Reply Last reply Reply Quote 0
                                  • stacksofplatesS
                                    stacksofplates @stacksofplates
                                    last edited by

                                    @stacksofplates said in How to monitor 100 cloud VM's:

                                    @scottalanmiller said in How to monitor 100 cloud VM's:

                                    @dashrender said in How to monitor 100 cloud VM's:

                                    another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.

                                    Yup, a CPU Overview kind of screen would be nice.

                                    I have a load screen in Grafana that shows just the load of every system. It's really handy.

                                    And then there is another dashboard that does full system details with everything from Prometheus.

                                    1 Reply Last reply Reply Quote 0
                                    • K
                                      krisleslie
                                      last edited by

                                      @stacksofplates said in How to monitor 100 cloud VM's:

                                      Prometheus

                                      I think any tool that can handle it would be of use.

                                      If it's graphical and can do the job so be it. If it is a table and can do the job so be it.

                                      I've tried suggesting and using Comodo ONE in this use case and I don't think it's up to the task for the job. It can monitor, and notify sure. But a visualization I'm not 100% sure about.

                                      Same could be said about Spiceworks.

                                      scottalanmillerS dbeatoD 2 Replies Last reply Reply Quote 0
                                      • scottalanmillerS
                                        scottalanmiller @krisleslie
                                        last edited by

                                        @krisleslie said in How to monitor 100 cloud VM's:

                                        @stacksofplates said in How to monitor 100 cloud VM's:

                                        Prometheus

                                        I think any tool that can handle it would be of use.

                                        If it's graphical and can do the job so be it. If it is a table and can do the job so be it.

                                        I've tried suggesting and using Comodo ONE in this use case and I don't think it's up to the task for the job. It can monitor, and notify sure. But a visualization I'm not 100% sure about.

                                        Same could be said about Spiceworks.

                                        SW would be insanely heavy for that many machines to monitor and requires a dedicated Windows server to run.

                                        1 Reply Last reply Reply Quote 2
                                        • dbeatoD
                                          dbeato @krisleslie
                                          last edited by

                                          @krisleslie said in How to monitor 100 cloud VM's:

                                          @stacksofplates said in How to monitor 100 cloud VM's:

                                          Prometheus

                                          I think any tool that can handle it would be of use.

                                          If it's graphical and can do the job so be it. If it is a table and can do the job so be it.

                                          I've tried suggesting and using Comodo ONE in this use case and I don't think it's up to the task for the job. It can monitor, and notify sure. But a visualization I'm not 100% sure about.

                                          Same could be said about Spiceworks.

                                          I would recommend Zabbix. And other options below:
                                          https://www.opennms.org/en/docs
                                          https://my-netdata.io/
                                          https://sensuapp.org/downloads

                                          1 Reply Last reply Reply Quote 2
                                          • NetworkNerdN
                                            NetworkNerd
                                            last edited by NetworkNerd

                                            If these were my servers, I would want to see bandwidth usage too, especially if the cloud provider is charging me for it. The PRTG approach looks like a really good option.

                                            I've heard good things about Sensu as well.

                                            But no matter what you use, you need to be able to know what normal operation (performance, capacity, utilization) is like across the servers so you will truly know if the behavior you see is an outlier or expected behavior (i.e. a SQL VM spikes in CPU and memory usage because there are a ton of queries running for order inserts at the end of the day, etc.).

                                            1 Reply Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 1 / 2
                                            • First post
                                              Last post