ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    How to monitor 100 cloud VM's

    IT Discussion
    10
    30
    1.8k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • scottalanmillerS
      scottalanmiller @Dashrender
      last edited by

      @dashrender said in How to monitor 100 cloud VM's:

      another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.

      Yup, a CPU Overview kind of screen would be nice.

      stacksofplatesS 1 Reply Last reply Reply Quote 1
      • bigbearB
        bigbear @krisleslie
        last edited by

        @krisleslie said in How to monitor 100 cloud VM's:

        I like the direction your going it would be totally cool to see 25 at a time. Its digestable.

        Um, what do these 1000 windows 2012 servers do???

        And when they break, why?

        Also, why?

        1 Reply Last reply Reply Quote 1
        • J
          Jimmy9008
          last edited by

          I'm doing these checks with PRTG locally, at even 75 servers the main screen is crazy to look at.

          1 Reply Last reply Reply Quote 1
          • J
            Jimmy9008
            last edited by

            The main screen is actually not that bad. This is a decent view, green means everything is within the agreed limits. Each item has five or so checks underneath. Disk, RAM, Ping, Event Log, Uptime etc... You can drill down to them by clicking on them.

            @krisleslie would that do what you need? I'd guess that screen will still be usable at 100 VMs as any issues would change from green to red, flagging it to you?

            0_1510827746165_1.PNG

            1 Reply Last reply Reply Quote 0
            • stacksofplatesS
              stacksofplates @scottalanmiller
              last edited by

              @scottalanmiller said in How to monitor 100 cloud VM's:

              @dashrender said in How to monitor 100 cloud VM's:

              another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.

              Yup, a CPU Overview kind of screen would be nice.

              I have a load screen in Grafana that shows just the load of every system. It's really handy.

              stacksofplatesS 1 Reply Last reply Reply Quote 0
              • stacksofplatesS
                stacksofplates @stacksofplates
                last edited by

                @stacksofplates said in How to monitor 100 cloud VM's:

                @scottalanmiller said in How to monitor 100 cloud VM's:

                @dashrender said in How to monitor 100 cloud VM's:

                another thought would be three different views, one for CPU, one for RAM, one for Network. each view would have a 10 x 10 grid of either green or red, the red meaning it's over some threshold.. they you click the box and get directly to the machine in question.

                Yup, a CPU Overview kind of screen would be nice.

                I have a load screen in Grafana that shows just the load of every system. It's really handy.

                And then there is another dashboard that does full system details with everything from Prometheus.

                1 Reply Last reply Reply Quote 0
                • K
                  krisleslie
                  last edited by

                  @stacksofplates said in How to monitor 100 cloud VM's:

                  Prometheus

                  I think any tool that can handle it would be of use.

                  If it's graphical and can do the job so be it. If it is a table and can do the job so be it.

                  I've tried suggesting and using Comodo ONE in this use case and I don't think it's up to the task for the job. It can monitor, and notify sure. But a visualization I'm not 100% sure about.

                  Same could be said about Spiceworks.

                  scottalanmillerS dbeatoD 2 Replies Last reply Reply Quote 0
                  • scottalanmillerS
                    scottalanmiller @krisleslie
                    last edited by

                    @krisleslie said in How to monitor 100 cloud VM's:

                    @stacksofplates said in How to monitor 100 cloud VM's:

                    Prometheus

                    I think any tool that can handle it would be of use.

                    If it's graphical and can do the job so be it. If it is a table and can do the job so be it.

                    I've tried suggesting and using Comodo ONE in this use case and I don't think it's up to the task for the job. It can monitor, and notify sure. But a visualization I'm not 100% sure about.

                    Same could be said about Spiceworks.

                    SW would be insanely heavy for that many machines to monitor and requires a dedicated Windows server to run.

                    1 Reply Last reply Reply Quote 2
                    • dbeatoD
                      dbeato @krisleslie
                      last edited by

                      @krisleslie said in How to monitor 100 cloud VM's:

                      @stacksofplates said in How to monitor 100 cloud VM's:

                      Prometheus

                      I think any tool that can handle it would be of use.

                      If it's graphical and can do the job so be it. If it is a table and can do the job so be it.

                      I've tried suggesting and using Comodo ONE in this use case and I don't think it's up to the task for the job. It can monitor, and notify sure. But a visualization I'm not 100% sure about.

                      Same could be said about Spiceworks.

                      I would recommend Zabbix. And other options below:
                      https://www.opennms.org/en/docs
                      https://my-netdata.io/
                      https://sensuapp.org/downloads

                      1 Reply Last reply Reply Quote 2
                      • NetworkNerdN
                        NetworkNerd
                        last edited by NetworkNerd

                        If these were my servers, I would want to see bandwidth usage too, especially if the cloud provider is charging me for it. The PRTG approach looks like a really good option.

                        I've heard good things about Sensu as well.

                        But no matter what you use, you need to be able to know what normal operation (performance, capacity, utilization) is like across the servers so you will truly know if the behavior you see is an outlier or expected behavior (i.e. a SQL VM spikes in CPU and memory usage because there are a ton of queries running for order inserts at the end of the day, etc.).

                        1 Reply Last reply Reply Quote 0
                        • 1
                        • 2
                        • 2 / 2
                        • First post
                          Last post