ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Domain Controller Down (VM)

    Scheduled Pinned Locked Moved IT Discussion
    609 Posts 16 Posters 139.0k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • stacksofplatesS
      stacksofplates @scottalanmiller
      last edited by

      @scottalanmiller said in Domain Controller Down (VM):

      @stacksofplates said in Domain Controller Down (VM):

      No we don't need bigger systems. We have jobs that run over 20 nodes (multiple clusters) at 100% each with 256 GB RAM and 20 E7 cores. They take 100% no matter what you run. And they run for as long as 6 months on one calculation. There is no scale issue. I plug the new one in and it's part of the cluster.

      That's kind of a scale issue, though. You can get single servers bigger than that that would likely do the calculation in a fraction of the time. Not seconds, still months, but potentially a bit faster and without needing a cluster (and all big iron servers are 100% virtual.) You only have this semi-physical option because you are running below a certain scale and virtualizing at the cluster level.

      It won't though. This breaks the calculations up so each is manageable. CFD is kind of nuts. The acoustics solves are crazy also. They kind of have to be broken up in pieces.

      stacksofplatesS scottalanmillerS 2 Replies Last reply Reply Quote 0
      • stacksofplatesS
        stacksofplates @stacksofplates
        last edited by

        @stacksofplates said in Domain Controller Down (VM):

        @scottalanmiller said in Domain Controller Down (VM):

        @stacksofplates said in Domain Controller Down (VM):

        No we don't need bigger systems. We have jobs that run over 20 nodes (multiple clusters) at 100% each with 256 GB RAM and 20 E7 cores. They take 100% no matter what you run. And they run for as long as 6 months on one calculation. There is no scale issue. I plug the new one in and it's part of the cluster.

        That's kind of a scale issue, though. You can get single servers bigger than that that would likely do the calculation in a fraction of the time. Not seconds, still months, but potentially a bit faster and without needing a cluster (and all big iron servers are 100% virtual.) You only have this semi-physical option because you are running below a certain scale and virtualizing at the cluster level.

        It won't though. This breaks the calculations up so each is manageable. CFD is kind of nuts. The acoustics solves are crazy also. They kind of have to be broken up in pieces.

        We've done a bit of research on what the highest we can get out of everything is (for single jobs). We are at about the peak right now.

        If we have 300 jobs running then we could get more, but we're at about the max right now. (Unless we want to spend a ton of money for very little more performance).

        scottalanmillerS 1 Reply Last reply Reply Quote 0
        • DashrenderD
          Dashrender @scottalanmiller
          last edited by

          @scottalanmiller said in Domain Controller Down (VM):

          @Dashrender said in Domain Controller Down (VM):

          @stacksofplates said in Domain Controller Down (VM):

          @scottalanmiller said in Domain Controller Down (VM):

          @stacksofplates said in Domain Controller Down (VM):

          @scottalanmiller said in Domain Controller Down (VM):

          @stacksofplates said in Domain Controller Down (VM):

          If you're running on something using PV drivers that they don't understand...

          Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

          When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

          So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

          What about IO mode? native, threads, default?

          No one can support every hypervisor at that level.

          Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

          You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

          I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
          The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
          SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
          RDS single proc xeon 4 core 2 drive boot, 2 drive data
          IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
          etc
          etc

          Because... no support 🙂

          eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

          scottalanmillerS 1 Reply Last reply Reply Quote 0
          • scottalanmillerS
            scottalanmiller @Dashrender
            last edited by

            @Dashrender said in Domain Controller Down (VM):

            @scottalanmiller said in Domain Controller Down (VM):

            @Dashrender said in Domain Controller Down (VM):

            @stacksofplates said in Domain Controller Down (VM):

            @scottalanmiller said in Domain Controller Down (VM):

            @stacksofplates said in Domain Controller Down (VM):

            @scottalanmiller said in Domain Controller Down (VM):

            @stacksofplates said in Domain Controller Down (VM):

            If you're running on something using PV drivers that they don't understand...

            Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

            When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

            So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

            What about IO mode? native, threads, default?

            No one can support every hypervisor at that level.

            Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

            You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

            I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
            The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
            SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
            RDS single proc xeon 4 core 2 drive boot, 2 drive data
            IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
            etc
            etc

            Because... no support 🙂

            eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

            That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

            DustinB3403D DashrenderD 2 Replies Last reply Reply Quote 0
            • scottalanmillerS
              scottalanmiller @stacksofplates
              last edited by

              @stacksofplates said in Domain Controller Down (VM):

              We've done a bit of research on what the highest we can get out of everything is (for single jobs). We are at about the peak right now.

              If we have 300 jobs running then we could get more, but we're at about the max right now. (Unless we want to spend a ton of money for very little more performance).

              Have they looked at other architectures like ARM, Power and Sparc? That's often where the big performance boosts are, but not always. It's very workload dependent.

              stacksofplatesS 1 Reply Last reply Reply Quote 0
              • DustinB3403D
                DustinB3403 @scottalanmiller
                last edited by

                @scottalanmiller said in Domain Controller Down (VM):

                @Dashrender said in Domain Controller Down (VM):

                @scottalanmiller said in Domain Controller Down (VM):

                @Dashrender said in Domain Controller Down (VM):

                @stacksofplates said in Domain Controller Down (VM):

                @scottalanmiller said in Domain Controller Down (VM):

                @stacksofplates said in Domain Controller Down (VM):

                @scottalanmiller said in Domain Controller Down (VM):

                @stacksofplates said in Domain Controller Down (VM):

                If you're running on something using PV drivers that they don't understand...

                Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                What about IO mode? native, threads, default?

                No one can support every hypervisor at that level.

                Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                RDS single proc xeon 4 core 2 drive boot, 2 drive data
                IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                etc
                etc

                Because... no support 🙂

                eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                DashrenderD scottalanmillerS 2 Replies Last reply Reply Quote 0
                • DashrenderD
                  Dashrender @scottalanmiller
                  last edited by

                  @scottalanmiller said in Domain Controller Down (VM):

                  @Dashrender said in Domain Controller Down (VM):

                  @scottalanmiller said in Domain Controller Down (VM):

                  @Dashrender said in Domain Controller Down (VM):

                  @stacksofplates said in Domain Controller Down (VM):

                  @scottalanmiller said in Domain Controller Down (VM):

                  @stacksofplates said in Domain Controller Down (VM):

                  @scottalanmiller said in Domain Controller Down (VM):

                  @stacksofplates said in Domain Controller Down (VM):

                  If you're running on something using PV drivers that they don't understand...

                  Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                  When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                  So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                  What about IO mode? native, threads, default?

                  No one can support every hypervisor at that level.

                  Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                  You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                  I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                  The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                  SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                  RDS single proc xeon 4 core 2 drive boot, 2 drive data
                  IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                  etc
                  etc

                  Because... no support 🙂

                  eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                  That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                  LOL - Short of someone like Epic, from what I can tell, they are mostly software developers, who don't care about the hardware/VM it's running on. They don't approach the software holistically.

                  scottalanmillerS S 2 Replies Last reply Reply Quote 0
                  • scottalanmillerS
                    scottalanmiller @stacksofplates
                    last edited by

                    @stacksofplates said in Domain Controller Down (VM):

                    @scottalanmiller said in Domain Controller Down (VM):

                    @stacksofplates said in Domain Controller Down (VM):

                    No we don't need bigger systems. We have jobs that run over 20 nodes (multiple clusters) at 100% each with 256 GB RAM and 20 E7 cores. They take 100% no matter what you run. And they run for as long as 6 months on one calculation. There is no scale issue. I plug the new one in and it's part of the cluster.

                    That's kind of a scale issue, though. You can get single servers bigger than that that would likely do the calculation in a fraction of the time. Not seconds, still months, but potentially a bit faster and without needing a cluster (and all big iron servers are 100% virtual.) You only have this semi-physical option because you are running below a certain scale and virtualizing at the cluster level.

                    It won't though. This breaks the calculations up so each is manageable. CFD is kind of nuts. The acoustics solves are crazy also. They kind of have to be broken up in pieces.

                    I understand how it works, that's why we used massive Monte Carlo clusters and such. Some calculations are pretty discrete. but ther eis still cluster coordination overhead.

                    1 Reply Last reply Reply Quote 0
                    • stacksofplatesS
                      stacksofplates @scottalanmiller
                      last edited by

                      @scottalanmiller said in Domain Controller Down (VM):

                      @stacksofplates said in Domain Controller Down (VM):

                      We've done a bit of research on what the highest we can get out of everything is (for single jobs). We are at about the peak right now.

                      If we have 300 jobs running then we could get more, but we're at about the max right now. (Unless we want to spend a ton of money for very little more performance).

                      Have they looked at other architectures like ARM, Power and Sparc? That's often where the big performance boosts are, but not always. It's very workload dependent.

                      We used to have a lot of Sparc (still have a lot around), but they found the performance was better on x86. (before my time).

                      1 Reply Last reply Reply Quote 0
                      • scottalanmillerS
                        scottalanmiller @Dashrender
                        last edited by

                        @Dashrender said in Domain Controller Down (VM):

                        @scottalanmiller said in Domain Controller Down (VM):

                        @Dashrender said in Domain Controller Down (VM):

                        @scottalanmiller said in Domain Controller Down (VM):

                        @Dashrender said in Domain Controller Down (VM):

                        @stacksofplates said in Domain Controller Down (VM):

                        @scottalanmiller said in Domain Controller Down (VM):

                        @stacksofplates said in Domain Controller Down (VM):

                        @scottalanmiller said in Domain Controller Down (VM):

                        @stacksofplates said in Domain Controller Down (VM):

                        If you're running on something using PV drivers that they don't understand...

                        Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                        When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                        So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                        What about IO mode? native, threads, default?

                        No one can support every hypervisor at that level.

                        Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                        You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                        I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                        The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                        SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                        RDS single proc xeon 4 core 2 drive boot, 2 drive data
                        IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                        etc
                        etc

                        Because... no support 🙂

                        eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                        That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                        LOL - Short of someone like Epic, from what I can tell, they are mostly software developers, who don't care about the hardware/VM it's running on. They don't approach the software holistically.

                        That's just another way of writing... not well supported. If they demand certain hardware but don't know anything about hardware, are they really supporting it or just "not supporting" things.

                        1 Reply Last reply Reply Quote 1
                        • DashrenderD
                          Dashrender @DustinB3403
                          last edited by

                          @DustinB3403 said in Domain Controller Down (VM):

                          @scottalanmiller said in Domain Controller Down (VM):

                          @Dashrender said in Domain Controller Down (VM):

                          @scottalanmiller said in Domain Controller Down (VM):

                          @Dashrender said in Domain Controller Down (VM):

                          @stacksofplates said in Domain Controller Down (VM):

                          @scottalanmiller said in Domain Controller Down (VM):

                          @stacksofplates said in Domain Controller Down (VM):

                          @scottalanmiller said in Domain Controller Down (VM):

                          @stacksofplates said in Domain Controller Down (VM):

                          If you're running on something using PV drivers that they don't understand...

                          Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                          When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                          So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                          What about IO mode? native, threads, default?

                          No one can support every hypervisor at that level.

                          Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                          You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                          I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                          The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                          SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                          RDS single proc xeon 4 core 2 drive boot, 2 drive data
                          IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                          etc
                          etc

                          Because... no support 🙂

                          eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                          That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                          Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                          it's because they don't ever support anything other than their own software, most likely - they think the server hardware is magic. it will just do what they want, they don't KNOW they need xyz IOPs, or RAM, etc.

                          DustinB3403D scottalanmillerS 2 Replies Last reply Reply Quote 0
                          • DustinB3403D
                            DustinB3403 @Dashrender
                            last edited by

                            @Dashrender said in Domain Controller Down (VM):

                            @DustinB3403 said in Domain Controller Down (VM):

                            @scottalanmiller said in Domain Controller Down (VM):

                            @Dashrender said in Domain Controller Down (VM):

                            @scottalanmiller said in Domain Controller Down (VM):

                            @Dashrender said in Domain Controller Down (VM):

                            @stacksofplates said in Domain Controller Down (VM):

                            @scottalanmiller said in Domain Controller Down (VM):

                            @stacksofplates said in Domain Controller Down (VM):

                            @scottalanmiller said in Domain Controller Down (VM):

                            @stacksofplates said in Domain Controller Down (VM):

                            If you're running on something using PV drivers that they don't understand...

                            Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                            When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                            So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                            What about IO mode? native, threads, default?

                            No one can support every hypervisor at that level.

                            Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                            You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                            I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                            The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                            SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                            RDS single proc xeon 4 core 2 drive boot, 2 drive data
                            IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                            etc
                            etc

                            Because... no support 🙂

                            eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                            That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                            Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                            it's because they don't ever support anything other than their own software, most likely - they think the server hardware is magic. it will just do what they want, they don't KNOW they need xyz IOPs, or RAM, etc.

                            And therefore should be on the short list of software vendors from which the business should be considering.

                            1 Reply Last reply Reply Quote 0
                            • scottalanmillerS
                              scottalanmiller @Dashrender
                              last edited by

                              @Dashrender said in Domain Controller Down (VM):

                              @stacksofplates said in Domain Controller Down (VM):

                              @scottalanmiller said in Domain Controller Down (VM):

                              @stacksofplates said in Domain Controller Down (VM):

                              No we don't need bigger systems. We have jobs that run over 20 nodes (multiple clusters) at 100% each with 256 GB RAM and 20 E7 cores. They take 100% no matter what you run. And they run for as long as 6 months on one calculation. There is no scale issue. I plug the new one in and it's part of the cluster.

                              Ah, you are using a different type of virtualization. You are using a multi-system virtualization platform. That's different that it is semantically not what people normally mean by virtualization but is covered by what we mean. That's not a server, that's a node in a server. I ran the 10K node compute cluster for a Wall St. firm. That was virtual, but the virtualization was at a higher layer above the individual nodes. Like you have there. It's one cluster, not individual servers.

                              It's just a job scheduler over the nodes. It's not really virtualization (I guess maybe if you consider the scheduler some kind of virtualization). The master node just runs on whatever we tell it. It can also just run on itself.

                              But the work is spread out over the different compute nodes, the fact that the main program doesn't care where the compute comes from is what makes this a virtualized setup. Nodes versus servers. i.e. a node is useless on it's own, like a CPU is useless outside the server, but a server is a complete thing, it's usable on it's own.

                              Exactly. It doesn't "Exactly" have the hardware abstraction that you want to see from virtualization, but it has most of the aspects of it. The entire cluster is a single computer and has hardware abstraction at a higher level abstracting the entire cluster as a single thing. You could absolutely still virtualize again on the individual nodes, but the "always virtualize" rule is handled by the abstraction higher up. I've written about this scenario before.

                              1 Reply Last reply Reply Quote 1
                              • scottalanmillerS
                                scottalanmiller @DustinB3403
                                last edited by

                                @DustinB3403 said in Domain Controller Down (VM):

                                @scottalanmiller said in Domain Controller Down (VM):

                                @Dashrender said in Domain Controller Down (VM):

                                @scottalanmiller said in Domain Controller Down (VM):

                                @Dashrender said in Domain Controller Down (VM):

                                @stacksofplates said in Domain Controller Down (VM):

                                @scottalanmiller said in Domain Controller Down (VM):

                                @stacksofplates said in Domain Controller Down (VM):

                                @scottalanmiller said in Domain Controller Down (VM):

                                @stacksofplates said in Domain Controller Down (VM):

                                If you're running on something using PV drivers that they don't understand...

                                Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                                When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                                So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                                What about IO mode? native, threads, default?

                                No one can support every hypervisor at that level.

                                Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                                You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                                I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                                The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                                SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                                RDS single proc xeon 4 core 2 drive boot, 2 drive data
                                IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                                etc
                                etc

                                Because... no support 🙂

                                eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                                That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                                Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                                NOrmally it's because they don't have support at all and are just milking customers until things don't work. The illusion of support makes the money. Support itself is expensive.

                                DustinB3403D 1 Reply Last reply Reply Quote 0
                                • DustinB3403D
                                  DustinB3403 @scottalanmiller
                                  last edited by

                                  @scottalanmiller said in Domain Controller Down (VM):

                                  @DustinB3403 said in Domain Controller Down (VM):

                                  @scottalanmiller said in Domain Controller Down (VM):

                                  @Dashrender said in Domain Controller Down (VM):

                                  @scottalanmiller said in Domain Controller Down (VM):

                                  @Dashrender said in Domain Controller Down (VM):

                                  @stacksofplates said in Domain Controller Down (VM):

                                  @scottalanmiller said in Domain Controller Down (VM):

                                  @stacksofplates said in Domain Controller Down (VM):

                                  @scottalanmiller said in Domain Controller Down (VM):

                                  @stacksofplates said in Domain Controller Down (VM):

                                  If you're running on something using PV drivers that they don't understand...

                                  Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                                  When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                                  So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                                  What about IO mode? native, threads, default?

                                  No one can support every hypervisor at that level.

                                  Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                                  You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                                  I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                                  The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                                  SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                                  RDS single proc xeon 4 core 2 drive boot, 2 drive data
                                  IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                                  etc
                                  etc

                                  Because... no support 🙂

                                  eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                                  That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                                  Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                                  NOrmally it's because they don't have support at all and are just milking customers until things don't work. The illusion of support makes the money. Support itself is expensive.

                                  That was a rhetorical question. 🙂

                                  1 Reply Last reply Reply Quote 0
                                  • scottalanmillerS
                                    scottalanmiller @Dashrender
                                    last edited by

                                    @Dashrender said in Domain Controller Down (VM):

                                    @DustinB3403 said in Domain Controller Down (VM):

                                    @scottalanmiller said in Domain Controller Down (VM):

                                    @Dashrender said in Domain Controller Down (VM):

                                    @scottalanmiller said in Domain Controller Down (VM):

                                    @Dashrender said in Domain Controller Down (VM):

                                    @stacksofplates said in Domain Controller Down (VM):

                                    @scottalanmiller said in Domain Controller Down (VM):

                                    @stacksofplates said in Domain Controller Down (VM):

                                    @scottalanmiller said in Domain Controller Down (VM):

                                    @stacksofplates said in Domain Controller Down (VM):

                                    If you're running on something using PV drivers that they don't understand...

                                    Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                                    When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                                    So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                                    What about IO mode? native, threads, default?

                                    No one can support every hypervisor at that level.

                                    Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                                    You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                                    I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                                    The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                                    SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                                    RDS single proc xeon 4 core 2 drive boot, 2 drive data
                                    IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                                    etc
                                    etc

                                    Because... no support 🙂

                                    eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                                    That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                                    Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                                    it's because they don't ever support anything other than their own software, most likely - they think the server hardware is magic. it will just do what they want, they don't KNOW they need xyz IOPs, or RAM, etc.

                                    That's a scary place to be. If they don't know it, who does? And who provides support. "Not-support" isn't a form of support.

                                    DashrenderD DustinB3403D 2 Replies Last reply Reply Quote 0
                                    • DashrenderD
                                      Dashrender @scottalanmiller
                                      last edited by

                                      @scottalanmiller said in Domain Controller Down (VM):

                                      @Dashrender said in Domain Controller Down (VM):

                                      @DustinB3403 said in Domain Controller Down (VM):

                                      @scottalanmiller said in Domain Controller Down (VM):

                                      @Dashrender said in Domain Controller Down (VM):

                                      @scottalanmiller said in Domain Controller Down (VM):

                                      @Dashrender said in Domain Controller Down (VM):

                                      @stacksofplates said in Domain Controller Down (VM):

                                      @scottalanmiller said in Domain Controller Down (VM):

                                      @stacksofplates said in Domain Controller Down (VM):

                                      @scottalanmiller said in Domain Controller Down (VM):

                                      @stacksofplates said in Domain Controller Down (VM):

                                      If you're running on something using PV drivers that they don't understand...

                                      Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                                      When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                                      So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                                      What about IO mode? native, threads, default?

                                      No one can support every hypervisor at that level.

                                      Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                                      You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                                      I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                                      The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                                      SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                                      RDS single proc xeon 4 core 2 drive boot, 2 drive data
                                      IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                                      etc
                                      etc

                                      Because... no support 🙂

                                      eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                                      That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                                      Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                                      it's because they don't ever support anything other than their own software, most likely - they think the server hardware is magic. it will just do what they want, they don't KNOW they need xyz IOPs, or RAM, etc.

                                      That's a scary place to be. If they don't know it, who does? And who provides support. "Not-support" isn't a form of support.

                                      LOL - they provide support on the software itself, not on it's performance, from my experience.

                                      1 Reply Last reply Reply Quote 0
                                      • stacksofplatesS
                                        stacksofplates @scottalanmiller
                                        last edited by stacksofplates

                                        @scottalanmiller said in Domain Controller Down (VM):

                                        @stacksofplates said in Domain Controller Down (VM):

                                        @scottalanmiller said in Domain Controller Down (VM):

                                        @stacksofplates said in Domain Controller Down (VM):

                                        If you're running on something using PV drivers that they don't understand...

                                        Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                                        When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                                        So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                                        What about IO mode? native, threads, default?

                                        No one can support every hypervisor at that level.

                                        Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                                        So they don't need to be fully supported, but let's say the IT guy down the street who's used Linux twice in his life installs the software in a VM with a non preallocated QCOW2 with no caching and an rtl8139 NIC. It's going to run slower than anything. So he calls the vendor for support and they try to help him. Nothing they are going to be able to tell him is going to help him, because it's nothing to do with their software. It's in their best interest to try to control what you're installing on to to mitigate stupid issues like that.

                                        At least if the other end knew what he needed he could get some help. But now he might cancel his subscription and go somewhere else (which I believe is what they are trying to avoid). I can't imagine the amount of "IT Pros" that contact them looking for support for issues like that.

                                        DustinB3403D scottalanmillerS 3 Replies Last reply Reply Quote 0
                                        • DustinB3403D
                                          DustinB3403 @scottalanmiller
                                          last edited by

                                          @scottalanmiller said in Domain Controller Down (VM):

                                          @Dashrender said in Domain Controller Down (VM):

                                          @DustinB3403 said in Domain Controller Down (VM):

                                          @scottalanmiller said in Domain Controller Down (VM):

                                          @Dashrender said in Domain Controller Down (VM):

                                          @scottalanmiller said in Domain Controller Down (VM):

                                          @Dashrender said in Domain Controller Down (VM):

                                          @stacksofplates said in Domain Controller Down (VM):

                                          @scottalanmiller said in Domain Controller Down (VM):

                                          @stacksofplates said in Domain Controller Down (VM):

                                          @scottalanmiller said in Domain Controller Down (VM):

                                          @stacksofplates said in Domain Controller Down (VM):

                                          If you're running on something using PV drivers that they don't understand...

                                          Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                                          When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                                          So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                                          What about IO mode? native, threads, default?

                                          No one can support every hypervisor at that level.

                                          Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                                          You still haven't provided a single healthcare vendor that does any of what you say is appropriate.

                                          I know Greenway didn't have a virtualization plan 3 years ago when we were looking at them. It's why I had to build a ridiculous $100K two server failover system. Today the performance needed could be done for $25k.
                                          The sad thing is that the vendor could not provide any IOPs requirements, etc. They only had this generic hardware requirement.
                                          SQL Dual Proc Xeon 4 cores each two drive boot, 4 drive RAID 10 SQL, 4 drive log
                                          RDS single proc xeon 4 core 2 drive boot, 2 drive data
                                          IIS application dual proc xeon 4 cores each, 2 drive boot, 6 drive RAID 10 data
                                          etc
                                          etc

                                          Because... no support 🙂

                                          eh? yeah Greenway didn't bother to do the right thing for their customers and have support for hypervisors! Shit, how can they really support their customers on bare metal if they don't know the IOPs requirements, etc? Just keep stabbing hardware until they "get lucky"?

                                          That's my guess. Lacking support of VMs isn't exactly the big issue... it's WHY they lack that support that is the big issue.

                                          Why the don't support VM's is very important, almost as important as the question of what made them not want to support their application in a VM?

                                          it's because they don't ever support anything other than their own software, most likely - they think the server hardware is magic. it will just do what they want, they don't KNOW they need xyz IOPs, or RAM, etc.

                                          That's a scary place to be. If they don't know it, who does? And who provides support. "Not-support" isn't a form of support.

                                          Just as scary as a vendor saying we only support dedicated hardware because we need #-IOPS, #-RAM and #-Cores.

                                          All of which are easy to provide to a VM, just scale the hardware appropriately.

                                          1 Reply Last reply Reply Quote 0
                                          • DustinB3403D
                                            DustinB3403 @stacksofplates
                                            last edited by

                                            @stacksofplates said in Domain Controller Down (VM):

                                            @scottalanmiller said in Domain Controller Down (VM):

                                            @stacksofplates said in Domain Controller Down (VM):

                                            @scottalanmiller said in Domain Controller Down (VM):

                                            @stacksofplates said in Domain Controller Down (VM):

                                            If you're running on something using PV drivers that they don't understand...

                                            Then your critical app vendor is below the home line. THAT'S how scary this should be to companies.

                                            When your "business critical support" lacks the knowledge and skills of your first year help desk people, you need to be worried about their ability to support. Sure, when nothing goes wrong, everything is fine. But if anything goes wrong, you are suggesting these people don't have even the most rudimentary knowledge of systems today. That's worrisome. And it's why so many systems simply have no support options - relying on software and hardware that is out of support meaning that while the app might call itself supported, they depend on non-production systems making the whole thing out of support by extension.

                                            So when running with a preallocated qcow2 image, which caching mode do you use for your disk? Writethrough, writeback, directsync, none?

                                            What about IO mode? native, threads, default?

                                            No one can support every hypervisor at that level.

                                            Also, none of those things need to be supported by the app vendor. They just need to support the app and stop looking for meaningless excuses to block support. I understand some vendors want to support all the way down the stack, but if they don't know how to do that with virtualization, they don't know how to do it. The skills to support the stack would give them the skills to do it virtually even better (fewer variables.) So that logic doesn't hold up.

                                            So they don't need to be fully supported, but let's say the IT guy down the street who's used Linux twice in his life installs the software in a VM with a non preallocated QCOW2 with an rtl8139 NIC. It's going to run slower than anything. So he calls the vendor for support and they try to help him. Nothing they are going to be able to tell him is going to help him, because it's nothing to do with their software. It's in their best interest to try to control what you're installing on to to mitigate stupid issues like that.

                                            At least if the other end knew what he needed he could get some help. But now he might cancel his subscription and go somewhere else (which I believe is what they are trying to avoid). I can't imagine the amount of "IT Pros" that contact them looking for support for issues like that.

                                            That is the issue of the IT Guy not understanding the system requirements, the fact that it is virtual means nothing. He could install that image to a bare metal system and have just as poor performance!

                                            stacksofplatesS 1 Reply Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 27
                                            • 28
                                            • 29
                                            • 30
                                            • 31
                                            • 31 / 31
                                            • First post
                                              Last post