ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Dell PERC Question (Server Down)

    IT Discussion
    17
    255
    145.1k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • BRRABillB
      BRRABill @StrongBad
      last edited by

      @StrongBad said

      From what I could tell, the issue is that Edge does not respond.

      Yes, the tech who was working with me has not responded.

      Now, in the past few weeks I have dealt with people on vacation, and people who were sick, and everything else. So I always like to give them the benefit of the doubt as to why they are not responding. šŸ™‚

      DustinB3403D 1 Reply Last reply Reply Quote 1
      • DustinB3403D
        DustinB3403 @BRRABill
        last edited by

        @BRRABill Any update to share?

        1 Reply Last reply Reply Quote 0
        • BRRABillB
          BRRABill
          last edited by

          This was the latest e-mail from earlier this afternoon:

          "That information is good. I was hoping that your iDRAC log would shine some light on what the actual fault error was being recorded when the drive array is actually going down. I’m working on this now with one of our SSD engineers and I am hoping to have some additional information or potential resolutions about this issue today. "

          1 Reply Last reply Reply Quote 2
          • StrongBadS
            StrongBad
            last edited by

            Checking in again.

            1 Reply Last reply Reply Quote 1
            • BRRABillB
              BRRABill
              last edited by

              Have not heard from them today, sadly.

              Let me go rattle the cage...

              1 Reply Last reply Reply Quote 1
              • BRRABillB
                BRRABill
                last edited by BRRABill

                The cage rattling did nothing.

                In other news, the RAID array crashed again this morning. Management is starting to ask questions, so I think I am just going to go back to the old DELL spinning rust drives. I don't think I have an option at this point.

                This time (this is the fourth time this has happen in a month) was similar to times 1 and 2. In both those instances the entire virtual disk disappeared, as did the physical disks. If you boot into the PERC config, you will see under the FOREIGN tab that the VD and the PD are both there. "Simply" reimport the config, and you're all set.

                The third time, the array was still there, it was the disk in 0:0 that was missing. So we cleared the foreign config off of that.

                This fourth time, I took more notice of what happened when the array came back up. Sure enough it was 0;0 that was degraded. But, I don't know if I can trust that it might just be that drive.

                Here are some pictures of the PERC screens...

                0_1461325759560_fri error.png

                0_1461325766816_fri error 2.png

                0_1461325773018_fri error 3.png

                1 Reply Last reply Reply Quote 0
                • scottalanmillerS
                  scottalanmiller
                  last edited by

                  It's possible that the PERC is bad, I suppose.

                  BRRABillB 2 Replies Last reply Reply Quote 0
                  • BRRABillB
                    BRRABill @scottalanmiller
                    last edited by BRRABill

                    @scottalanmiller said

                    It's possible that the PERC is bad, I suppose.

                    Why would you think that as opposed to it being the disks?

                    Or are you just spitballing ideas?

                    Of course DELL would say it was the disks, but the first thing that @BradfromxByte said the first time was that it had to be the disk. It was the guys from EDGE that said no.

                    scottalanmillerS 1 Reply Last reply Reply Quote 0
                    • BRRABillB
                      BRRABill @scottalanmiller
                      last edited by

                      @scottalanmiller said in Dell PERC Question (Server Down):

                      It's possible that the PERC is bad, I suppose.

                      BTW: this PERC did not come with the server. But it was purchased new from DELL.

                      1 Reply Last reply Reply Quote 0
                      • scottalanmillerS
                        scottalanmiller @BRRABill
                        last edited by

                        @BRRABill said in Dell PERC Question (Server Down):

                        @scottalanmiller said

                        It's possible that the PERC is bad, I suppose.

                        Why would you think that as opposed to it being the disks?

                        Or are you just spitballing ideas?

                        Of course DELL would say it was the disks, but the first thing that @BradfromxByte said the first time was that it had to be the disk. It was the guys from EDGE that said no.

                        Just saying that it is one of the potential culprits. The disks are not failing per se, they are going foreign. While it is certainly possible that the disks are at fault, doesn't it seem more likely to be the controller? What likely failure condition on a disk would result in the controller thinking that it is a foreign device?

                        BRRABillB DashrenderD 2 Replies Last reply Reply Quote 0
                        • BRRABillB
                          BRRABill @scottalanmiller
                          last edited by

                          @scottalanmiller said

                          Just saying that it is one of the potential culprits. The disks are not failing per se, they are going foreign. While it is certainly possible that the disks are at fault, doesn't it seem more likely to be the controller? What likely failure condition on a disk would result in the controller thinking that it is a foreign device?

                          What I have heard from DELL, xByte, and here on ML is that is the firmware on the EDGE disks has some sort of issue communicating, the PERC will set it to foreign.

                          That's why they instantly think it is the drive.

                          1 Reply Last reply Reply Quote 0
                          • scottalanmillerS
                            scottalanmiller
                            last edited by

                            That's not unreasonable. Of course, if the same error happens on the PERC side, it will think that the issue is from the Edge side and behave the same way.

                            1 Reply Last reply Reply Quote 0
                            • DashrenderD
                              Dashrender @scottalanmiller
                              last edited by

                              @scottalanmiller said in Dell PERC Question (Server Down):

                              @BRRABill said in Dell PERC Question (Server Down):

                              @scottalanmiller said

                              It's possible that the PERC is bad, I suppose.

                              Why would you think that as opposed to it being the disks?

                              Or are you just spitballing ideas?

                              Of course DELL would say it was the disks, but the first thing that @BradfromxByte said the first time was that it had to be the disk. It was the guys from EDGE that said no.

                              Just saying that it is one of the potential culprits. The disks are not failing per se, they are going foreign. While it is certainly possible that the disks are at fault, doesn't it seem more likely to be the controller? What likely failure condition on a disk would result in the controller thinking that it is a foreign device?

                              I'm tending to agree with Scott here - it seems really odd for the controller to consider the drives and their config to be completely foreign. Of course this is the problem that we've been talking about ever since one of my first postings with Scott on SW well over 5 years ago - When you're mixing vendors, at what point will someone capitulate that it's their stuff that's broken.

                              Scott's been saying since that time, oh so long ago, that Dell (in this case) can't refuse to provide warranty/support on their equipment just because you're using someone else's drives. This looks like a good case of time to prove that.

                              Also, you could be having two problems - A drive (0:0) and the Perc controller could be bad. In this case, perhaps when the drive freaks out for a moment it's causing the Perc to freak out too.

                              Of course I suppose the drive could be freaking out so badly out of Dell's expectation as to cause the Perc to loose it's mind, but that seems .... you pick a word.

                              BRRABillB 1 Reply Last reply Reply Quote 0
                              • scottalanmillerS
                                scottalanmiller
                                last edited by

                                Yes, Dell has a responsibility to support the PERC, but only if the PERC is at fault. That's what makes this a tough situation that we don't know where the fault lies.

                                1 Reply Last reply Reply Quote 1
                                • BRRABillB
                                  BRRABill @Dashrender
                                  last edited by

                                  @Dashrender said

                                  Scott's been saying since that time, oh so long ago, that Dell (in this case) can't refuse to provide warranty/support on their equipment just because you're using someone else's drives. This looks like a good case of time to prove that.

                                  DELL has actually been great about supporting these. They never really say anything. I kind of cringe waiting for it, but they support with no issue.

                                  Of course, when they look through the logs, they are filled with WARNINGS about non-DELL equipment. They mention that.

                                  But they've been helpful so far.

                                  That's another thing that is so weird. The drives are now showing up a green checks, which they are not supposed to do.

                                  DashrenderD 1 Reply Last reply Reply Quote 0
                                  • DashrenderD
                                    Dashrender @BRRABill
                                    last edited by

                                    @BRRABill said in Dell PERC Question (Server Down):

                                    @Dashrender said

                                    Scott's been saying since that time, oh so long ago, that Dell (in this case) can't refuse to provide warranty/support on their equipment just because you're using someone else's drives. This looks like a good case of time to prove that.

                                    DELL has actually been great about supporting these. They never really say anything. I kind of cringe waiting for it, but they support with no issue.

                                    Of course, when they look through the logs, they are filled with WARNINGS about non-DELL equipment. They mention that.

                                    But they've been helpful so far.

                                    That's another thing that is so weird. The drives are now showing up a green checks, which they are not supposed to do.

                                    They aren't? I thought was the whole point of these Edge drives - that they are compliant and don't have issues with Dell stuff.

                                    But what do I know?

                                    BRRABillB 1 Reply Last reply Reply Quote 0
                                    • DashrenderD
                                      Dashrender
                                      last edited by

                                      So what's the plan here? I see three possibilities.

                                      1. replace at least drive 0:0 with a new Edge drive
                                      2. replace the Perc
                                      3. replace SSDs with Dell drives.
                                      BRRABillB 1 Reply Last reply Reply Quote 0
                                      • BRRABillB
                                        BRRABill @Dashrender
                                        last edited by

                                        @Dashrender said

                                        They aren't? I thought was the whole point of these Edge drives - that they are compliant and don't have issues with Dell stuff.

                                        But what do I know?

                                        I'm not going to speak for anyone at EDGE or xByte. But it seems like there are pockets of issues.

                                        1 Reply Last reply Reply Quote 0
                                        • BRRABillB
                                          BRRABill @Dashrender
                                          last edited by

                                          @Dashrender said in Dell PERC Question (Server Down):

                                          So what's the plan here? I see three possibilities.

                                          1. replace at least drive 0:0 with a new Edge drive
                                          2. replace the Perc
                                          3. replace SSDs with Dell drives.

                                          I heard back from EDGE.

                                          Basically the tech said that while he has never seen this exact issue, he has seen issues that are almost identical, and that they have a special model of the drive I have (I have the 3X) to fix these issues.

                                          So, I am going to cross my fingers and give that a try.

                                          todd-at-xByteT 1 Reply Last reply Reply Quote 1
                                          • scottalanmillerS
                                            scottalanmiller
                                            last edited by

                                            That sounds promising.

                                            1 Reply Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 5
                                            • 6
                                            • 7
                                            • 8
                                            • 9
                                            • 12
                                            • 13
                                            • 7 / 13
                                            • First post
                                              Last post