Exablox testing journal
-
Have not yet contacted support as I suspect this may be related to the really crappy networking I gave it. I will let it sit and stew for another day as I have no time to poke it today.
-
Connected the delinquent "vancouver" one back up to clean 100mbit lan, will see if anything changes.
-
Note: all errors mentioned below still exist on the "vancouver" unit as I post this, including wacky dedupe ratio, 100GB greater capacity in use and file system inconsistency error
-
Time to get support to take a gander, I would say.
-
@scottalanmiller I'll give it 24h to sort it's self out - if it was the hub, that'll give it time to heal. I really don't expect anyone's tested it with such poor networking gear and while I didn't expect it to cause issues, it's outside of what I'd expect them to support.
-
@MattSpeller No, that is exactly what you are paying for. make use of it. By not contacting support you are wasting time and money.
-
@JaredBusch If it was in production, 100% agree. I unfortunately don't have time to poke at it today so I'll let it sit.
-
@MattSpeller said:
Have not yet contacted support as I suspect this may be related to the really crappy networking I gave it. I will let it sit and stew for another day as I have no time to poke it today.
support will be reaching out to you to look at the secondary. Better to be proactive than reactive
-
@MattSpeller said:
Seems like the errors from last week have gotten strange. I've only been loading data to the "victoria' unit, however the "vanouver" one is showing greater storage used, along with a really sweet amount of dedupe
You'll see really good dedupe ratios like this as it's strictly a calculation based on how much data is written the Ring and then to the physical drives. Over the week and lifetime you'll get a more 'reasonable' ratio
-
Contact established with Support at 1:30pm, email at 4:42pm requesting permission to reboot, I replied immediately, box rebooted at 4:46 with all errors cleared. I'm a happy camper.
-
So, error came back maybe 30 minutes after it was cleared. I rebooted the unit again and it seems to be gone for good. Support reached out to me again and suggested I run the software upgrade they released - it's now underway on both.
-
Cool, thanks for all of the feedback.
-
@scottalanmiller Helps me later when I go to write it up for the boss
-
An interesting find developed over the long weekend. The troubled oneblox unit finally stopped talking some time friday evening. I came in this morning and traced it down to a faulty ethernet cable. How much this has contributed to it's chain of errors is unknown, and I am happy to give it the benefit of the doubt moving forwards.
-
To speed up testing I've installed a gigabit switch, this absolutely will change their performance.
-
@MattSpeller said:
To speed up testing I've installed a gigabit switch, this absolutely will change their performance.
Welcome to 2003.
-
-
Having some further errors on the oneblox units, apparently they're not talking to each other right now. While I find this slightly irritating I want to point out the following email chain, check out the times.
Matthew Speller
Feb 13, 11:27
Hello Support,
Could I please get a hand resolving this?Bob Gardiner (Exablox Support)
Feb 13, 11:30
I will take a look and let you know what I find.
Bob -
Hey @MattSpeller here's some interesting reading:
http://www.theregister.co.uk/2015/02/13/exablox_not_good_for_restore/
Ā
Have you tested the restore times?
Did you find them adequate? -
@nadnerB Not yet, though today may be the day.