Daily rant

Sep. 30th, 2008 07:59 pm
sobrique: (Default)
[personal profile] sobrique
Today, at work, we had a disk fail on one of our storage arrays.

Normally this is irritating, but minor - engineer is dispatched with new disk to swap it.

Not today it seems. No, today our problem management team would not raise an incident for me - there was no user impact, because we have a resilient system, therefore it was not acceptable for them to fix it based on an incident, and a retroactive change. I would therefore have to raise an emergency change.

Well, this first of all pissed me off. I'm not changing anything, I'm just getting a malfunctioning part swapped out. I'm still pretty marginal on whether that should be considered a _change_ at all, because nothing is, in fact, changing.

So anyway. After ... getting a bit wound up by the fact that it was acceptable for us (this was internally) to stonewall replacing some of our customers hardware that we _knew_ had a fault, I spoke to someone in our customer's team, to find out ... quite why they thought that was a good idea.

It seems they didn't, and they were fine with the concept of 'just swap the damn disk, before another one burns out'.

But anyway, I finally cracked and started filling our emergency change form - resisting the urge to be massively sarcastic when answering the 'why can your change not be done as part of a planned release process' (Because I'm not psychic), and 'please explain in non technical terms what you're doing' and 'what is your justification for dispensation from the testing process' and a whole selection of asnine little questions.

My emergency change was rejected, because there wasn't enough time to process it between when I submitted it (about 16:00, admittedly) and when I'd specified for it to start - 09:00 tomorrow.

Now, we have a really rather robust storage system, and it actually is the case that this disk is not really a problem - we've several hot spares, which will function just fine, even after several drives go 'pop'.

But that's not the point. It's not hard, when you have a 4 hour support agreement with a vendor, which costs lots of money, to get this done. It's only when you involve muppets, that it's turned an incident that should have a quick resolution, into what can only be described as the IT equivalent of the benny hill show.

Date: 2008-10-01 06:56 pm (UTC)
From: [identity profile] stgpcm.livejournal.com
The question is does the system automatically using a "hot spare" constitute a change? their disk layout is now different, which *could* alter the performance profile, and they have one less hot spare available to them.

Profile

sobrique: (Default)
sobrique

December 2015

S M T W T F S
  12345
6789101112
13141516171819
20212223242526
2728 293031  

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Feb. 18th, 2026 10:28 am
Powered by Dreamwidth Studios