Jan 272017
 

Greetings everyone!

I had my first predicted disk failure occur on my HPE MSA 2040. As always, it was a breeze contacting HPE support to get the drive replaced (since my unit has a 4 hour response warranty).

However, with this being my first drive swap I came across something worth mentioning. Typically in RAID arrays when a disk fails, you simply swap out the failed disk and it starts rebuilding, this is NOT the case if you have an HPE MSA 2040 that’s fully loaded with no spares configured.

If you have global spares, the moment the disk is failed, it will automatically rebuild on to available configured spares.

If you don’t have any global spares (my case), the replacement disk is marked as unused and available. You must set this disk as a spare in the SMU for the rebuild to start.

One additional note, if you do have spares and a disk fails, when you replace the disk that failed it will not automatically rebuild that disk back from the spare. You must force fail (pull out) the spare disk for it to start rebuilding on the freshly replaced disk. Always confirm current redundancy levels and activity before forcefully failing any disks!

As per HPE’s MSA 1040/2040 Best Practices document:

Source: https://h50146.www5.hpe.com/products/storage/whitepaper/pdfs/4AA4-6892ENW.pdf

  5 Responses to “HPE MSA 2040 – Disk Failure considerations and steps”

  1. Thanks for the quick Response!

    The only option I see in regarding global spare is selecting ” Change Disk Group Spare” and I am unable to click on anything within that option box. Are there any specific steps or options with the GUI I need to access/follow?

  2. Scott, I see you’re online. I’m going to send you a message on chat.

  3. Got the new drive in and swapped it out and automatically went into rebuild mode. I think I got pretty much figured out now. Weird the other drive when straight into archive mode and was solid green. Must of been a used drive from before.

  4. I know this is an old thread but figured ‘d leave a question.

    Had a drive go bad in slot 9. we “believe” that we had 2 Global spares but are not entirely positive. We believe the Global Spares were on disk 2 and 11 and when disk 9 went bad that the Global spare in slot 2 took over. When I replaced the drive in slot 9 this morning I’m pretty sure it said Available but now it is showing as a Vdisk spare? I can’t find a way to make it a global spare.

    600GB SAS drives in a Raid 10 array 8 Drive are in Volume, 2 in another with what we thought were 2 Global spares Exchange environment so don’t want to blow away anything.

    Does what I said makes sense or not?

    Thanks

  5. Hi Jeff,

    You should look at your config and check your spare configuration. You’ll also be able to use this method to add a new disk as a global spare.

    Log in to the SMU, click on “System” on the left. Now click on “Action”, and select “Change Global Spares”. This will show you disks you can add as a Global Spare, and also show you configured Spares.

    Since you’re not familiar with the config of your unit, I’d recommend checking the health to make sure none of the RAID arrays or disk groups are running in a degraded state.

    Additionally, you can find out what disks are members of a disk group and pool by going in the SMU to “Pools” and viewing the different pools and disk groups.

    Hope this helps.

    Stephen

 Leave a Reply

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

(required)

(required)