9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
From: Jack Norton <jack@0x6a.com>
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
Subject: Re: [9fans] Petabytes on a budget: JBODs + Linux + JFS
Date: Mon, 21 Sep 2009 15:57:59 -0500	[thread overview]
Message-ID: <4AB7E8D7.10906@0x6a.com> (raw)
In-Reply-To: <75bd45f10fe4970a189c6824bbadc841@quanstro.net>

erik quanstrom wrote:
>>> i think the lesson here is don't by cheep drives; if you
>>> have enterprise drives at 1e-15 error rate, the fail rate
>>> will be 0.8%.  of course if you don't have a raid, the fail
>>> rate is 100%.
>>>
>>> if that's not acceptable, then use raid 6.
>>>
>> Hopefully Raid 6 or zfs's raidz2 works well enough with cheap
>> drives!
>>
>
> don't hope.  do the calculations.  or simulate it.
>
> this is a pain in the neck as it's a function of ber,
> mtbf, rebuild window and number of drives.
>
> i found that not having a hot spare can increase
> your chances of a double failure by an order of
> magnitude.  the birthday paradox never ceases to
> amaze.
>
> - erik
>
>
While we are on the topic:
How many RAID cards have we failed lately?  I ask because I am about to
hit a fork in the road with my work-a-like of your diskless fs.  I was
originally going to use linux soft raid and vblade, but I am considering
using some raid cards that just so happen to be included in the piece of
hardware I will be getting soon...
At work, we recently had a massive failure of our RAID array.  After
much brown noseing, I come to find that after many harddrives being
shipped to our IT guy and him scratching his head, it was in fact the
RAID card itself that had failed (which takes out the whole array, plus
can take out any new drives you throw at it apparently).

So I ask you all this (especially those in the 'biz): all this
redundancy on the drive side, why no redundancy of controller cards (or
should I say, the driver infrastructure needed)?

It is appealing to me to try and get some plan 9 supported raid card and
have plan 9 throughout (like the coraid setup as far as I can tell), but
this little issue bothers me.

Speaking of birthday, I mentioned to our IT dep (all two people...) that
they should try and spread out the drives used among different mfg dates
and batches.  It shocked me to know that this was news to them...

-Jack



  reply	other threads:[~2009-09-21 20:57 UTC|newest]

Thread overview: 42+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-09-14 16:43 erik quanstrom
2009-09-20 20:13 ` Bakul Shah
2009-09-21  3:37   ` erik quanstrom
2009-09-21 17:43     ` Bakul Shah
2009-09-21 18:02       ` erik quanstrom
2009-09-21 18:49         ` Wes Kussmaul
2009-09-21 19:21           ` erik quanstrom
2009-09-21 20:57             ` Wes Kussmaul
2009-09-21 22:42               ` erik quanstrom
2009-09-22 10:59             ` matt
2009-09-21 19:10         ` Bakul Shah
2009-09-21 20:30           ` erik quanstrom
2009-09-21 20:57             ` Jack Norton [this message]
2009-09-21 23:38               ` erik quanstrom
2009-09-21 22:07             ` Bakul Shah
2009-09-21 23:35               ` Eris Discordia
2009-09-22  0:45                 ` erik quanstrom
     [not found]               ` <6DC61E4A6EC613C81AC1688E@192.168.1.2>
2009-09-21 23:50                 ` Eris Discordia
  -- strict thread matches above, loose matches on Subject: below --
2009-09-04  0:53 Roman V Shaposhnik
2009-09-04  1:20 ` erik quanstrom
2009-09-04  9:37   ` matt
2009-09-04 14:30     ` erik quanstrom
2009-09-04 16:54     ` Roman Shaposhnik
2009-09-04 12:24   ` Eris Discordia
2009-09-04 12:41     ` erik quanstrom
2009-09-04 13:56       ` Eris Discordia
2009-09-04 14:10         ` erik quanstrom
2009-09-04 18:34           ` Eris Discordia
     [not found]       ` <48F03982350BA904DFFA266E@192.168.1.2>
2009-09-07 20:02         ` Uriel
2009-09-08 13:32           ` Eris Discordia
2009-09-04 16:52   ` Roman Shaposhnik
2009-09-04 17:27     ` erik quanstrom
2009-09-04 17:37       ` Jack Norton
2009-09-04 18:33         ` erik quanstrom
2009-09-08 16:53           ` Jack Norton
2009-09-08 17:16             ` erik quanstrom
2009-09-08 18:17               ` Jack Norton
2009-09-08 18:54                 ` erik quanstrom
2009-09-14 15:50                   ` Jack Norton
2009-09-14 17:05                     ` Russ Cox
2009-09-14 17:48                       ` Jack Norton
2009-09-04 23:25   ` James Tomaschke

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4AB7E8D7.10906@0x6a.com \
    --to=jack@0x6a.com \
    --cc=9fans@9fans.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).