From mboxrd@z Thu Jan  1 00:00:00 1970
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
In-reply-to: Your message of "Sat, 14 Jan 2012 17:42:12 EST."
	<3a789b771cbf86e9a6e7ddfff2b95bb4@chula.quanstro.net>
References: <20120113113026.GA419@polynum.com>
	<CA+N-5bY6RFTDfRS1f=XaQZzPs9HQX0KqnpbAexVjjo2KzQbtUQ@mail.gmail.com>
	<20120114003032.1C08F1CC8F@mail.bitblocks.com>
	<201201140201.51504.dexen.devries@gmail.com>
	<db2515dfdcf5c48647048ef6a2760be0@brasstown.quanstro.net>
	<CAFSF3XNFMEQ5R+dVc_2TQcqkO2DfjFUbFZg8ORvZqB0tsKPHMQ@mail.gmail.com>
	<CAOw7k5h2T+xuxbJhwTxPMOjG3K14KarrJPXFmH9EHdHJnXFpPA@mail.gmail.c>
	<a1e85e17bdb39d08474c2c609fa68a67@chula.quanstro.net>
	<CAEAzY3_9jpi6j-C1u87OKaEazajOBwkvbEBdO5f1eUJysJbH1A@mail.gmail.c>
	<2ca6969da468ef7d305866d2c3c484f4@chula.quanstro.net>
	<CAEAzY39GD4QoTRf2S0Nd4rN+vSyG1tsKyP3rFWu5a0mFN=sH6w@mail.gmail.c>
	<3a789b771cbf86e9a6e7ddfff2b95bb4@chula.quanstro.net>
Date: Sat, 14 Jan 2012 15:32:09 -0800
From: Bakul Shah <bakul@bitblocks.com>
Message-Id: <20120114233209.15BD4B85A@mail.bitblocks.com>
Subject: Re: [9fans] fossil pb: FOUND!
Topicbox-Message-UUID: 5c6c1114-ead7-11e9-9d60-3106f5b1d025

On Sat, 14 Jan 2012 17:42:12 EST erik quanstrom <quanstro@quanstro.net>  wrote:
> On Sat Jan 14 17:12:49 EST 2012, aram.h@mgk.ro wrote:
> > > content addressed means given the content, you can generate the address.
> > > this is NOT true of zfs at all.
> >
> > How come? With venti, the address is the SHA-1 hash, with ZFS, you get
> > to chose the hash, but it can still be a hash.
>
> because in zfs the hash is not used as an address (lba).

True.

> > My original claim was that fragmentation is a non issue if you have
> > SSDs.  I still claim this and I expanded on the context in my previous
> > post.  Of course that random I/O is slower than sequential I/O, SSD or
> > not, but in practice, filesystem fragmentation causes an amount or
> > random I/O much less than what a SSD can handle, so throughput in the
> > fragmented case is close to the throughput in the sequential case.
> >
> > I don't think that caching is completely irrelevant.  If I have to
> > chose between a complex scheme that avoids fragmentation and a simple
> > caching scheme that renders it irrelevant for a particular workload,
> > I'll chose the caching scheme because it's simpler.
>
> by all means, show us the numbers.  personally, i believe the mfgrs are not
> lying when they say that random i/o yields 1/10th the performance (at best)
> of sequential i/o.


Intel 320 300GB SSD numbers (for example):
seq read:  270MBps
rnd read:  39.5Kiops == 158Mbps @ 4KB
seq write: 205MBps
rnd write: 23.0Kiops == 92MBps  @ 4KB

SSDs don't have to contend with seek times but you have to pay
the erase cost (which can not be hidden in "backround GC" when
you are going full tilt).

For venti you'd pick 8k at least so the write throughput will
be higher that 92MBps (but not double). IIRC ZFS picks a much
larger block size so it suffers less here.

You have to check the numbers using the blocksizes relevant to
you.