From mboxrd@z Thu Jan 1 00:00:00 1970
From: erik quanstrom
Date: Mon, 30 Nov 2009 10:32:34 -0500
To: 9fans@9fans.net
Message-ID: <71b1e3b728efbd1b2a2ae2b5b4e2b1d0@coraid.com>
In-Reply-To:
References:
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 8bit
Subject: Re: [9fans] =?utf-8?q?gr=C3=ABp_=28rhymes_with_creep=29_and_cptmp?=
Topicbox-Message-UUID: a56d9952-ead5-11e9-9d60-3106f5b1d025

On Mon Nov 30 10:13:09 EST 2009, jason.catena@gmail.com wrote:
> > it turns out that doing regular expressions is difficult, since
> > it's not clear to me what [a-z] should match when unfolded.
>
> I have discovered a truly marvellous proof of this, which this memory
> is too narrow to contain.
>
> 209 runes in an unfolded a-Ǯ superclass later...
> 12498: signal: sys: segmentation violation

size isn't the real issue.  the real issue is determining
what the ranges are for other than the base character.
if a maps to [aa'...] and z maps to [zz'...] it's not
clear that [a'-z'] is a sensible set.

for example what does [e-f] map to?  [e-f], clearly
but [ë-what?]

- erik
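
The range problem described above can be sketched in a few lines. This is only an illustration under stated assumptions: it takes "folding" to mean mapping a rune to the first code point of its Unicode canonical decomposition (NFD), which may not be how the grep under discussion folds, and it only looks at code points below U+0250. The names base, unfold, to_ranges and the limit parameter are hypothetical and do not come from the thread.

```python
import unicodedata

def base(ch):
    """Return the first code point of ch's canonical decomposition (NFD)."""
    return unicodedata.normalize("NFD", ch)[0]

def unfold(lo, hi, limit=0x250):
    """List the code points below limit whose base character lies in [lo-hi]."""
    return [cp for cp in range(limit) if lo <= base(chr(cp)) <= hi]

def to_ranges(cps):
    """Collapse a sorted list of code points into (first, last) runs."""
    runs = []
    for cp in cps:
        if runs and cp == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], cp)   # extend the current run
        else:
            runs.append((cp, cp))          # start a new run
    return runs

if __name__ == "__main__":
    # The unfolded [e-f] is a scattered set of runs, not one range.
    for first, last in to_ranges(unfold("e", "f")):
        print(f"U+{first:04X}-U+{last:04X}  {chr(first)}-{chr(last)}")
    # Walking forward in code point order from ë does not reach an "f":
    print([base(chr(cp)) for cp in range(0xEB, 0xF0)])
```

Under these assumptions the unfolded [e-f] comes out as several disjoint runs, and the code points that follow ë (ì, í, î, ï) all have base i, so there is no accented endpoint that would make [ë-what?] mirror [e-f].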