9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
From: Jorden Mauro <jrm8005@gmail.com>
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
Subject: Re: [9fans] grëp (rhymes with creep) and cptmp
Date: Mon, 30 Nov 2009 14:43:32 -0500	[thread overview]
Message-ID: <3aaafc130911301143h1e788401t7741a55b7fbb18cb@mail.gmail.com> (raw)
In-Reply-To: <22fc94a82c16f8b347bc45dd539b5fc6@coraid.com>

On Mon, Nov 30, 2009 at 11:00 AM, erik quanstrom <quanstro@coraid.com> wrote:
>> ``unfold turns a character, say ë into the set of
>> characters that can be folded to the same base
>> character.  so
>>        ; unfold ë
>>        [eèéêëēĕėęěȅȇȩḕḗḙḛḝẹẻẽếềểễệ]''
>>
>> To me, that sounds like [e-f] should be
>>
>> [eèéêëēĕėęěȅȇȩḕḗḙḛḝẹẻẽếềểễệfƒ]
>>
>> iff e unfolds to the same set as ë. If e only unfolds to [e], then
>> [e-f] would unfold to [ef].
>
> i don't think that works.  consider [e-g].  normally
> this would match 'f', but under your algorithm it wouldn't.
> the problem is that [a-z] works because ascii is arranged
> in alphabetical order.  all the various accented characters
> are not.

It would work if the algorithm didn't expand the class just by
enumerating ASCII letters, but
for every letter also added the accented chars.

>
> that's why the folding approach has an advantage [a-z]
> will work and will do the Right Thing.
>
> - erik
>
>



  parent reply	other threads:[~2009-11-30 19:43 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <<d50d7d460911292352j7cbcbc7erefa21b3b7f29f20a@mail.gmail.com>
2009-11-30 13:50 ` erik quanstrom
2009-11-30 14:48   ` roger peppe
2009-11-30 14:54     ` David Leimbach
2009-11-30 15:10   ` Jason Catena
2009-11-30 15:32     ` erik quanstrom
2009-11-30 15:54       ` Jorden Mauro
2009-11-30 16:00         ` erik quanstrom
2009-11-30 18:38           ` hiro
2009-11-30 19:43           ` Jorden Mauro [this message]
     [not found] <<df49a7370911300648l5e243b12ncdf6de116d81afa9@mail.gmail.com>
2009-11-30 15:28 ` erik quanstrom
2009-11-30 16:38   ` roger peppe
2009-11-30 17:34     ` erik quanstrom
     [not found] <<df49a7370911300326m3e3a6be1yc77e49a2b23a6da2@mail.gmail.com>
2009-11-30 14:06 ` erik quanstrom
     [not found] <<d50d7d460911291101k7420eb0fna61f87646606e991@mail.gmail.com>
2009-11-30  4:29 ` erik quanstrom
2009-11-30  7:52   ` Jason Catena
2009-11-30  9:00     ` Eris Discordia
2009-11-29 19:01 Jason Catena
2009-11-30  4:51 ` Bruce Ellis
2009-11-30 11:26 ` roger peppe

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3aaafc130911301143h1e788401t7741a55b7fbb18cb@mail.gmail.com \
    --to=jrm8005@gmail.com \
    --cc=9fans@9fans.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).