From mboxrd@z Thu Jan 1 00:00:00 1970
To: 9fans@cse.psu.edu
Date: Tue, 18 Sep 2007 15:41:12 +0000
From: "Douglas A. Gwyn"
Message-ID: <46EFEBAF.F0402D6C@null.net>
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
References: ,
Subject: Re: [9fans] simplicity
Topicbox-Message-UUID: c0a6b3dc-ead2-11e9-9d60-3106f5b1d025

Iruata Souza wrote:
> On 9/18/07, dave.l@mac.com wrote:
> > >But if they're trying to match an alphabetic character class, the
> > >result *should* depend on the locale.
> > ... so what *should* the result be if the locale specifies an
> > ideographic script?
> the result *should* be 'now go and use plan 9'

That doesn't address the issue Dave L raised.

I don't know off hand what POSIX decreed for "character classes"
involving ideographs. My guess is that they have to not count as
uppercase or lowercase, and probably not as alphabetic nor
alphanumeric. You could ask similar questions about accented
characters in alphabet-based languages.

This isn't about character coding so much as it is about
classification.
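
To make the coding-versus-classification distinction concrete, here is a
small sketch. It uses Python's built-in Unicode character database, which
is an assumption made for illustration: it is not the POSIX locale
machinery discussed above, and it says nothing about what POSIX decreed.
The helper name classify is hypothetical. The point is only that a
character's class comes from a table lookup that is separate from how the
character is encoded, and that the table may disagree with one's guess.

```python
import unicodedata


def classify(ch):
    """Return the classification Python's Unicode tables give one character.

    Assumption: this reflects Unicode properties as shipped with Python,
    not the LC_CTYPE classes of any POSIX locale.
    """
    return {
        "category": unicodedata.category(ch),  # Unicode general category
        "alpha": ch.isalpha(),
        "alnum": ch.isalnum(),
        "upper": ch.isupper(),
        "lower": ch.islower(),
    }


if __name__ == "__main__":
    # An ASCII letter, an accented letter, and a CJK ideograph.
    for ch in ("A", "\u00e9", "\u4e2d"):
        print(repr(ch), classify(ch))
```

Under these tables the ideograph U+4E2D is neither uppercase nor
lowercase, but it does count as alphabetic and alphanumeric. A POSIX
locale is free to define its classes differently, so this does not answer
the question for POSIX.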