From mboxrd@z Thu Jan  1 00:00:00 1970
To: 9fans@cse.psu.edu
Date: Tue, 18 Sep 2007 08:38:41 +0000
From: "Douglas A. Gwyn" <DAGwyn@null.net>
Message-ID: <46EEDC26.F3727FDD@null.net>
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
References: <46EE9A41.7DD78E60@null.net>,
	<7ba10925935da3080b62c7cb6e2649d5@coraid.com>
Subject: Re: [9fans] simplicity
Topicbox-Message-UUID: c07d0208-ead2-11e9-9d60-3106f5b1d025

erik quanstrom wrote:
> wchar_t is not the equivalent of Rune.  Rune is always utf-8.  wchar_t
> can be whatever.

I could have sworn that Plan 9 "rune" is used to contain a Unicode
value (UCS-2).  wchar_t can do the same thing, and does on some
platforms.  On others, wchar_t holds a full 31-but UCS-4 code, and
on others (Solaris for example) its encoding is locale-dependent
(which I would agree is not a good design).

> suppose Linux user a and user b grep the same "text" file for the same string.
> results will depend on the users' locales.

But if they're trying to match an alphabetic character class, the
result *should* depend on the locale.