From mboxrd@z Thu Jan 1 00:00:00 1970 To: 9fans@cse.psu.edu Date: Tue, 18 Sep 2007 08:38:41 +0000 From: "Douglas A. Gwyn" Message-ID: <46EEDC26.F3727FDD@null.net> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit References: <46EE9A41.7DD78E60@null.net>, <7ba10925935da3080b62c7cb6e2649d5@coraid.com> Subject: Re: [9fans] simplicity Topicbox-Message-UUID: c07d0208-ead2-11e9-9d60-3106f5b1d025 erik quanstrom wrote: > wchar_t is not the equivalent of Rune. Rune is always utf-8. wchar_t > can be whatever. I could have sworn that Plan 9 "rune" is used to contain a Unicode value (UCS-2). wchar_t can do the same thing, and does on some platforms. On others, wchar_t holds a full 31-but UCS-4 code, and on others (Solaris for example) its encoding is locale-dependent (which I would agree is not a good design). > suppose Linux user a and user b grep the same "text" file for the same string. > results will depend on the users' locales. But if they're trying to match an alphabetic character class, the result *should* depend on the locale.