From mboxrd@z Thu Jan 1 00:00:00 1970 Message-ID: Date: Fri, 19 Aug 2005 11:23:15 -0400 From: Dimitry Golubovsky To: mirtchov@cpsc.ucalgary.ca Subject: [9fans] plan9 and the Unicode Consortium definitions Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Cc: 9fans@cse.psu.edu Topicbox-Message-UUID: 7967aac8-ead0-11e9-9d60-3106f5b1d025 Andrey, Andrey wrote: >> I am just wondering whether any API to access more complete set of >> character properties defined by Unicode.org is available in Plan9. >you mean things like diacritics? I mean character categories defined in=20 http://www.unicode.org/Public/4.1.0/ucd/UCD.html#General_Category_Values Abbr. Description =20 Lu Letter, Uppercase=20 Ll Letter, Lowercase=20 Lt Letter, Titlecase=20 Lm Letter, Modifier=20 Lo Letter, Other=20 Mn Mark, Nonspacing=20 Mc Mark, Spacing Combining=20 Me Mark, Enclosing=20 Nd Number, Decimal Digit=20 etc., total about 30 or so. isxxxrune distinguishes only among 5 categories= . This would probably inlcude diacritics, but my question was more general (maybe even philosophical): there exists a recommended set of Unicode character properties, APIs, and interfaces (Unicode.org). Plan9 which probably influenced some aspects of Unicode to be implemented in other systems does not follow. Is there any historical /political /technical /other reason? Related man pages mention "The Unicode Standard" though in SEE ALSO section. What is more interesting to me (technically, as I asked in my first message) - is 16-bitness of runes hardcoded anywhere in the kernel, or only in libc? --=20 Dimitry Golubovsky Anywhere on the Web