From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/2615 Path: main.gmane.org!not-for-mail From: Hans Hagen Newsgroups: gmane.comp.tex.context Subject: Re: Wanting to learn plain TeX Date: Wed, 06 Sep 2000 18:11:54 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <3.0.6.32.20000906181154.0162fad0@pop.wxs.nl> References: <3.0.6.32.20000904224547.0086b570@pop.wxs.nl> <4.3.2.7.0.20000904194500.00b207c0@rzdspc1.informatik.uni-h amburg.de> <3.0.6.32.20000831140354.0165ed10@pop.wxs.nl> <4.3.2.7.0.20000831130011.00b1b9a0@pop.btx.dtag.de> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" X-Trace: main.gmane.org 1035393394 10309 80.91.224.250 (23 Oct 2002 17:16:34 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 17:16:34 +0000 (UTC) Cc: ntg-context@ntg.nl Original-To: "Robert F. Beeger" <5beeger@informatik.uni-hamburg.de> In-Reply-To: <4.3.2.7.0.20000906165719.00b24670@rzdspc1.informatik.uni-h amburg.de> Xref: main.gmane.org gmane.comp.tex.context:2615 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:2615 At 05:10 PM 9/6/00 +0200, Robert F. Beeger wrote: >Just because of being curious about this one: How can this work? I remember >of having heard once that the writen Chinese language consists of 2000000 >or even more symbols, of which each one stands for a word. Are Unicode >characters used here or a special mapping from ASCII to Chinese. I also ask >myself what sort of keyboard the chinese guys use when they want to type >some text in Chinese. Wang Lei is the person to answer this best. When I implemented chinese, i did so based on the info he gave me, since the documentation that comes with other tex implementations is in chinese and therefore unreadable for me. So, i implemented chinese from scratch. There are more problems involved than fonts. Currently fonts are dealt with by splitting them up in =<256 glyphs and using the two byte chars to invoke them. In context the first byte (char) triggers a font switch [all bytes>128 are made active]. Then the font+char combination is fed in a unicode handler [on top of font-uni] where spacing and linebreaking is optimized. The complications are in: different encoding/font specific numbering schemes, split labels, mixed chinese latin, and more. This is a rather multi dimensional problem: font/encoding/language/... with languages within languages and so. Concerning keying, there are special keyboards and free tools for editing. Kind of funny to see you windows become chinese, due to characters that are intercepted and mapped onto chinese or whatever. Of course there is emacs. Some day I will use omega for this, but first I will finish the current implementation. Hans ------------------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com -------------------------------------------------------------------------