From mboxrd@z Thu Jan  1 00:00:00 1970
X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/2615
Path: main.gmane.org!not-for-mail
From: Hans Hagen <pragma@wxs.nl>
Newsgroups: gmane.comp.tex.context
Subject: Re: Wanting to learn plain TeX
Date: Wed, 06 Sep 2000 18:11:54 +0200
Sender: owner-ntg-context@let.uu.nl
Message-ID: <3.0.6.32.20000906181154.0162fad0@pop.wxs.nl>
References: <3.0.6.32.20000904224547.0086b570@pop.wxs.nl>
	<4.3.2.7.0.20000904194500.00b207c0@rzdspc1.informatik.uni-h amburg.de>
	<3.0.6.32.20000831140354.0165ed10@pop.wxs.nl>
	<4.3.2.7.0.20000831130011.00b1b9a0@pop.btx.dtag.de>
NNTP-Posting-Host: coloc-standby.netfonds.no
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
X-Trace: main.gmane.org 1035393394 10309 80.91.224.250 (23 Oct 2002 17:16:34 GMT)
X-Complaints-To: usenet@main.gmane.org
NNTP-Posting-Date: Wed, 23 Oct 2002 17:16:34 +0000 (UTC)
Cc: ntg-context@ntg.nl
Original-To: "Robert F. Beeger" <5beeger@informatik.uni-hamburg.de>
In-Reply-To: <4.3.2.7.0.20000906165719.00b24670@rzdspc1.informatik.uni-h
	amburg.de>
Xref: main.gmane.org gmane.comp.tex.context:2615
X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:2615

At 05:10 PM 9/6/00 +0200, Robert F. Beeger wrote:

>Just because of being curious about this one: How can this work? I remember 
>of having heard once that the writen Chinese language consists of 2000000 
>or even more symbols, of which each one stands for a word. Are Unicode 
>characters used here or a special mapping from ASCII to Chinese. I also ask 
>myself what sort of keyboard the chinese guys use when they want to type 
>some text in Chinese.

Wang Lei is the person to answer this best. When I implemented chinese, i
did so based on the info he gave me, since the documentation that comes
with other tex implementations is in chinese and therefore unreadable for
me. So, i implemented chinese from scratch. There are more problems
involved than fonts. Currently fonts are dealt with by splitting them up in
=<256 glyphs and using the two byte chars to invoke them. In context the
first byte (char) triggers a font switch [all bytes>128 are made active].
Then the font+char combination is fed in a unicode handler [on top of
font-uni] where spacing and linebreaking is optimized. 

The complications are in: different encoding/font specific numbering
schemes, split labels, mixed chinese latin, and more. This is a rather
multi dimensional problem: font/encoding/language/... with languages within
languages and so. 

Concerning keying, there are special keyboards and free tools for editing.
Kind of funny to see you windows become chinese, due to characters that are
intercepted and mapped onto chinese or whatever. Of course there is emacs. 

Some day I will use omega for this, but first I will finish the current
implementation.  

Hans
-------------------------------------------------------------------------
                                                  Hans Hagen | PRAGMA ADE
                      Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
 tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com
-------------------------------------------------------------------------