From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/9975 Path: main.gmane.org!not-for-mail From: Gour Newsgroups: gmane.comp.tex.context Subject: Re: DocBookInContext & multi-languages (newbie) / utf Date: Mon, 2 Dec 2002 18:40:30 +0100 Sender: ntg-context-admin@ntg.nl Message-ID: <20021202174029.GA24554@mail.inet.hr> References: <5.1.0.14.1.20021202132549.00af0328@server-1> <20021129072039.GA7792@mail.inet.hr> <20021129072039.GA7792@mail.inet.hr> <5.1.0.14.1.20021202132549.00af0328@server-1> <5.1.0.14.1.20021202153537.03d21258@server-1> Reply-To: ntg-context@ntg.nl NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1038850772 14026 80.91.224.249 (2 Dec 2002 17:39:32 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 2 Dec 2002 17:39:32 +0000 (UTC) Return-path: Original-Received: from ref.vet.uu.nl ([131.211.172.13] helo=ref.ntg.nl) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 18IuXa-0003e4-00 for ; Mon, 02 Dec 2002 18:39:30 +0100 Original-Received: from ref.ntg.nl (localhost.localdomain [127.0.0.1]) by ref.ntg.nl (Postfix) with ESMTP id F25B510AE8; Mon, 2 Dec 2002 18:41:47 +0100 (MET) Original-Received: from mail.iskon.hr (inje.iskon.hr [213.191.128.16]) by ref.ntg.nl (Postfix) with SMTP id A64A010AE6 for ; Mon, 2 Dec 2002 18:40:33 +0100 (MET) Original-Received: (qmail 1107 invoked from network); 2 Dec 2002 18:40:32 +0100 Original-Received: from zg05-107.dialin.iskon.hr (HELO gaura) (213.191.138.108) by mail.iskon.hr with SMTP; 2 Dec 2002 18:40:32 +0100 Original-Received: (qmail 25711 invoked by uid 500); 2 Dec 2002 17:40:30 -0000 Original-To: ntg-context@ntg.nl Content-Disposition: inline In-Reply-To: <5.1.0.14.1.20021202153537.03d21258@server-1> User-Agent: Mutt/1.4i Errors-To: ntg-context-admin@ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.0.13 Precedence: bulk X-Reply-To: Gour List-Help: List-Post: List-Subscribe: , List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.comp.tex.context:9975 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:9975 Hans Hagen (pragma@wxs.nl) wrote: > in xtag-utf.tex in .../tex/context/base (at least in my version and the > beta) On my SuSE 8.0 I didn't find it, but fortunately it's in the beta which I downloaded :-) So here I see something like: \defineUTFcharacter amacron 1 1 which corresponds to the Unicode code of amacron: U+0101 and it's according to the output of Vim's function: "ga" which shows: <ā> 257, Hex 0101, Octal 401. Now, it just a question of little work to slowly populate this vector with the values for different Unicode characters. > >UTF-8_and_Unicode_FAQ has some test files and I'm sure this step is not a > >problem. > > So, where can i find that doc? The FAQ document is at: http://www.cl.cam.ac.uk/~mgk25/unicode.html, and the example files are under: http://www.cl.cam.ac.uk/~mgk25/unicode.html#examples Pls. take a look http://www.macchiato.com/unicode/Unicode_transcriptions.html under the example's list. There is also Unicode converter: http://www.macchiato.com/unicode/convert.html > >a) c4 81 -> amacron > >b) 0101 -> amacron > > so, c4 is the trigger, and 81 the character; this means that the function > attached to c4 has to map the 81 onto \amacron I'm not sure whether c4 is the trigger for the 81 character. c4 81 is two-byte representation in memory (that's what you'll see in some hexadecimal editor) of Unicode amacron character with the code U+0101, or simply said: utf-8 code for amacron :-) > can you make me a file with a list like: > > amacron : 01/01 : c4/c8 : > ^^^^^^^^^^^^^^^^^^^^^^^^^ ^^^^^^^^^ > normal ascii real utf So, the line for amacron should look like: amacron : 01/01 c4/c8 since c4/c8 is utfcode for amacron. Is this OK? -- Gour gour@mail.inet.hr Registered Linux User #278493