From: Richard Mahoney
Newsgroups: gmane.comp.tex.context
Subject: Re: DocBookInContext ... [CSX+, UTF-8 Roman, and Nagari Codings]
Date: Wed, 4 Dec 2002 08:14:06 +1300
Message-ID: <20021203191406.GC44925@ext.canterbury.ac.nz>
In-Reply-To: <20021202174029.GA24554@mail.inet.hr>
Reply-To: ntg-context@ntg.nl

On Mon, Dec 02, 2002 at 06:40:30PM +0100, Gour wrote:
> So here I see something like:
>
> \defineUTFcharacter amacron 1 1
>
> which corresponds to the Unicode code of amacron: U+0101 and it's according to
> the output of Vim's function: "ga" which shows:
>
> <ā> 257, Hex 0101, Octal 401.
>
> Now, it just a question of little work to slowly populate this vector with the
> values for different Unicode characters.

To save yourself time you could look at two C programmes that indicate
CSXp, UTF-8 Roman, and UTF-8 Devanagari codings:

  `csxp2ur' -- converts CSXp --> UTF-8 Roman
  `ur2ud.c' -- converts UTF-8 Roman --> UTF-8 Devanagari

Both are from:

  ftp://bombay.oriental.cam.ac.uk/pub/john/software/programs/

Regards,

 Richard

--
Richard Mahoney    | E-mail: rbm49@ext.canterbury.ac.nz
78 Jeffreys Road   |         r.mahoney@comnet.net.nz
Fendalton          | Telephone: 0064-3-351-5831
CHRISTCHURCH 8005  | Cellular: 0064-25-829-986
NEW ZEALAND        | http://homepages.comnet.co.nz/~r-mahoney
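
For anyone populating the vector by hand, the quoted `\defineUTFcharacter amacron 1 1` line can be reproduced with a small script. This is only a rough sketch: it assumes the two numbers are the high and low byte of the code point in decimal, which is consistent with the amacron example (U+0101 gives 1 and 1) but is not confirmed anywhere in this thread. The function names `utf_character_line` and `ga_style` are made up for illustration.

```python
def utf_character_line(name: str, char: str) -> str:
    """Build a \\defineUTFcharacter line for one character.

    Assumption: the two numeric arguments are the high and low byte
    of the code point, in decimal (U+0101 -> 1 1).
    """
    code_point = ord(char)
    if code_point > 0xFFFF:
        # The two-byte split only makes sense inside the 16-bit range.
        raise ValueError("code point does not fit in two bytes")
    high, low = divmod(code_point, 256)
    return f"\\defineUTFcharacter {name} {high} {low}"


def ga_style(char: str) -> str:
    """Format a character roughly the way the quoted Vim "ga" output looks."""
    code_point = ord(char)
    return f"<{char}> {code_point}, Hex {code_point:04x}, Octal {code_point:o}"


if __name__ == "__main__":
    print(ga_style("\u0101"))
    print(utf_character_line("amacron", "\u0101"))
```

Running it on U+0101 should give back both the Vim line and the ConTeXt line quoted above, so the same two calls can be looped over a list of (name, character) pairs to draft further entries for checking.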