From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/15327 Path: main.gmane.org!not-for-mail From: Richard MAHONEY Newsgroups: gmane.comp.tex.context Subject: Re: [SPAM: 3.411] Arabic-utf-8 (plus a sample) Date: Sun, 6 Jun 2004 11:08:46 +1200 Sender: ntg-context-admin@ntg.nl Message-ID: <20040605230846.GB54691@131-203-240-72.remote.comnet.co.nz> References: Reply-To: ntg-context@ntg.nl NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1086477071 2827 80.91.224.253 (5 Jun 2004 23:11:11 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 5 Jun 2004 23:11:11 +0000 (UTC) Original-X-From: ntg-context-admin@ntg.nl Sun Jun 06 01:11:02 2004 Return-path: Original-Received: from ref.vet.uu.nl ([131.211.172.13] helo=ref.ntg.nl) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1BWkJa-0000jh-00 for ; Sun, 06 Jun 2004 01:11:02 +0200 Original-Received: from ref.ntg.nl (localhost.localdomain [127.0.0.1]) by ref.ntg.nl (Postfix) with ESMTP id 068DA10B21; Sun, 6 Jun 2004 01:10:58 +0200 (MEST) Original-Received: from 131-203-240-72.remote.comnet.co.nz (131-203-240-72.remote.comnet.co.nz [131.203.240.72]) by ref.ntg.nl (Postfix) with ESMTP id 73B5B10ABB for ; Sun, 6 Jun 2004 01:08:50 +0200 (MEST) Original-Received: from rbm49 by 131-203-240-72.remote.comnet.co.nz with local (Exim 4.34; FreeBSD) id 1BWkHO-000Ej7-Tw; Sun, 06 Jun 2004 11:08:46 +1200 Original-To: ntg-context@ntg.nl Mail-Followup-To: ntg-context@ntg.nl Content-Disposition: inline In-Reply-To: User-Agent: Mutt 1.5.6i on FreeBSD 4.10-STABLE (i386) X-URL: X-Originating-IP: [131.203.240.72] X-Originating-Server: [131-203-240-72.remote.comnet.co.nz] X-Operating-System: FreeBSD 4.10-STABLE Errors-To: ntg-context-admin@ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.0.13 Precedence: bulk X-Reply-To: Richard MAHONEY List-Help: List-Post: List-Subscribe: , List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.comp.tex.context:15327 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:15327 On Sat, Jun 05, 2004 at 01:32:35PM -0600, Idris Samawi Hamid wrote: > Hi gang, > > For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even > ArabTeX) unless one of the encoding filters like utf-8 is used. Even for > utf-8 files, however, it would be very useful to be able to convert a > utf-8 file to Latin transcription for further processing by > Aleph/(e-)Omega. For example, adding diacritics is much easier to do in > Latin than in an Arabic script editor because Latin transcription is > one-dimensional and adding diacritics to Arabic is a 2-dimen affair. > > The best thing would be a perl script but I don't know perl at all (except > to run some some precreated scripts). If someone out of the kindness of > their heart could write a short and simple script for just seven > characters I could do the rest myself and present it back here. You might like to look at some of the encoding conversion scripts at: http://homepages.comnet.co.nz/~r-mahoney/scripts/scripts.html N.B. For sorting utf-8 Arabic you might find the perl module `Sort::ArbBiLex' useful Best regards, Richard Mahoney -- Richard MAHONEY | internet: homepages.comnet.net.nz/~r-mahoney Littledene | telephone / telefax (man.): ++64 3 312 1699 Bay Road | cellular: ++64 25 829 986 OXFORD, NZ | e-mail: r.mahoney[use"@"]comnet.net.nz