From: "George N. White III"
Newsgroups: gmane.comp.tex.context
Subject: Re: Arabic-utf-8 (plus a sample)
Date: Sun, 06 Jun 2004 10:22:07 -0300
Organization: Bedford Institute of Oceanography
Reply-To: ntg-context@ntg.nl
Cc: "aleph@ntg.nl"

On Sat, 5 Jun 2004, Idris Samawi Hamid wrote:

> Hi gang,
>
> For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even ArabTeX)
> unless one of the encoding filters like utf-8 is used. Even for utf-8 files,
> however, it would be very useful to be able to convert a utf-8 file to Latin
> transcription for further processing by Aleph/(e-)Omega. For example, adding
> diacritics is much easier to do in Latin than in an Arabic script editor
> because Latin transcription is one-dimensional and adding diacritics to
> Arabic is a 2-dimen affair.
>
> The best thing would be a perl script but I don't know perl at all (except to
> run some precreated scripts). If someone out of the kindness of their
> heart could write a short and simple script for just seven characters I could
> do the rest myself and present it back here.

Can you use (or extend) GNU recode? It does include support for utf-8 and
several TeX encodings.

From the manual:

"It is easy for a programmer to add a new charset to `recode'. All it
requires is making a few functions kept in a single `.c' file, adjusting
`Makefile.am' and remaking `recode'."

--
George N. White III
Head of St. Margarets Bay, Nova Scotia, Canada
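
For the seven-character script requested in the quoted message, a table-driven
filter might look roughly like the sketch below. It is written in Python
rather than Perl. The names `TRANSLIT`, `to_latin` and `convert_file` are made
up for illustration, and the seven mappings are assumptions loosely modelled
on ArabTeX-style ASCII codes, so each entry should be checked against the
transcription your Aleph/(e-)Omega setup actually expects.

```python
# Hypothetical mapping table: seven Arabic letters to ASCII transcription.
# The right-hand codes are assumptions in the style of ArabTeX; verify them
# against the transcription table you actually use before relying on this.
TRANSLIT = {
    "\u0628": "b",   # ARABIC LETTER BEH
    "\u062A": "t",   # ARABIC LETTER TEH
    "\u062B": "_t",  # ARABIC LETTER THEH
    "\u062C": "^g",  # ARABIC LETTER JEEM
    "\u062D": ".h",  # ARABIC LETTER HAH
    "\u062E": "_h",  # ARABIC LETTER KHAH
    "\u062F": "d",   # ARABIC LETTER DAL
}


def to_latin(text):
    """Replace each mapped character; pass everything else through unchanged."""
    return "".join(TRANSLIT.get(ch, ch) for ch in text)


def convert_file(src_path, dst_path):
    """Read a utf-8 file and write its transcription, line by line."""
    with open(src_path, encoding="utf-8") as src, \
         open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write(to_latin(line))
```

Extending the table to the rest of the alphabet only means adding entries to
`TRANSLIT`. Characters without an entry are copied as they are, so a
half-finished table still produces usable output.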