From: Idris Samawi Hamid
Newsgroups: gmane.comp.tex.context
Subject: Re: Arabic-utf-8 (plus a sample)
Date: Sat, 05 Jun 2004 15:33:34 -0600
Organization: Colorado State University
Reply-To: ntg-context@ntg.nl
In-Reply-To: <1086468099.5707.26.camel@tascomputer.home>

On Sat, 05 Jun 2004 22:41:39 +0200, Thomas A. Schmitz wrote:

> Idris,
>
> I know a bit of perl and would love to help. However, I fear that
> sending us your stuff via mail will be a bit difficult because the utf-8
> characters get transformed into gibberish.

Thnx 4 such a speedy reply!

I don't think you are getting gibberish, though; you should be getting the extended ascii representation, i.e. each byte of the utf-8 encoding shown as a separate 8-bit character. So the letter alif (hex 0627) should look like this:

ا

Do you get a forward-slashed circle and a section symbol (Ø§)? If so, that's the extended ascii representation I'm trying to convert to the letter `A'.

Here are the codes you want:

ا [0627] => A
ب [0628] => b
ج [062C] => j
د [062F] => d
ه [0647] => h
و [0648] => w
ز [0632] => z

Let me explain my situation more clearly :-)

I have a unicode editor, Unitype Global Writer. I save a unicode document as a utf-8 *.txt file. When I open that saved file in my TeX editor (WinEdt), it comes out as extended ascii (that's the "gibberish"). So what I wanted to do was convert the ascii "gibberish" to my Latin transcription.

It seems that what you are suggesting is to work from the hex code points instead: convert the unicode txt file directly into a Latin transcription file and bypass the gibberish altogether.

On your perl file, can you give me an example of how to use it? I tried it in windows, with the script saved as utf2tex.pl and the unicode text in unicode-utf.txt, and I get

=========================
> perl utf2tex.pl unicode-utf.txt
Unknown discipline class ':utf8' at C:/Perl/lib/open.pm line 18.
BEGIN failed--compilation aborted at utf2tex.pl line 4.
=========================

From your script I also tried, e.g.,

============================
$_ =~ s/\x{0627}/\x{0041}/esg; # from alif to `A'
============================

Your guidance will be greatly appreciated!

Thnx a million!

Idris

--
Professor Idris Samawi Hamid
Department of Philosophy
Colorado State University
Fort Collins, CO 80523
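
The direct route described in the message (read the file as utf-8, map code points to Latin letters, never touch the "gibberish") might look roughly like the sketch below. It is not the utf2tex.pl script from the thread, which is not shown here. It assumes Perl 5.8 or later, since it relies on the Encode module and the :encoding I/O layer; the "Unknown discipline class" error quoted above may point to an older Perl, but that is only a guess. The names latin_for, transliterate and as_8bit_editor_shows are made up for this illustration, and the table holds only the seven letters listed in the message. Note that the replacement uses the /e modifier on purpose, because it is a Perl expression; for a plain one-letter replacement such as alif to `A', /e should not be needed.

```perl
#!/usr/bin/perl
# Sketch only: assumes Perl 5.8+ (Encode module, :encoding I/O layer).
use strict;
use warnings;
use Encode qw(decode encode);

# Code point => Latin transcription, copied from the list in the message.
# Any further letters would have to be added by hand.
my %latin_for = (
    "\x{0627}" => 'A',    # alif
    "\x{0628}" => 'b',
    "\x{062C}" => 'j',
    "\x{062F}" => 'd',
    "\x{0647}" => 'h',
    "\x{0648}" => 'w',
    "\x{0632}" => 'z',
);

# Replace every character of the Arabic block that has a table entry;
# characters without an entry are passed through unchanged.
sub transliterate {
    my ($text) = @_;
    $text =~ s/([\x{0600}-\x{06FF}])/exists $latin_for{$1} ? $latin_for{$1} : $1/ge;
    return $text;
}

# Show what an 8-bit (Latin-1) editor makes of utf-8 text: every byte of
# the encoding becomes one character. Alif (bytes D8 A7) turns into the
# slashed O followed by the section sign.
sub as_8bit_editor_shows {
    my ($text) = @_;
    return decode('iso-8859-1', encode('UTF-8', $text));
}

# When a file name is given, transliterate that file to standard output.
if (@ARGV) {
    my ($path) = @ARGV;
    open(my $in, '<:encoding(UTF-8)', $path)
        or die "Cannot open $path: $!\n";
    binmode(STDOUT, ':encoding(UTF-8)');
    while (my $line = <$in>) {
        $line =~ s/^\x{FEFF}// if $. == 1;    # drop a byte-order mark, if any
        print transliterate($line);
    }
    close($in);
}
```

Saved under a hypothetical name such as translit.pl, it would be run as "perl translit.pl unicode-utf.txt > latin.txt". Keeping the mapping in one hash means new letters can be added without touching the substitution itself.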