From: Idris Samawi Hamid <ishamid@colostate.edu>
Subject: Re: Arabic-utf-8 (plus a sample)
Date: Sat, 05 Jun 2004 15:33:34 -0600 [thread overview]
Message-ID: <opr844t8tru9mfh0@lamar.colostate.edu> (raw)
In-Reply-To: <1086468099.5707.26.camel@tascomputer.home>
On Sat, 05 Jun 2004 22:41:39 +0200, Thomas A. Schmitz
<thomas.schmitz@uni-bonn.de> wrote:
> Idris,
>
> I know a bit of perl and would love to help. However, I fear that
> sending us your stuff via mail will be a bit difficult because the utf-8
> chracters get transformed into gibberish.
Thnx 4 such a speedy reply! I don't think you are getting gibberish
though; you should be getting the extended ascii representation. So the
letter alif (hex 0627) should look like this:
ا
Do you get a forward-slashed circle and a section symbol? If so, that's
the ascii representation I'm trying to convert to the letter `A'.
Here are the codes you want:
ا [0627] => A
ب [0628] => b
ج [062C] => j
د [062F] => d
Ù‡ [0647] => h
Ùˆ [0648] => w
ز [0632] => z
Let me explain my situation more clearly:-)
I have a unicode editor, Unitype Global Writer. I save a unicode document
as a utf *.txt file. When I open that saved file in my TeX editor
(WinEdt), it comes out as extended ascii (that's the "gibberish"). So what
I wanted to do was convert the ascii "gibberish" to my Latin
transcription. It seems that what you are suggesting is to use the hex
representation and convert the unicode txt file into a Latin transcription
file directly and bypass the gibberish.
On your perl file, can you give me an example of how to use it? I tried
(in windows, with name
utf2tex.pl and unicode text in unicode-utf.txt) and get
=========================
> perl utf2tex.pl unicode-utf.txt
Unknown discipline class ':utf8' at C:/Perl/lib/open.pm line 18.
BEGIN failed--compilation aborted at utf2tex.pl line 4.
=========================
from your script I tried, e.g.
============================
$_ =~
s/\x{0627}/\x{0041}/esg;
# from alif to `A'
============================
Your guidance will be greatly appreciated!
Thnx a million!
Idris
--
Professor Idris Samawi Hamid
Department of Philosophy
Colorado State University
Fort Collins, CO 80523
next prev parent reply other threads:[~2004-06-05 21:33 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-06-05 19:32 Idris Samawi Hamid
2004-06-05 20:41 ` Thomas A. Schmitz
2004-06-05 21:33 ` Idris Samawi Hamid [this message]
2004-06-05 21:48 ` Thomas A. Schmitz
2004-06-05 22:51 ` Idris Samawi Hamid
2004-06-05 23:15 ` Re[2]: " Giuseppe Bilotta
2004-06-05 23:31 ` Idris Samawi Hamid
2004-06-05 23:58 ` Re[4]: " Giuseppe Bilotta
2004-06-06 0:19 ` Idris Samawi Hamid
2004-06-06 0:26 ` Idris Samawi Hamid
2004-06-06 9:09 ` Perl scripting (was: Arabic-utf-8) Henning Hraban Ramm
2004-06-06 21:03 ` Idris Samawi Hamid
2004-06-06 21:28 ` Thomas A. Schmitz
2004-06-07 19:45 ` Henning Hraban Ramm
2004-06-07 20:53 ` Thomas A.Schmitz
2004-06-05 23:08 ` [SPAM: 3.411] Arabic-utf-8 (plus a sample) Richard MAHONEY
2004-06-06 0:19 ` Idris Samawi Hamid
2004-06-06 13:22 ` Arabic-utf-8 " George N. White III
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=opr844t8tru9mfh0@lamar.colostate.edu \
--to=ishamid@colostate.edu \
--cc=ntg-context@ntg.nl \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).