From: Khaled Hosny
Newsgroups: gmane.comp.tex.context
Subject: Re: Arabic index entries
Date: Fri, 20 Jun 2008 19:02:33 +0300
Message-ID: <20080620160233.GC15208@khaled-laptop>
References: <20080620002305.GA26204@khaled-laptop> <485B5D90.9060606@wxs.nl>
In-Reply-To: <485B5D90.9060606@wxs.nl>
To: mailing list for ConTeXt users

On Fri, Jun 20, 2008 at 09:34:40AM +0200, Hans Hagen wrote:
> Idris Samawi Hamid wrote:
> > On Thu, 19 Jun 2008 18:23:05 -0600, Khaled Hosny
> > wrote:
> >
> >> Arabic index entries are all listed under "unknown" instead of its
> >> respective Arabic letters. I'm not sure if this is a bug or a
> >> misconfiguration from my side. See the attached example.
> >
> > We need to include arabic-farsi-urdu etc. databases in the distro. If Hans
> > can tell us what file to emulate/edit etc....
>
> first we need to discuss the logic ... say that we have a sequence of
> chars ... do we need to erase the vowels? etc

Erase vowels as in not counting them? Then yes, we should only respect
full letters. We might also need to strip the Arabic definite article
"ال", but this will be tricky since there are words that start with it.
Maybe we had better have syntax like \index[a]{entry}, where this entry
will be filed under "a", or do we already have this?

Regards,
 Khaled

-- 
Khaled Hosny
Arabic localizer and member of Arabeyes.org team
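
To make the sorting logic discussed above concrete, here is a rough sketch in Python (not ConTeXt code, and not how ConTeXt actually builds its register keys). It assumes that "vowels" means Unicode combining marks such as the short-vowel signs, and that the definite article is the two letters alef + lam. The names strip_marks, sort_key, group_letter, override and strip_article are hypothetical and exist only for this illustration.

```python
import unicodedata

# Assumption: the definite article is alef (U+0627) followed by lam (U+0644).
ARTICLE = "\u0627\u0644"
TATWEEL = "\u0640"  # elongation character, carries no sorting weight here


def strip_marks(text):
    """Drop combining marks (short vowels, shadda, sukun) and tatweel.

    Precomposed letters are left untouched: the text is not decomposed first.
    """
    return "".join(
        ch for ch in text
        if unicodedata.combining(ch) == 0 and ch != TATWEEL
    )


def sort_key(entry, override=None, strip_article=False):
    """Build a sort key for one index entry.

    override      -- explicit key, playing the role of the [a] in \\index[a]{entry}
    strip_article -- opt-in per entry, because some words genuinely start
                     with alef + lam and must keep those letters
    """
    if override is not None:
        return override
    key = strip_marks(entry)
    if strip_article and key.startswith(ARTICLE) and len(key) > len(ARTICLE):
        key = key[len(ARTICLE):]
    return key


def group_letter(key):
    """The letter under which the entry is filed: the first character of its key."""
    return key[:1]


if __name__ == "__main__":
    vocalized = "\u0627\u0644\u0643\u0650\u062a\u064e\u0627\u0628"
    print(group_letter(sort_key(vocalized)))                      # filed under alef
    print(group_letter(sort_key(vocalized, strip_article=True)))  # filed under kaf
    print(group_letter(sort_key(vocalized, override="a")))        # filed under "a"
```

Stripping the article is deliberately opt-in in this sketch: since it cannot be detected reliably from the spelling alone, an explicit per-entry key is the safer fallback.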