From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/21514 Path: news.gmane.org!not-for-mail From: Christopher Creutzig Newsgroups: gmane.comp.tex.context Subject: Re: Basic question on Unicode and ConTeXt Date: Wed, 20 Jul 2005 22:35:56 +0200 Message-ID: <42DEB5AC.8000806@creutzig.de> References: <6faad9f005071511431a61fa1f@mail.gmail.com> <42DAB916.6000000@wxs.nl> <6faad9f005071813263c3109a1@mail.gmail.com> <42DC250C.8010500@wxs.nl> <6faad9f005071816115514e6ba@mail.gmail.com> <42DCB49D.5020209@wxs.nl> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1121894059 30376 80.91.229.2 (20 Jul 2005 21:14:19 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 20 Jul 2005 21:14:19 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Wed Jul 20 23:14:18 2005 Return-path: Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1DvLt0-0008Rn-W3 for gctc-ntg-context-518@m.gmane.org; Wed, 20 Jul 2005 23:13:51 +0200 Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 8F2A0127FA; Wed, 20 Jul 2005 23:13:50 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 10269-04-4; Wed, 20 Jul 2005 23:13:49 +0200 (CEST) Original-Received: from ronja.vet.uu.nl (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 565ED127D6; Wed, 20 Jul 2005 22:36:09 +0200 (CEST) Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 23DDC127D6 for ; Wed, 20 Jul 2005 22:36:08 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 09626-05 for ; Wed, 20 Jul 2005 22:36:07 +0200 (CEST) Original-Received: from mailgate.uni-paderborn.de (mailgate.uni-paderborn.de [131.234.22.32]) by ronja.ntg.nl (Postfix) with ESMTP id 44E8C12775 for ; Wed, 20 Jul 2005 22:36:06 +0200 (CEST) Original-Received: from p548b3c31.dip0.t-ipconnect.de ([84.139.60.49] helo=[192.168.1.3]) by mailgate.uni-paderborn.de with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.43) id 1DvLIx-0008HN-CF for ntg-context@ntg.nl; Wed, 20 Jul 2005 22:36:35 +0200 User-Agent: Mozilla Thunderbird 1.0 (Macintosh/20041206) X-Accept-Language: en-us, en Original-To: mailing list for ConTeXt users In-Reply-To: <42DCB49D.5020209@wxs.nl> X-UNI-PB_FAK-EIM-MailScanner-Information: Please see http://imap.uni-paderborn.de for details X-UNI-PB_FAK-EIM-MailScanner: Found to be clean X-UNI-PB_FAK-EIM-MailScanner-SpamCheck: not spam, SpamAssassin (score=-3.208, required 4, AUTH_EIM_USER -5.00, RCVD_IN_NJABL_DUL 1.66, RCVD_IN_SORBS_DUL 0.14) X-MailScanner-From: christopher@creutzig.de X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.5 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on smtp.ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:21514 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:21514 Hans Hagen wrote: >> So why not mapping the characters to unicode first and defining the >> mapping from unicode to \TeXcommand only once? regi-* files (at least >> in the meaning they have now) could be prepared automatically by a >> script, less error-prone and without the need to say "Some more >> definitions will be added later." >> >> > you mean ... > > \defineactivetoken 123 {\uchar{...}{...}} > > it is an option but it's much slower and take much more memory I may be wrong, of course, but I think Mojca proposed something different (and something that should be really easy to implement): Have the unicode vectors stored in a format easily parsed by an external ruby script and create the regi-* files from that, using the conversion tables provided by your operating system or iconv or wherever ruby gets them from. regards, Christopher