From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/22671 Path: news.gmane.org!not-for-mail From: Hans Hagen Newsgroups: gmane.comp.tex.context Subject: Re: utf-based lang-* files? Date: Thu, 22 Sep 2005 23:23:12 +0200 Message-ID: <433320C0.5040504@wxs.nl> References: <4332A0F5.2040100@creutzig.de> <20050922122747.30135@mail.comp.lancs.ac.uk> <4332DADD.5010705@creutzig.de> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1127424301 7610 80.91.229.2 (22 Sep 2005 21:25:01 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 22 Sep 2005 21:25:01 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Thu Sep 22 23:24:54 2005 Return-path: Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1EIYX6-0002gv-Lg for gctc-ntg-context-518@m.gmane.org; Thu, 22 Sep 2005 23:23:08 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 4BB0712805; Thu, 22 Sep 2005 23:23:07 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 32743-07; Thu, 22 Sep 2005 23:23:03 +0200 (CEST) Original-Received: from ronja.vet.uu.nl (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id EDE04127F3; Thu, 22 Sep 2005 23:23:02 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id BE747127F3 for ; Thu, 22 Sep 2005 23:23:00 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 00415-01 for ; Thu, 22 Sep 2005 23:22:59 +0200 (CEST) Original-Received: from mailrelay02.solcon.nl (maillb.solcon.nl [212.45.32.200]) by ronja.ntg.nl (Postfix) with ESMTP id C7104127F2 for ; Thu, 22 Sep 2005 23:22:59 +0200 (CEST) Original-Received: from server-1.pragma-net.nl (dsl-212-84-128-085.solcon.nl [212.84.128.85]) by mailrelay02.solcon.nl (8.12.11/SQL-8.12.11-5/8.12.11) with ESMTP id j8MLMsVo015215 for ; Thu, 22 Sep 2005 23:22:54 +0200 Original-Received: by server-1.pragma-net.nl (Postfix, from userid 65534) id 63DD2364B1; Thu, 22 Sep 2005 22:27:59 +0200 (CEST) Original-Received: from [10.100.1.15] (unknown [10.100.1.15]) by server-1.pragma-net.nl (Postfix) with ESMTP id 0AD1C364AF for ; Thu, 22 Sep 2005 20:27:58 +0000 (UTC) User-Agent: Mozilla Thunderbird 1.0.2 (Windows/20050317) X-Accept-Language: en-us, en Original-To: mailing list for ConTeXt users In-Reply-To: <4332DADD.5010705@creutzig.de> X-Virus-Scanned: ClamAV version 0.86.2, clamav-milter version 0.86 on mailrelay02 X-Virus-Status: Clean X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.5 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on smtp.ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:22671 Archived-At: Christopher Creutzig wrote: >Adam Lindsay wrote: > =20 > >>>Is it possible to simply enclose the file in a >>>\startregime[utf]...\stopregime pair or do I risk havoc by doing this? >>> =20 >>> >>Well, if you're using a regime, it still (usually) depends on symbolic >>character names being defined under the hood. Also, such an approach >> =20 >> > > Sure. But editing the file is oh so much easier when I can just type >\def\japChapterNumber#1{=E7=AC=AC#1=E7=AB=A0} >than if I have to look up the unicode numbers first and type >\def\japChapterNumber{\uchar{123}{44}#1\uchar{122}{224}} > > =20 > >>(explicitly calling \startregime[utf]) doesn't make XeTeX as happy as i= t >>could be (XeTeX is happiest if you just pass through Unicode characters= . >> If xetex handles utf-8 by just looking at catcodes letter, you don't=20 need a regime; you just have to make sure that when the file is loaded=20 the chars 128->255 have the right catcode \dostepwiserecurse{128}{255}{1}{\catcode\recurselevel=3D11\relax} Hans ----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com | www.pragma-pod.nl -----------------------------------------------------------------