From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/21408 Path: news.gmane.org!not-for-mail From: Mojca Miklavec Newsgroups: gmane.comp.tex.context Subject: ISO/windows encodings (was: Basic question on Unicode ...) Date: Fri, 15 Jul 2005 23:13:55 +0200 Message-ID: <6faad9f0050715141375e97854@mail.gmail.com> References: <6faad9f005071511431a61fa1f@mail.gmail.com> Reply-To: Mojca Miklavec , mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1121462070 15050 80.91.229.2 (15 Jul 2005 21:14:30 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Fri, 15 Jul 2005 21:14:30 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Fri Jul 15 23:14:26 2005 Return-path: Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1DtXVS-0000yl-JU for gctc-ntg-context-518@m.gmane.org; Fri, 15 Jul 2005 23:14:02 +0200 Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 3EC6F127E8; Fri, 15 Jul 2005 23:14:02 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 26604-04; Fri, 15 Jul 2005 23:13:58 +0200 (CEST) Original-Received: from ronja.vet.uu.nl (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 17C6E127BB; Fri, 15 Jul 2005 23:13:58 +0200 (CEST) Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 56A56127BB for ; Fri, 15 Jul 2005 23:13:57 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 26472-07-2 for ; Fri, 15 Jul 2005 23:13:56 +0200 (CEST) Original-Received: from nproxy.gmail.com (nproxy.gmail.com [64.233.182.198]) by ronja.ntg.nl (Postfix) with ESMTP id 273A8127BA for ; Fri, 15 Jul 2005 23:13:55 +0200 (CEST) Original-Received: by nproxy.gmail.com with SMTP id x4so182394nfb for ; Fri, 15 Jul 2005 14:13:55 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=ixoiW1AADKVZzGlZUUbIXQeK7mJBu3ojwj5+fSS0+njEXERa2ktqef/E3mgr4MK2leZmR6ddTVsjYuKAPeSydi5xSKu8HYG20ZwD6QCteyHOKxIn/i2ndV/U5UiAYIgRgS6XpBoBcDH0n06wfxpxUzXm/yZuKdMe83iQGre4bd0= Original-Received: by 10.48.237.12 with SMTP id k12mr127044nfh; Fri, 15 Jul 2005 14:13:55 -0700 (PDT) Original-Received: by 10.48.247.2 with HTTP; Fri, 15 Jul 2005 14:13:55 -0700 (PDT) Original-To: mailing list for ConTeXt users In-Reply-To: <6faad9f005071511431a61fa1f@mail.gmail.com> Content-Disposition: inline X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.5 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on smtp.ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:21408 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:21408 (Sorry, I should have opened another thread already before.) I have another couple of questions about regimes support. How can synonyms for regimes be defined, so that \enableregime[windows-1250] would have the same effect as \enableregime[win-1250] or \enableregime[cp1250]? And \enableregime[utf8] the same effect as \enableregime[utf]. I don't won't to be discriminating, but \enableregime[windows] is like writing \enableregime[latin] ("il" in ConTeXt I think) and expecting the whole world to understand that you mean latin1. In my opinion it should be left there (for backward compatibility if for nothing else), but deprecated and given an unambigious name like "windows-1252", "windows1252", "win-1252", "win1252", "cp1252" or "windows-western". > Does anyone have any script to test the encoding (which would produce > a matrix of (almost) 266 characters)? (Seems like I should have learnt for my math exam tomorrow instead of writing this.) I meant if someone has a nice macro or a script to produce the 256 (not 266!) characters table (minus non-printable ones), maybe together with the corresponding name (only if it can still be extracted). It should either look like an usual ASCII table (perhaps with a box around like in TeX font tables) or simply one character per line with a decimal and hex number written. More or less in order to be able to test if the regi-* files are OK. I prepared the file by hand, but now as I know where to look for and after I saw the http://czyborra.com/charsets/iso8859.html page, I think it shouldn't be a problem to prepare support for all those usual and unusual encodings at once (only a clever script and some manually-prepared mapping from unicode to ConTeXt names). Unicode is great, but not everyone uses it (even vim behaves pretty system-dependent and cannot always be used for unicode out-of-the-box). (I also forgot to thank Patrick for explaining me some stuff about regimes.= ) Mojca