From: Ulrike Fischer
Newsgroups: gmane.comp.tex.context
Subject: Re: UTF conversion via Lua
Date: Mon, 13 Feb 2012 12:42:19 +0100
To: ntg-context@ntg.nl

On Fri, 10 Feb 2012 12:14:15 +0100, luigi scarso wrote:

> if you mean ASCII with coderange 0-255 *and* ISO-8859-1 (Latin 1)
> encoding there is no need to conversion;

This is not true. You are mixing up Unicode code points and the UTF-8 encoding. E.g. "ä" has the same position in Unicode and Latin-1 (decimal 228, hex E4). But its UTF-8 encoding consists of 16 bits (1100001110100100, hex C3A4), while its Latin-1 encoding is only 8 bits long (11100100).

-- 
Ulrike Fischer
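
The difference between a character's position and its encoded bytes can be checked with a few lines of code. The following is only a minimal sketch in Python (not the Lua code discussed in this thread); the helper names `bits` and `latin1_to_utf8` are made up for illustration.

```python
# Illustrative sketch: one character, one code point, two different byte encodings.
ch = "\u00e4"  # the character "ä"

code_point = ord(ch)                 # position in Unicode (same number as in Latin-1)
latin1_bytes = ch.encode("latin-1")  # one byte
utf8_bytes = ch.encode("utf-8")      # two bytes


def bits(data: bytes) -> str:
    """Hypothetical helper: render bytes as a string of binary digits."""
    return "".join(f"{b:08b}" for b in data)


def latin1_to_utf8(data: bytes) -> bytes:
    """Hypothetical helper: convert Latin-1 bytes to UTF-8 by decoding and re-encoding."""
    return data.decode("latin-1").encode("utf-8")


print(code_point, hex(code_point))             # code point as decimal and hex
print(latin1_bytes.hex(), bits(latin1_bytes))  # Latin-1 byte as hex and binary
print(utf8_bytes.hex(), bits(utf8_bytes))      # UTF-8 bytes as hex and binary
```

Since the byte sequences differ for every character above position 127, Latin-1 input does need a real conversion step (decode, then re-encode) before it can be treated as UTF-8.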