From: Philipp Gesang
Newsgroups: gmane.comp.tex.context
Subject: Re: copy&paste from pdf bug (smallcaps, text figures)
Date: Fri, 2 Aug 2013 17:56:01 +0200
Message-ID: <20130802155601.GA15777@tartaros>
In-Reply-To: <51FBC940.2020104@wxs.nl>
To: mailing list for ConTeXt users

·········
> On 8/2/2013 2:28 PM, Philipp Gesang wrote:
> >·········
> >
> >>On 2013–08–02 Philipp Gesang wrote:
> >>
> >>> https://phi-gamma.net/pdf/copypasta.pdf
> >>> https://phi-gamma.net/files/copypasta.txt
> >>>
> >>>I definitely get [U+F730..U+F739] [U+F761..U+F77A] from this
> >>>one.
> >>
> >>Indeed. When I copy from your file I get those private Unicode
> >>slots. When I run the example code from your OP, I get the correct
> >>characters. I don't know what's the difference between those two
> >>files.
> >
> >The PDF is what Context produces here with that code.
> >
> >> The LuaTeX version and ConTeXt version is the same.
> >>
> >>Creator: ConTeXt - 2013.08.01 01:31
> >>Producer: LuaTeX-0.76.0
> >
> >There appears to be a difference between node and base mode
> >depending on how the font is defined:
> >
> >  \pdfcompresslevel0
> >
> >  \setupbodyfont [iwona]
> >
> >  \definefontfeature [proto] [onum=yes,smcp=yes,script=dflt,lang=dflt]
> >  \definefontfeature [withbase] [proto] [mode=base]
> >  \definefontfeature [withnode] [proto] [mode=node]
> >
> >  \definefont [iwonab] [file:Iwona-Regular.otf*withbase]
> >  \definefont [iwonan] [file:Iwona-Regular.otf*withnode]
> >
> >  \starttext
> >    \feature[<]
> >    base mode\par
> >    {\feature[!][withbase]0123456789abcdefghijklmnopqrstuvwxyz\par}
> >    {\iwonab 0123456789abcdefghijklmnopqrstuvwxyz}
> >
> >    node mode\par
> >    {\feature[!][withnode]0123456789abcdefghijklmnopqrstuvwxyz\par}
> >    {\iwonan 0123456789abcdefghijklmnopqrstuvwxyz}
> >  \stoptext \endinput
> >
> >This gets me (through pdftotext):
> >
> >  base mode
> >  [U+F730..U+F739][U+F761..U+F77A]
> >  0123456789abcdefghijklmnopqrstuvwxyz
> >  node mode
> >  [U+F730..U+F739][U+F761..U+F77A]
> >  [U+F730..U+F739][U+F761..U+F77A]
> >
> >So base mode with \definefont works while node mode or the font
> >from the typescript doesn’t.
>
> For such tests you need to compare all cases:
>
> \nopdfcompression
>
> \setupbodyfont [iwona]
>
> \definefontfeature [withbaseone] [proto] [mode=base]
> \definefontfeature [withnodeone] [proto] [mode=node]
> \definefontfeature [withbasetwo] [proto]
> [mode=base,onum=yes,smcp=yes,script=dflt,lang=dflt]
> \definefontfeature [withnodetwo] [proto]
> [mode=node,onum=yes,smcp=yes,script=dflt,lang=dflt]
>
> \definefont [iwonabone] [file:Iwona-Regular.otf*withbaseone]
> \definefont [iwonanone] [file:Iwona-Regular.otf*withnodeone]
>
> \definefont [iwonabtwo] [file:Iwona-Regular.otf*withbasetwo]
> \definefont [iwonantwo] [file:Iwona-Regular.otf*withnodetwo]
>
> \starttext
>
> test 1, both modes:
>
> {\iwonanone 0123456789abcdefghijklmnopqrstuvwxyz}\par
> {\iwonabone 0123456789abcdefghijklmnopqrstuvwxyz}\par
> {\iwonantwo 0123456789abcdefghijklmnopqrstuvwxyz}\par
> {\iwonabtwo 0123456789abcdefghijklmnopqrstuvwxyz}\par
>
> % test 2, base only:
>
> % {\iwonabone 0123456789abcdefghijklmnopqrstuvwxyz}\par
> % {\iwonabtwo 0123456789abcdefghijklmnopqrstuvwxyz}\par
>
> % test 3, node only:
>
> % {\iwonanone 0123456789abcdefghijklmnopqrstuvwxyz}\par
> % {\iwonantwo 0123456789abcdefghijklmnopqrstuvwxyz}\par
>
> \stoptext
>
> i.e. get rid of potential interferences
>
> attached is what i get

Your PDF is flawless, but on my machine node mode produces the wrong
output. (Note that I run the LuaTeX beta-0.76.0-2013040516 that comes
with the minimals.)

PDF: https://phi-gamma.net/pdf/copypasta-hh.pdf

Here’s a link to the diff between your PDF and my output:

    http://pastie.org/private/zwnesrug7wpy4ket6ppl1g

This bug is quite elusive; can you think of anything that this
behavior might be a side effect of?

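In case it helps when comparing outputs, here is a minimal Python
sketch (not part of the test files above) for checking extracted text
for private-use codepoints. It assumes the mapping visible in the
pdftotext output quoted earlier, where 0-9 come out as U+F730..U+F739
and a-z as U+F761..U+F77A, i.e. the ASCII code plus 0xF700. The names
find_pua and fold_to_ascii are made up for illustration.

```python
# Assumption: the 0xF700 offset is inferred from the pdftotext output
# quoted in this thread; it is not taken from ConTeXt or LuaTeX sources.
PUA_START, PUA_END = 0xE000, 0xF8FF  # Unicode BMP Private Use Area
OFFSET = 0xF700


def find_pua(text):
    """Return (index, codepoint) pairs for every private-use character."""
    return [(i, ord(ch)) for i, ch in enumerate(text)
            if PUA_START <= ord(ch) <= PUA_END]


def fold_to_ascii(text):
    """Map U+F730..U+F739 and U+F761..U+F77A back to 0-9 and a-z."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 0xF730 <= cp <= 0xF739 or 0xF761 <= cp <= 0xF77A:
            out.append(chr(cp - OFFSET))  # undo the observed shift
        else:
            out.append(ch)  # leave everything else untouched
    return "".join(out)


if __name__ == "__main__":
    # Rebuild the kind of string pdftotext produced for the broken case.
    plain = "0123456789abcdefghijklmnopqrstuvwxyz"
    broken = "".join(chr(OFFSET + ord(c)) for c in plain)
    print(len(find_pua(broken)))   # number of private-use characters
    print(fold_to_ascii(broken))   # text after folding back to ASCII
```

Running find_pua over the output of `pdftotext file.pdf -` for each of
the test cases would show at a glance which font definitions are
affected, without having to eyeball the diff.
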
Philipp

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________