········· > On 8/2/2013 12:12 AM, Philipp Gesang wrote: > >········· > > > >>On 2013–08–01 Philipp Gesang wrote: > >> > >>>········· > >>> > >>>>I tested your example: no problem here on Mac OS X 10.8.4, with > >>>>either TeXShop, Adobe Redaer or Preview, with the latest beta > >>>>(ConTeXt ver: 2013.08.01 01:31 MKIV beta fmt: 2013.8.1 int: > >>>>english/english). > >>> > >>>x64 linux here, but it’s the same with the windows version in > >>>wine32. I get the bad output with okular (poppler), acroread, and > >>>mupdf, but strangely not with zathura (mupdf-based). > >> > >>Just to add to the list: > >> > >>x64 linux here, and it works with the following poppler based > >>viewers (zathura-poppler, xpdf, evince) I’m on a different machine now: the problem affects linux x86 and pdftotext as well. Also, in xpdf I get smallcaps copied as uppercase instead of lowercase. > i'm a bit puzzled > > >For those who want to test the git version, the commits are: > > > > last good: a61813ccdd4b7bcc81932317e1360fda6c79962d > > first bad: 6b2f7c5fd7a3e465f4e2662b1e5bd2c9d5cce8f8 > > > >Don’t forget to delete the cache. > > > >I suspect I found the troublesome changes. The problem vanishes > >if I revert this modification to font-map.lua: > > > > -local separator = S("_.") > > -local other = C((1 - separator)^1) > > -local ligsplitter = Ct(other * (separator * other)^0) > > +local ligseparator = P("_") > > +local varseparator = P(".") > > +local namesplitter = Ct(C((1 - ligseparator - varseparator)^1) * (ligseparator * C((1 - ligseparator - varseparator)^1))^0) > > > >and then further down: > > > > - local split = lpegmatch(ligsplitter,name) > > <...> > > + local split = lpegmatch(namesplitter,name) > > > >For convenience I repeat the link to the changeset: > > what do you revert from ... the + things are already in the file I’m quoting from the changeset, so the “-” lines indicate the good version. > > http://repo.or.cz/w/context.git/commitdiff/6b2f7c5fd7a3e465f4e2662b1e5bd2c9d5cce8f8 > > btw, this bit of code is evolving (was recently adapt to some border > case fonts that use their own rules) > > anyhow, on my win8 system the beta works with sumatra, okular and > acrobat (indeed one might need to wipe the cache, but i can > increment the version number) Weird. Here’s a PDF of the code I posted compiled with version “2013.08.01 01:31” and how pdftotext renders it: https://phi-gamma.net/pdf/copypasta.pdf https://phi-gamma.net/files/copypasta.txt I definitely get   from this one. The characters are mapped from the private use area: <...> 30 beginbfchar <0409> <0416> <0418> <0423> <042A> <0435> <...> Can someone reproduce it at all? Philipp