From: Oleg Kolosov
Newsgroups: gmane.comp.tex.context
Subject: Re: searchable cyrillic in PDF files
Date: Sun, 29 Jun 2008 01:59:46 +0400
Message-ID: <4866B452.7010906@mail.ru>
To: mailing list for ConTeXt users

Hans Hagen wrote:
> Oleg Kolosov wrote:
>
>> Codes seem to be in place but don't match the actual font in T2A encoding.
>> For example, I have Cyrillic capital A in the font at 00C1, where in pdfr-def
>> this is Aacute. According to enco-utf, my 00C1 in the font should map to the
>> 0410 position (I hope this is an understandable description). Maybe there is
>> some switch to enable such a mapping? I don't understand these
>> encoding/mapping issues enough to create the necessary table myself. Maybe
>> you can provide some example, so I will be able to help?
>
> it is unrelated to the font encoding. so, in your font 00C1 has a name
> (say CyrillicWhatever) and pdftex needs to associate the name with the
> unicode number
>
> it looks like in pdfr-def cyrillic is just missing
>
> i uploaded a beta with more mappings
>
> Hans

I've tested with the new beta as well as the latest stable version (2008.05.21) and found that the font map file was not being loaded properly, so this was an error on my side. Sorry. Copy/paste now works in both versions.
-- 
Best Regards,
Oleg Kolosov
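
For readers following the thread: the association Hans describes, between a glyph's name and a Unicode number, can be sketched with pdfTeX's own primitives. This is only an illustration and not the contents of pdfr-def or of the uploaded beta. It assumes pdfTeX 1.40 or later and assumes the T2A font names its Cyrillic glyphs with the Adobe Glyph List names (afii10017 and so on); a font with other glyph names would need those names instead.

```tex
% Minimal sketch, assuming pdfTeX >= 1.40 and AGL-style glyph names.
% The glyph names below are an assumption about the font, not taken
% from the thread; check the font's encoding (.enc) file for the real ones.

\pdfgentounicode=1                    % ask pdfTeX to write ToUnicode CMaps

\pdfglyphtounicode{afii10017}{0410}   % CYRILLIC CAPITAL LETTER A
\pdfglyphtounicode{afii10018}{0411}   % CYRILLIC CAPITAL LETTER BE
\pdfglyphtounicode{afii10065}{0430}   % CYRILLIC SMALL LETTER A
```

The slot number (00C1 in the T2A font) does not appear here at all, which is the point made in the quoted reply: pdfTeX looks up the glyph name found at that slot, so the font map file must be loaded correctly for the names to be known in the first place.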