From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/31460 Path: news.gmane.org!not-for-mail From: "Dohyun Kim" Newsgroups: gmane.comp.tex.context Subject: bugs found while trying to support utf-8 Korean Hangul under ConTeXt Date: Sat, 21 Oct 2006 13:19:42 +0900 Message-ID: Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1161413650 19624 80.91.229.2 (21 Oct 2006 06:54:10 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 21 Oct 2006 06:54:10 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Sat Oct 21 08:54:08 2006 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1GbAk9-0008Kw-BC for gctc-ntg-context-518@m.gmane.org; Sat, 21 Oct 2006 08:54:05 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 9D66A1FE65; Sat, 21 Oct 2006 08:54:03 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 24579-05; Sat, 21 Oct 2006 08:53:54 +0200 (CEST) Original-Received: from ronja.vet.uu.nl (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 6C4361FDEF; Sat, 21 Oct 2006 08:53:54 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 06B231FF31 for ; Sat, 21 Oct 2006 06:19:44 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 23349-05 for ; Sat, 21 Oct 2006 06:19:43 +0200 (CEST) Original-Received: from nf-out-0910.google.com (nf-out-0910.google.com [64.233.182.185]) by ronja.ntg.nl (Postfix) with SMTP id F39E11FF23 for ; Sat, 21 Oct 2006 06:19:42 +0200 (CEST) Original-Received: by nf-out-0910.google.com with SMTP id h2so1694884nfe for ; Fri, 20 Oct 2006 21:19:42 -0700 (PDT) Original-Received: by 10.82.152.16 with SMTP id z16mr836114bud; Fri, 20 Oct 2006 21:19:42 -0700 (PDT) Original-Received: by 10.82.173.6 with HTTP; Fri, 20 Oct 2006 21:19:42 -0700 (PDT) Original-To: ntg-context@ntg.nl Content-Disposition: inline X-Virus-Scanned: amavisd-new at ntg.nl X-Mailman-Approved-At: Sat, 21 Oct 2006 08:53:53 +0200 X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.7 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:31460 Archived-At: Hi, http://210.94.201.157/~nomos/KTUG/context-korean-test.tar.gz (3.4Mb) Recently I have tried to support typesetting UTF-8 Korean document under ConTeXt. In the course of that trying, I found some possible bugs of recent ConTeXt beta. You can see most of the bugs (or missing features) I found in the file named `testme.tex', which is contained in the tarball linked above. For those who do not want to download that large tarball, I here replicate those bugs or missing features: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % should be added to unic-cjk.tex % Hangul Syllables \dostepwiserecurse{172}{215}{1}{\expanded{\defineunicodecommand {\recurselevel}} {\lookaheaduchar}} % CJK Compatibility Ideographs \dostepwiserecurse{249}{250}{1}{\expanded{\defineunicodecommand {\recurselevel}} {\lookaheaduchar}} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % modify \enableunicodefont in font-uni.tex \def\enableunicodefont#1% {\definefontsynonym[\s!Unicode][\getvalue{\??uc#1\c!file}]% \def\unicodescale {\getvalue{\??uc#1\c!scale}}% \def\unicodeheight {\getvalue{\??uc#1\c!height}}% \def\unicodedepth {\getvalue{\??uc#1\c!depth}}% \def\unicodedigits {\getvalue{\??uc#1\c!conversion}}% \def\handleunicodeglyph {\getvalue{\??uc#1\c!command}}% \doifnot\currentregime{utf}{\enableregime[unicode]}% % the following \relax's are realy needed \doifvalue{\??uc#1\c!interlinespace}\v!yes \setupinterlinespace\relax \doifvalue{\??uc#1\c!strut}\v!yes {\setunicodestrut\unicodeheight\unicodedepth}% <- `%' inserted by nomos {\resetunicodestrut}% \getvalue{\??uc#1\c!commands}\relax} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % modify \sanitizePDFuniencoding in spec-tst.tex \long\def\sanitizePDFuniencoding#1\to#2% {\enablePDFunicrlf \simplifycommands % <- this line inserted by nomos \let\unicodechar\relax % prevent further expansion \retainlccodes\lccode32=255 % slooow \lowercasestring\PDFunicodetrigger#1\to#2% \edef#2{\expandafter\doPDFuni#2\empty\empty}} % slooow %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % modify mps file insertion code in spec-dpx.tex \definefileinsertion{dpx}{mps} % tpd -> dpx modified by nomos {\hbox {\convertMPtoPDF\@@DriverImageFile{1}{1}% \global\let\PDFimagereference\empty}} Certainly, the last is not problem confined to Hangul typesetting. As you know, however, CJK peoples do use heavily dvipdfmx to obtain PDF. Anyway, I hope these four files be modified in the near future. In addition, when I compiled LaTeX2ConTeXt.tex by Berend de Boer ( http://www.berenddeboer.net/tex/LaTeX2ConTeXt.tex ), I found there being inserted unintended large vertical whitespace in page 27 of resulting PDF. Though it seems to be a bug related to \setupbackground, I could not find the cause of problem as the code related to that command is too complicated for me. -- Dohyun Kim http://people.ktug.or.kr/~nomos/