From: Adam Lindsay
Newsgroups: gmane.comp.tex.context
Subject: Re: Chinese
Date: Fri, 09 Dec 2005 14:43:07 +0000
Message-ID: <439997FB.3000600@comp.lancs.ac.uk>
Reply-To: mailing list for ConTeXt users

Tobias Burnus wrote:
> Hi,
>
> Xiao Jianfeng wrote:
>
>>> What would be needed to get UTF-8 input running with Chinese?
>>
>> If you use vim to edit your tex file, maybe you can try "set
>> encoding=utf8", then save and compile.
>> As far as I know, GBK is compatible with unicode.
>
> No, that does not work - that is the reason I started this mail thread.
> You get the wrong characters and you may get some TeX errors.
> (And that is the reason Lutz wrote a UTF-8 to GBK converter.)

Hmm, I suspect that some remix of my old (deprecated) "Libertine in
ConTeXt" recipe and the ttf2tfm automatic Unicode splitting would have
some positive effects.

(I would discourage using that recipe for alphabetic (incl. Roman)
Unicode fonts, because it blows away any kerning that would happen
between Unicode blocks. Is there less kerning among CJK fonts? I would
expect so.)

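To make the "Unicode splitting" idea concrete, here is a minimal sketch of the arithmetic. It assumes a Unicode.sfd-style layout, where a large font is cut into subfonts of 256 glyphs each and every subfont is named after the high byte of the code point. The function names, the base name "cjkfont", and the lowercase two-digit hex suffix are illustrative assumptions, not what ttf2tfm or font-uni literally do.

```python
from typing import Tuple


def split_codepoint(cp: int) -> Tuple[str, int]:
    """Split a BMP code point into (subfont suffix, glyph slot).

    Assumption: 256 glyphs per subfont, suffix = high byte as two
    lowercase hex digits, slot = low byte (0..255).
    """
    if not 0 <= cp <= 0xFFFF:
        raise ValueError("this sketch only handles BMP code points")
    return f"{cp >> 8:02x}", cp & 0xFF


def subfont_for(base: str, ch: str) -> Tuple[str, int]:
    """Return (subfont name, glyph slot) for one character.

    `base` is a hypothetical font family name used for illustration.
    """
    suffix, slot = split_codepoint(ord(ch))
    return base + suffix, slot


if __name__ == "__main__":
    # U+4E2D lands in subfont "4e" at slot 0x2D.
    print(subfont_for("cjkfont", "\u4e2d"))
```

Since neighbouring characters can land in different subfonts, no kerning pair can span them, which is the loss mentioned above.
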
Thinking aloud, you'd probably want to include some language-switching
commands to mediate between two paths: calling Unicode fonts for
un-named CJK glyphs (just a raw conversion from Unicode to font switch +
glyph number), and handling named Roman (and other alphabetic) glyphs
(conversion from UTF-8 to named glyphs to font + glyph, which retains
kerning where it can).

I know it's sketchy and vague, but have a look inside font-uni. It's not
the most complicated file in the distro.

adam
-- 
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
 Adam T. Lindsay, Computing Dept.     atl@comp.lancs.ac.uk
 Lancaster University, InfoLab21        +44(0)1524/510.514
 Lancaster, LA1 4WA, UK             Fax:+44(0)1524/510.492
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-