From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/38811 Path: news.gmane.org!not-for-mail From: "Yue Wang" Newsgroups: gmane.comp.tex.context Subject: MKIV Chinese typesetting Date: Sun, 27 Jan 2008 17:51:21 +0800 Message-ID: <68bfdc900801270151u7e763fa6y7d8935fc35637b3f@mail.gmail.com> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="iso-2022-jp" Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1201427625 1367 80.91.229.12 (27 Jan 2008 09:53:45 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 27 Jan 2008 09:53:45 +0000 (UTC) To: ntg-context@ntg.nl Original-X-From: ntg-context-bounces@ntg.nl Sun Jan 27 10:54:03 2008 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by lo.gmane.org with esmtp (Exim 4.50) id 1JJ4DB-0001uC-Vd for gctc-ntg-context-518@m.gmane.org; Sun, 27 Jan 2008 10:54:02 +0100 Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 3D3971FBB8; Sun, 27 Jan 2008 10:53:31 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 14831-01-5; Sun, 27 Jan 2008 10:52:45 +0100 (CET) Original-Received: from ronja.vet.uu.nl (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 317531FB4E; Sun, 27 Jan 2008 10:52:42 +0100 (CET) Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 8F0561FB4E for ; Sun, 27 Jan 2008 10:52:17 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 17694-01 for ; Sun, 27 Jan 2008 10:51:34 +0100 (CET) Original-Received: from fg-out-1718.google.com (fg-out-1718.google.com [72.14.220.159]) by ronja.ntg.nl (Postfix) with ESMTP id A4A011FB33 for ; Sun, 27 Jan 2008 10:51:21 +0100 (CET) Original-Received: by fg-out-1718.google.com with SMTP id e12so1311855fga.8 for ; Sun, 27 Jan 2008 01:51:21 -0800 (PST) Original-Received: by 10.86.28.5 with SMTP id b5mr3922052fgb.47.1201427481209; Sun, 27 Jan 2008 01:51:21 -0800 (PST) Original-Received: by 10.86.26.1 with HTTP; Sun, 27 Jan 2008 01:51:21 -0800 (PST) Content-Disposition: inline X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.9 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:38811 Archived-At: Dear Hans and other friends: I play a little bit on MKIV's Chinese support and find it is incomplete. It has some serious problems, and lack important features as well. Well, I know it from mk.pdf that Chinese support is under construction, so in this email I just want to give some Chinese horizontal typesetting suggestions for development (vertical typesetting are more frequently used in Hongkong, Taiwan and Japan, so maybe people from there are eager to help): first, a features many Chinese people like: a lot of Chinese users write essays using full-width punct (that is, a punct mark is as the same size as the Chinese characters) and I found that the puncts are compressed well in context with good penalty settings(although there are minor problems in the pdf). however, sometimes it is up to the publisher which kind of punct marks are to be used, and usually many scientific books and papers use half-width puncts. so I think a new feature should be added to map all the Chinese puncts into english while at the same time, a space should be added after the English punct marks. that is, changing 中文,中文 into 中文, 中文 notice the space in the second example there is an opentype feature called hwid, but it should not be used because: - it changes the English alphabets as well - most of the time the function in most Chinese fonts is missing or buggy - it does not automatically add the space These are the minor problems in the mk.pdf: - pp118, penultimate example, box 2, line1, the ' punct mark should not appear at the end of the line - pp118, ultimate example, box 2, line2, in fact, if you want do perfect Chinese typesetting, all the puncts which begin a line or end a line should be closed to the margin line which makes the type block look good (especially when watch the right or left edge of the type block closely) while this ' has many space before it. In English, it is quite easy and the good old tex does a good job because all the letters and puncts are designed carefully, but in Chinese things are much difficult because all the glyphs including the puncts in all the Chinese fonts have the same width, and which makes the problem worse is that different Chinese fonts have different positions for these puncts. So a dynamic way which calculates the most left edge and most right edge of a punct glyph should be used, and it is not available in luatex:( will it be added later? second, and the most important, on the bilingual typesetting: nowadays English words are frequently used in Chinese, so great care must be taken to handle it. many Chinese users write their manuscript in one of the two ways: 中文 English 中文 or 中文English中文 The first way looks good when using a editor with mono space font and the second way is convenient for one to type. So both users will happy if both ways get the same result in context. in MKII, users are forced to use such kind of expression which drives them crazy and should be avoided: 中文English\ 中文 A small skip should be left between Chinese and English which makes the result much better. usually the space is a quarter of a chinese character width. A TeX expression should like: \hspace{0.25em plus 0.125em minus 0.08em} but I think it is better to introduce a \setupxxxwidth for the user. The last important thing for English and Chinese bi-lingual typesetting is that: do not use English glyphs in Chinese fonts because: - recently English glyphs in most of the Chinese fonts are ugly - users have no right to modify the Chinese fonts - many other reasons so when set up a Chinese typeface, it is better to leave a feature for the user to choose the accompany english typeface. while typesetting, Chinese typography rules do not interfere with the english one, that is, chinese part and english part use their own line break model. here are two bugs in recent context: - 中文 English English English 中文will produce 中文EnglishEnglishEnglish中文, the spaces between english words are all missing. - the following script produce an error: Invalid field id penalty for node type glyph (1). Here is a sample: \definefontfeature[chinese][mode=node,script=hang,lang=zht,script=hani,lang=dlft] \definefontsynonym[songti][name:AdobeSongStd-Light][features=Chinese] \definefontsynonym[Serif][songti] definetypeface[song][rm][serif][songti] \setupbodyfont[song,12pt,rm] \starttext 中文 English: 中文 \stoptext third, indenting: Chinese paragraphs are indented with two Chinese characters, and we also indent the first paragraph. context is so powerful that it is easy to do these kind of things, thank you:) and i think it is better to write it into the Chinese typesetting module? Urgency: bi-lingual typesetting > fix the wrong panelty > map the english punct maps as an alternative > indenting > puncts should be close to margin lines. I am eager to see full support for chinese and other Asian languages (and I am eager to help too), and i hope my suggestions here are helpful to the developers. I am grateful to Hans, Taco and and many other guys (eg. Zhichu Chen) that did a very good job in recent MKIV Chinese support. Thank you for your time and effort for continuous contributions! Yue Wang ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___________________________________________________________________________________