From: Hans Hagen
Newsgroups: gmane.comp.tex.context
Subject: Re: Error while compiling : ! String contains an invalid utf-8 sequence.
Date: Wed, 07 Dec 2011 18:45:52 +0100
Message-ID: <4EDFA650.5040402@wxs.nl>
References: <015801ccaba4$df06e8e0$9d14baa0$@gmx.de> <4ECFF60D.1090804@wxs.nl> <4EDF23B8.5000304@uni-bonn.de>
To: mailing list for ConTeXt users

On 7-12-2011 17:37, Alexandre Krispin wrote:
> Thank you for your answer !
>
> Finally I am back to Latex until this problem is fixed.
> Though I have no knowledge about it, it seemed to me that the problem I got
> was more related to hanzi script than to bibtex, since the final problem of
> misplaced dots I had appeared only with hanzi script.
>
> Hans Hagen did not reply to it. That's why I suppose that one day, if this
> problem related to hanzi script becomes a problem of general interest,
> maybe it will be fixed...

As mentioned on this list, using a unicode (utf) engine with bibtex is somewhat tricky. Interestingly, a similar problem occurred with urls: bibtex has hard-coded line lengths and, if I'm right, there has been a fix that made lines wrap on spaces. However, that does not help with scripts that have no spaces.

Somewhat strange is that bibtex was never brought in line with other tex related programs, where one can configure the line length (which we would then set to 1000 or so). In spite of repeated announcements of bibtex 1.0 (which would probably not fix this issue anyway), it looks like the program is deep frozen. Unfortunately it has also frozen some conventions that make it hard to replace.

Fixing it is probably something that an able programmer could do while watching a movie at the same time, were it not that touching bibtex seems to be some kind of sacrilege. I can imagine that someone makes a copy of bibtex, say ctxbibtex, patches it to accept long lines, and that we ship that one with context.

Hans

-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com | www.pragma-pod.nl
-----------------------------------------------------------------
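The point in the message about wrapping on spaces can be illustrated with a small sketch. This is not bibtex's code: `wrap_on_spaces`, `overlong`, `splits_character`, and the limit of 20 are all hypothetical, chosen only to show why a space-based wrap cannot shorten text in a script that has no spaces. The last function shows one way a hard cut at a fixed byte length could produce an invalid utf-8 sequence; whether that is what happened in this thread is an assumption, not something the message confirms.

```python
def wrap_on_spaces(text: str, max_len: int) -> list[str]:
    """Greedy wrap that breaks only at spaces, never inside a word."""
    lines: list[str] = []
    current = ""
    for word in text.split(" "):
        if not current:
            current = word
        elif len(current) + 1 + len(word) <= max_len:
            current += " " + word
        else:
            lines.append(current)
            current = word
    if current:
        lines.append(current)
    return lines


def overlong(lines: list[str], max_len: int) -> list[str]:
    """Return the lines that still exceed the limit after wrapping."""
    return [line for line in lines if len(line) > max_len]


def splits_character(data: bytes, cut: int) -> bool:
    """True if cutting the utf-8 bytes at `cut` lands inside a character."""
    try:
        data[:cut].decode("utf-8")
    except UnicodeDecodeError:
        return True
    return False


MAX_LEN = 20  # hypothetical limit, kept small for illustration

latin = "a title written in a script that separates words with spaces"
hanzi = "漢字" * 15  # 30 characters, no spaces anywhere

# Text with spaces wraps cleanly; nothing exceeds the limit.
print(overlong(wrap_on_spaces(latin, MAX_LEN), MAX_LEN))       # prints []

# Text without spaces is one unbreakable "word" and stays too long.
print(len(overlong(wrap_on_spaces(hanzi, MAX_LEN), MAX_LEN)))  # prints 1

# Each of these characters takes 3 bytes in utf-8, so a cut at byte 20
# falls in the middle of a character.
print(splits_character(hanzi.encode("utf-8"), MAX_LEN))        # prints True
```

A configurable limit, as suggested in the message, would sidestep both effects for ordinary entries: with the limit set to 1000 or so, no wrap or cut is attempted on lines of this size.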