From mboxrd@z Thu Jan 1 00:00:00 1970 To: 9fans@cse.psu.edu Subject: Re: [9fans] plan 9 wiki experiment From: okamoto@granite.cias.osakafu-u.ac.jp MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="upas-hemcltmaaalzltkdvnewzayurw" Message-Id: <20010202081215.9008F199E4@mail.cse.psu.edu> Date: Fri, 2 Feb 2001 17:12:15 +0900 Topicbox-Message-UUID: 56b84476-eac9-11e9-9e20-41e7f4b1d025 This is a multi-part message in MIME format. --upas-hemcltmaaalzltkdvnewzayurw Content-Disposition: inline Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8bit You are quite right. It was my misunderstanding. The right byte sequence of もしもPlan 9で日本語を使いたいなら、私 are [E3 82 82][E3 81 97][E3 82 82][Plan 9][E3 81 A7][E6 97 A5] [E6 9C AC][E8 AA 9E][E3 82 92][E4 BD BF][E3 81 84][E3 81 9F] [E3 81 84][E3 81 AA][W3 82 89][E3 80 81][E7 A7 81]. Anyway, thank you very much Russ, now we can use UTF-8 on your Wiki server. I replaced the bogus one. Kenji --upas-hemcltmaaalzltkdvnewzayurw Content-Type: message/rfc822 Content-Disposition: inline Received: from granite.cias.osakafu-u.ac.jp ([192.168.1.3]) by granite.cias.osakafu-u.ac.jp; Fri Feb 2 13:41:01 JST 2001 Received: from elmo.cias.osakafu-u.ac.jp (elmo.cias.osakafu-u.ac.jp [157.16.103.2]) by granite.cias.osakafu-u.ac.jp (8.9.3/8.9.3) with ESMTP id OAA25244; Fri, 2 Feb 2001 14:05:48 +0900 Received: from mail.cse.psu.edu (postfix@psuvax1.cse.psu.edu [130.203.4.6]) by elmo.cias.osakafu-u.ac.jp (8.9.3/3.7W-01020211) with ESMTP id OAA26638; Fri, 2 Feb 2001 14:06:23 +0900 (JST) Received: from psuvax1.cse.psu.edu (psuvax1.cse.psu.edu [130.203.18.6]) by mail.cse.psu.edu (CSE Mail Server) with ESMTP id 4757B199EC; Fri, 2 Feb 2001 00:06:07 -0500 (EST) Received: from smtp4.fas.harvard.edu (smtp4.fas.harvard.edu [140.247.30.84]) by mail.cse.psu.edu (CSE Mail Server) with ESMTP id E0015199E7 for <9fans@cse.psu.edu>; Fri, 2 Feb 2001 00:05:02 -0500 (EST) Received: from plan9.cs.bell-labs.com (roam183-121.student.harvard.edu [140.247.183.121]) by smtp4.fas.harvard.edu with SMTP id AAA22677; Fri, 2 Feb 2001 00:05:02 -0500 (EST) From: rsc@plan9.bell-labs.com Message-Id: <200102020505.AAA22677@smtp4.fas.harvard.edu> To: 9fans@cse.psu.edu Subject: Re: [9fans] plan 9 wiki experiment MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Sender: 9fans-admin@cse.psu.edu Errors-To: 9fans-admin@cse.psu.edu X-BeenThere: 9fans@cse.psu.edu X-Mailman-Version: 2.0.1 Precedence: bulk Reply-To: 9fans@cse.psu.edu List-Id: Fans of the OS Plan 9 from Bell Labs <9fans.cse.psu.edu> List-Archive: Date: Fri, 2 Feb 2001 00:05:00 -0500 Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by granite.cias.osakafu-u.ac.jp id OAA25244 0x20 is not part of a valid UTF-8 sequence except when it represents a space character. "[E6 97 20]" is not a valid UTF-8 sequence. Right. He was (intentionally or not) quoting the munged UTF that was coming back from wikifs rather than the original. It was actually that UTF sequences like =E6=97=A5 [E6 97 A5] were turning into bogus sequences like [E6 97 20] in the wikifs parse routine that condenses runs of whitespace into single spaces. Russ --upas-hemcltmaaalzltkdvnewzayurw--