From mboxrd@z Thu Jan  1 00:00:00 1970
From: rsc@plan9.bell-labs.com
Message-Id: <200102020505.AAA22677@smtp4.fas.harvard.edu>
To: 9fans@cse.psu.edu
Subject: Re: [9fans] plan 9 wiki experiment
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 8bit
Date: Fri, 2 Feb 2001 00:05:00 -0500
Topicbox-Message-UUID: 56961f7c-eac9-11e9-9e20-41e7f4b1d025

> 0x20 is not part of a valid UTF-8 sequence except when it
> represents a space character. "[E6 97 20]" is not a valid
> UTF-8 sequence.

Right.  He was (intentionally or not) quoting the munged UTF
that was coming back from wikifs rather than the original.

It was actually that UTF sequences like 日 [E6 97 A5] were turning
into bogus sequences like [E6 97 20] in the wikifs parse routine
that condenses runs of whitespace into single spaces.

Russ
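
For illustration only, here is a small Python sketch of the point made above: [E6 97 A5] decodes to 日, while [E6 97 20] is rejected because 0x20 cannot follow a three-byte lead as a continuation byte. This is not the wikifs code, and the message does not say how the parse routine came to misclassify the byte. The names is_valid_utf8, is_continuation and condense_whitespace are made up for this sketch. The last function shows one way (an assumption, not the actual fix) to condense whitespace without splitting a multi-byte sequence: decode first, then collapse.

```python
import re


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def is_continuation(byte: int) -> bool:
    """UTF-8 continuation bytes have the bit pattern 10xxxxxx (0x80..0xBF)."""
    return byte & 0xC0 == 0x80


def condense_whitespace(data: bytes) -> bytes:
    """Collapse runs of ASCII whitespace into single spaces.

    Hypothetical helper: it decodes to characters before collapsing,
    so a byte inside a multi-byte sequence is never treated as a space.
    """
    text = data.decode("utf-8")
    return re.sub(r"[ \t\r\n]+", " ", text).encode("utf-8")


original = bytes([0xE6, 0x97, 0xA5])  # the sequence for 日 quoted in the message
munged = bytes([0xE6, 0x97, 0x20])    # the bogus sequence quoted in the message

if __name__ == "__main__":
    print(is_valid_utf8(original), is_valid_utf8(munged))
    print(is_continuation(0xA5), is_continuation(0x20))
```

Working on decoded characters rather than raw bytes is the design choice here: 0xA5 is only meaningful as the tail of the sequence started by 0xE6, so a byte-at-a-time rewrite has no safe way to replace it.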