From mboxrd@z Thu Jan 1 00:00:00 1970
Message-ID: <20031027160618.14972.qmail@g.bio.cse.psu.edu>
To: 9fans@cse.psu.edu
Subject: Re: [9fans] kfs un-removable file
In-reply-to: <3ff7aff7e446ebd1fe0a788c93d5d844@vitanuova.com>
References: <3ff7aff7e446ebd1fe0a788c93d5d844@vitanuova.com>
From: Scott Schwartz
Date: Mon, 27 Oct 2003 11:06:18 -0500
Topicbox-Message-UUID: 79fdae5a-eacc-11e9-9e20-41e7f4b1d025

| invalid utf sequences, aren't there several possible
| utf sequences that can validly map to the same character?

I think only the shortest such sequence is supposed to be allowed,
so maybe calling it an error is better than canonicalizing.

On the other hand, Tcl uses a multibyte encoding of \0 to handle
embedded nuls.  That seems like a useful hack.
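
To make the shortest-form rule concrete, here is a minimal sketch of a decoder that treats an overlong sequence as an error instead of canonicalizing it. It is an illustration only, not the kfs or Plan 9 library code. The names decode_one, MIN_FOR_LENGTH, and allow_c080_nul are made up for this example, and the switch assumes the multibyte \0 mentioned above is the two-byte form C0 80.

```python
# Smallest code point that legitimately needs a sequence of each length.
MIN_FOR_LENGTH = {1: 0x00, 2: 0x80, 3: 0x800, 4: 0x10000}


def decode_one(buf, allow_c080_nul=False):
    """Decode the first UTF-8 sequence in buf; return (code_point, length).

    Raises ValueError for truncated, malformed, or overlong sequences.
    allow_c080_nul is a hypothetical switch that accepts the two-byte
    form C0 80 as U+0000 (assumed here to be the embedded-nul trick).
    """
    if not buf:
        raise ValueError("empty input")
    b0 = buf[0]
    if b0 < 0x80:
        return b0, 1                      # plain ASCII, always shortest
    if 0xC0 <= b0 < 0xE0:
        n, cp = 2, b0 & 0x1F
    elif 0xE0 <= b0 < 0xF0:
        n, cp = 3, b0 & 0x0F
    elif 0xF0 <= b0 < 0xF8:
        n, cp = 4, b0 & 0x07
    else:
        raise ValueError("invalid lead byte")
    if len(buf) < n:
        raise ValueError("truncated sequence")
    for b in buf[1:n]:
        if b & 0xC0 != 0x80:
            raise ValueError("invalid continuation byte")
        cp = (cp << 6) | (b & 0x3F)
    # The value fits in fewer bytes than were used: an overlong form.
    if cp < MIN_FOR_LENGTH[n]:
        if allow_c080_nul and bytes(buf[:n]) == b"\xc0\x80":
            return 0, 2
        raise ValueError("overlong sequence")
    if cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("code point out of range")
    return cp, n
```

With this, b"/" decodes normally, while the two-byte spelling C0 AF of the same character is refused. That matters for file names in particular: a check for "/" that only looks for the byte 2F would otherwise miss the overlong form. Passing allow_c080_nul=True lets C0 80 through as U+0000 and nothing else.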