From mboxrd@z Thu Jan 1 00:00:00 1970 To: 9fans@cse.psu.edu From: "Douglas A. Gwyn" Message-ID: Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit References: , <00ec01c37784$cbe421e0$c901a8c0@cc77109e> Subject: Re: [9fans] ... in the Kingdom of Sources Date: Thu, 11 Sep 2003 09:07:30 +0000 Topicbox-Message-UUID: 30c64846-eacc-11e9-9e20-41e7f4b1d025 Bruce Ellis wrote: > I guess I somehow missed what is hard about UTF-8 synchronization. We weren't talking about a supposed problem with UTF-8, but about a supposed problem with Standard C wide-character facilities. The Standard C facility has to deal with a much wider variety of multibyte encodings. In any event, flagging a conversion error only by a specific value embedded in the converted data means that the error will be missed unless the application scans the data looking for it. Seems to me that encourages use of erroneous data.