From mboxrd@z Thu Jan 1 00:00:00 1970 Date: Tue, 16 Mar 2010 00:02:51 +0300 From: anonymous To: 9fans@9fans.net Message-ID: <20100315210251.GA26934@machine> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Subject: [9fans] hard-coded UTF-8 in wc.c Topicbox-Message-UUID: e9652a9e-ead5-11e9-9d60-3106f5b1d025 Just looked at source of wc (http://plan9.bell-labs.com/sources/plan9/sys/src/cmd/wc.c). UTF-8 is hard-coded here. What is the reason? Nobody wants to rewrite it, it is optimization or it is impossible to rewrite it using runes for some reason? http://plan9.bell-labs.com/sys/doc/utf.html says all you need to do to change encoding is: 1. Rewrite UTF encoding/decoding code. 2. Convert all text files. 3. Recompile all software. Looks like it is impossible with current code. It is not fixed just because there is more important work or there is some serious problem in design?