From mboxrd@z Thu Jan 1 00:00:00 1970
From: erik quanstrom
Date: Mon, 19 Oct 2009 09:55:48 -0400
To: 9fans@9fans.net
Message-ID:
In-Reply-To: <>
References: <>
MIME-Version: 1.0
Content-Type: text/plain; charset="US-ASCII"
Content-Transfer-Encoding: 7bit
Subject: Re: [9fans] utf-8 text files from httpd
Topicbox-Message-UUID: 8a909170-ead5-11e9-9d60-3106f5b1d025

On Mon Oct 19 09:51:33 EDT 2009, rogpeppe@gmail.com wrote:
> there's another problem with file -m that
> i've been bitten by before: it ignores any
> stuff after the first 6000 bytes.
>
> so if you've got a mostly-ascii file with some
> utf-8 characters 8K in, then it won't be picked up.
>
> i think file -m should read the whole file, but that's just IMHO.

a relic trying to avoid ken's read ahead and firing
up the worm drives.

why try that hard? just call it utf-8. i can't think
of any browsers that would have a problem with that today.

- erik
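
To make the quoted failure mode concrete, here is a minimal Python sketch. It is not the Plan 9 file(1) code; the names classify, classify_prefix, and PREFIX_LIMIT are hypothetical, and the 6000-byte figure is taken only from the quoted message. It assumes a classifier that looks at nothing past a fixed prefix, and shows how a mostly-ascii file with utf-8 about 8K in gets labelled ascii.

```python
# Illustrative sketch only; not how Plan 9's file(1) is implemented.
# PREFIX_LIMIT mirrors the "first 6000 bytes" figure from the quoted message.
PREFIX_LIMIT = 6000


def classify(data: bytes) -> str:
    """Label a byte string as 'ascii', 'utf-8', or 'binary'."""
    try:
        data.decode("ascii")
        return "ascii"
    except UnicodeDecodeError:
        pass
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        return "binary"


def classify_prefix(data: bytes, limit: int = PREFIX_LIMIT) -> str:
    """Label the data using only its first `limit` bytes (hypothetical helper)."""
    return classify(data[:limit])


# A mostly-ascii file with a utf-8 character 8K in, as in the quoted example.
sample = b"a" * 8192 + "caf\u00e9\n".encode("utf-8")

print(classify_prefix(sample))  # looks only at the first 6000 bytes
print(classify(sample))         # looks at the whole file
```

With this sample the prefix check reports ascii and the whole-file check reports utf-8. Since every ascii file is also valid utf-8, labelling both cases utf-8 sidesteps the difference, which is the suggestion made above.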