From mboxrd@z Thu Jan  1 00:00:00 1970
Date: Fri,  3 May 2013 18:33:42 +0200
From: tlaronde@polynum.com
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
Message-ID: <20130503163342.GB557@polynum.com>
References: <20130503111644.GA509@polynum.com>
	<257b60.0d6a2e59.xtua.mx@tumtum.plumbweb.net>
Mime-Version: 1.0
In-Reply-To: <257b60.0d6a2e59.xtua.mx@tumtum.plumbweb.net>
User-Agent: Mutt/1.4.2.3i
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Subject: Re: [9fans] Octets regexp
Topicbox-Message-UUID: 544ecafc-ead8-11e9-9d60-3106f5b1d025

On Fri, May 03, 2013 at 09:15:27AM -0400, Tristan wrote:
>
> tcs -f 8859-1
>
> will take your _binary_ files, and replace the bytes 0x80-0xff with the
> unicode points U0080-U00ff, so you can use the standard regexps and tools
> on them. and just convert back afterwards.
>

OK, mea culpa... since I'm french, I focused on the latin1 thinking
this has something to do with my language and the custom to deal with
latin1 on other systems.

I guess I could create a keyboard that produces not UTF-8 but bytes
so to have a mean to input bytes (without resorting to printf or
whatever). Remains the problem of the rendering (or create a
special font that displays octal, hexadecimal or whatever playing
with the index of the glyphes; but this will work for octets, and will
be more difficult if one wants to deal with wydes; impossible with
tetras and octas).

--
	Thierry Laronde <tlaronde +AT+ polynum +dot+ com>
		      http://www.kergis.com/
Key fingerprint = 0FF7 E906 FBAF FE95 FD89  250D 52B1 AE95 6006
F40C