From mboxrd@z Thu Jan 1 00:00:00 1970 Date: Fri, 3 May 2013 18:33:42 +0200 From: tlaronde@polynum.com To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net> Message-ID: <20130503163342.GB557@polynum.com> References: <20130503111644.GA509@polynum.com> <257b60.0d6a2e59.xtua.mx@tumtum.plumbweb.net> Mime-Version: 1.0 In-Reply-To: <257b60.0d6a2e59.xtua.mx@tumtum.plumbweb.net> User-Agent: Mutt/1.4.2.3i Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Subject: Re: [9fans] Octets regexp Topicbox-Message-UUID: 544ecafc-ead8-11e9-9d60-3106f5b1d025 On Fri, May 03, 2013 at 09:15:27AM -0400, Tristan wrote: > > tcs -f 8859-1 > > will take your _binary_ files, and replace the bytes 0x80-0xff with the > unicode points U0080-U00ff, so you can use the standard regexps and tools > on them. and just convert back afterwards. > OK, mea culpa... since I'm french, I focused on the latin1 thinking this has something to do with my language and the custom to deal with latin1 on other systems. I guess I could create a keyboard that produces not UTF-8 but bytes so to have a mean to input bytes (without resorting to printf or whatever). Remains the problem of the rendering (or create a special font that displays octal, hexadecimal or whatever playing with the index of the glyphes; but this will work for octets, and will be more difficult if one wants to deal with wydes; impossible with tetras and octas). -- Thierry Laronde http://www.kergis.com/ Key fingerprint = 0FF7 E906 FBAF FE95 FD89 250D 52B1 AE95 6006 F40C