From mboxrd@z Thu Jan 1 00:00:00 1970
From: Tristan <9p-st@imu.li>
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
In-Reply-To: <20130503111644.GA509@polynum.com>
References: <20130502193952.GA662@polynum.com><257b5f.5863bd23.PHUY.mx@tumtum.plumbweb.net> <20130503111644.GA509@polynum.com>
Message-Id: <257b60.0d6a2e59.xtua.mx@tumtum.plumbweb.net>
Date: Fri, 3 May 2013 09:15:27 -0400
User-Agent: mx
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
Subject: Re: [9fans] Octets regexp
Topicbox-Message-UUID: 52e2cd62-ead8-11e9-9d60-3106f5b1d025

> > if we're talking about xd, i'll suggest 'tcs -f 8859-1' again in which case:
> My question was _not_ related to text, and _not_ related to "french" i.e.
> 8859-1. I know how to deal with this.

tcs -f 8859-1 will take your _binary_ files and replace the bytes
0x80-0xff with the unicode code points U+0080-U+00ff (bytes 0x00-0x7f
pass through unchanged), so you can use the standard regexps and tools
on them. then just convert back afterwards with tcs -t 8859-1.

maybe it's not meant to be used that way, but it _works_. try it.

have fun!
tristan

-- 
All original matter is hereby placed immediately under the public domain.
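
p.s. for anyone who wants to see the idea without tcs at hand, here is a rough sketch of the same round trip in python. it is only an illustration, not what tcs does internally: it assumes python's "latin-1" codec (which maps every byte 0x00-0xff to the code point with the same number) as a stand-in for 'tcs -f 8859-1' and 'tcs -t 8859-1', and the helper names to_text, to_bytes and sub_octets are made up for this example.

```python
import re


def to_text(data: bytes) -> str:
    # Stand-in for 'tcs -f 8859-1': each byte 0xNN becomes code point U+00NN.
    return data.decode("latin-1")


def to_bytes(text: str) -> bytes:
    # Stand-in for 'tcs -t 8859-1': each code point U+00NN becomes byte 0xNN.
    # Raises UnicodeEncodeError if the text contains anything above U+00FF.
    return text.encode("latin-1")


def sub_octets(pattern: str, repl: str, data: bytes) -> bytes:
    # Run an ordinary text regexp over arbitrary octets, then convert back.
    # (hypothetical helper; repl must itself stay within U+0000-U+00FF)
    return to_bytes(re.sub(pattern, repl, to_text(data)))


if __name__ == "__main__":
    blob = bytes([0x00, 0xFE, 0xFF, 0x41, 0x80, 0xFF])

    # The conversion is lossless in both directions.
    assert to_bytes(to_text(blob)) == blob

    # Replace every run of "high" octets (0x80-0xff) with a single dot.
    print(sub_octets("[\u0080-\u00ff]+", ".", blob))  # prints b'\x00.A.'
```

the point is the same as with tcs: the mapping is one-to-one, so nothing is lost as long as whatever you do in between keeps every character at or below U+00ff.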