From mboxrd@z Thu Jan 1 00:00:00 1970
From: erik quanstrom
Date: Mon, 30 Nov 2009 08:50:02 -0500
To: 9fans@9fans.net
Message-ID:
In-Reply-To: <>
References: <>
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 8bit
Subject: Re: [9fans] =?utf-8?q?gr=C3=ABp_=28rhymes_with_creep=29_and_cptmp?=
Topicbox-Message-UUID: a541c2d2-ead5-11e9-9d60-3106f5b1d025

On Mon Nov 30 02:54:45 EST 2009, jason.catena@gmail.com wrote:
> Agreed. Part of grep's job is to be a regex engine, so I thought in
> general it would be okay to push it here.
>
> > i played with this a little bit, but quickly ran into problems.
>
> > "reasonable" re size limits of say 300 characters
> > just don't work if you're doing expansion.  expanding "cooperate"
> > results in a 460-byte string!
>
> Where does this 300-character limit come from? If you code them by

dict. i used unfold (/n/sources/contrib/quanstro/runetype/unfold.c).

; unfold cooperate | wc -rc
    199     454

it turns out that doing regular expressions is difficult, since
it's not clear to me what [a-z] should match when unfolded. on the
other hand, a folding-based approach makes the meaning of [a-z] clear.
it's a good argument for folding.

echo 'rhymes with grëep' |../grep/8.out -I 'gr[a-z]ep'
rhymes with grëep

- erik
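
The folding approach argued for above can be sketched roughly as follows. This is only an illustration in Python, not the code behind grep -I or unfold.c: the fold() helper is hypothetical, and it assumes that folding means Unicode NFD decomposition, dropping combining marks, and lowercasing. The real fold tables may well differ. The point it shows is that the input is folded and the pattern is left alone, so [a-z] keeps its ordinary meaning.

```python
import re
import unicodedata


def fold(s):
    """Hypothetical fold: decompose, drop combining marks, lowercase.

    Assumption: this stands in for whatever fold table the real tool uses.
    """
    decomposed = unicodedata.normalize("NFD", s)
    stripped = "".join(c for c in decomposed if not unicodedata.combining(c))
    return stripped.lower()


def folded_search(pattern, line):
    """Match the unmodified pattern against the folded line."""
    return re.search(pattern, fold(line)) is not None


line = "rhymes with grëep"
print(fold(line))                                # folded form of the input
print(folded_search("gr[a-z]ep", line))          # pattern applied after folding
print(re.search("gr[a-z]ep", line) is not None)  # same pattern, no folding
```

With folding, the regular expression stays the size the user typed, whereas expansion rewrites every literal into its set of variants, which is where the growth seen for "cooperate" comes from. A fuller version would presumably fold the literals in the pattern as well; that step is omitted here.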