From mboxrd@z Thu Jan  1 00:00:00 1970
MIME-Version: 1.0
In-Reply-To: <82380754-4ab1-4c31-b696-0b9c604ec2c9@googlegroups.com>
References: <CAOCRf5U1-fgQ5aRqW1dj0fPE_a0yfpWNnORu4M5j8H6rT4MzNQ@mail.gmail.com>
	<82380754-4ab1-4c31-b696-0b9c604ec2c9@googlegroups.com>
Date: Thu,  4 Apr 2013 14:16:10 +0200
Message-ID: <CAOCRf5UT7ey57neHbYCt4aswRm9WO3CqwBAsEEnSJgHjYn5Ypw@mail.gmail.com>
From: =?UTF-8?B?QmVuY2UgRsOhYmnDoW4=?= <begnoc@gmail.com>
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
Content-Type: multipart/alternative; boundary=14dae9340ee796118b04d987ef87
Subject: Re: [9fans] Acme Edit scriptlets
Topicbox-Message-UUID: 3d6d237e-ead8-11e9-9d60-3106f5b1d025

--14dae9340ee796118b04d987ef87
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Cool.


Here's a script i use to generate case
insensitive regexes. It turns

FooBar

into

[Ff][Oo][Oo][Bb][Aa][Rr]

term% cat /bin/uncase
#!/bin/rc

exec awk '{
lower =3D tolower($0)
upper =3D toupper($0)
len =3D length($0)

for( i =3D 1 ; i <=3D len ; i++ )
printf "[" substr(upper, i, 1) substr(lower, i, 1) "]"
printf "\n"
}'




2013/4/4 Mark van Atten <vanattenmark@gmail.com>

> On Friday, 29 March 2013 01:38:06 UTC+1, Bence F=C3=A1bi=C3=A1n  wrote:
>
> > I did a quick writeup on little Edit scripts
>
> Many thanks, this thread is very useful.
>
> There is also Jason Catena's list of Edit idioms at
> https://raw.github.com/catenate/acme-fonts/master/test/1/acme/Edit/sam
>
> When editing and re-editing latex, I regularly pipe selections
> through a simple-minded script called `chunk' which does most of
> the work for obtaining semantic linebreaks. That goes back to a
> recommendation by Kernighan in his paper `Unix for beginners' of
> 1974; see the quotation, comments and link at [1].
>
>
>
> #!/usr/local/plan9/bin/rc
> # chunk up (to prepare) for semantic linebreaks
>
> # do  not break within \cite
> # do not break within $$ math
> # break after closing parentheses ),]
> # break before an opening parentheses (,[
>
> ssam -e 'x/(^[^%].+\n)+/  y/\\cite[^{]*{(\n|.)*}/ y/\$.*\$/
> x/(([^A-Z]\.)|[,;:!?]|\)|\]) | (\(|\[)/ s/ /\n/' \ | 9 fmt -w 60
> -j
>
>
> For batch processing probably something more sophisticated would
> be needed to leave various environments unchunked. But I don't use
> it that way, and just apply it to selections where I know its use
> makes sense. Usually these are areas where I have just been doing
> a lot of rewriting.
>
> There's no point in chunking up commented material, and sometimes
> it is actually convenient to have a place where I can keep things
> unchunked for reference.
>
> The original chunk command in Writer's Workbench [2], for troff not
> latex, was  based on a parser for English, I think. I find I don't
> want that (because I write in other languages as well), and that
> even in English I don't need it (because the chunking based on
> interpunction is always fine with me, and where I care about the
> remaining cases, I prefer to do it myself; but see [3]).
>
> Mark.
>
>
> [1] http://rhodesmill.org/brandon/2012/one-sentence-per-line/
>
> [2] http://man.cat-v.org/unix_WWB/1/chunk
>
> [3] https://github.com/waldir/semantic-linebreaker
>
>

--14dae9340ee796118b04d987ef87
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Cool.<div><br></div><div><br></div><div style>Here&#39;s a=
 script i use to generate case</div><div style>insensitive regexes. It turn=
s</div><div style><br></div><div style>FooBar</div><div style><br></div><di=
v style>
into</div><div style><br></div><div>[Ff][Oo][Oo][Bb][Aa][Rr]<br></div><div>=
<br></div><div style>term% cat /bin/uncase</div><div><div><div>#!/bin/rc</d=
iv><div><br></div><div>exec awk &#39;{</div><div><span class=3D"" style=3D"=
white-space:pre">	</span>lower =3D tolower($0)</div>
<div><span class=3D"" style=3D"white-space:pre">	</span>upper =3D toupper($=
0)</div><div><span class=3D"" style=3D"white-space:pre">	</span>len =3D len=
gth($0)</div><div><br></div><div><span class=3D"" style=3D"white-space:pre"=
>	</span>for( i =3D 1 ; i &lt;=3D len ; i++ )</div>
<div><span class=3D"" style=3D"white-space:pre">		</span>printf &quot;[&quo=
t; substr(upper, i, 1) substr(lower, i, 1) &quot;]&quot;</div><div><span cl=
ass=3D"" style=3D"white-space:pre">	</span>printf &quot;\n&quot;</div><div>=
}&#39;</div>
</div><div><br></div><div><br></div></div><div class=3D"gmail_extra"><br><b=
r><div class=3D"gmail_quote">2013/4/4 Mark van Atten <span dir=3D"ltr">&lt;=
<a href=3D"mailto:vanattenmark@gmail.com" target=3D"_blank">vanattenmark@gm=
ail.com</a>&gt;</span><br>
<blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-=
left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;p=
adding-left:1ex"><div class=3D"im">On Friday, 29 March 2013 01:38:06 UTC+1,=
 Bence F=C3=A1bi=C3=A1n =C2=A0wrote:<br>

<br>
&gt; I did a quick writeup on little Edit scripts<br>
<br>
</div>Many thanks, this thread is very useful.<br>
<br>
There is also Jason Catena&#39;s list of Edit idioms at<br>
<a href=3D"https://raw.github.com/catenate/acme-fonts/master/test/1/acme/Ed=
it/sam" target=3D"_blank">https://raw.github.com/catenate/acme-fonts/master=
/test/1/acme/Edit/sam</a><br>
<br>
When editing and re-editing latex, I regularly pipe selections<br>
through a simple-minded script called `chunk&#39; which does most of<br>
the work for obtaining semantic linebreaks. That goes back to a<br>
recommendation by Kernighan in his paper `Unix for beginners&#39; of<br>
1974; see the quotation, comments and link at [1].<br>
<br>
<br>
<br>
#!/usr/local/plan9/bin/rc<br>
# chunk up (to prepare) for semantic linebreaks<br>
<br>
# do =C2=A0not break within \cite<br>
# do not break within $$ math<br>
# break after closing parentheses ),]<br>
# break before an opening parentheses (,[<br>
<br>
ssam -e &#39;x/(^[^%].+\n)+/ =C2=A0y/\\cite[^{]*{(\n|.)*}/ y/\$.*\$/<br>
x/(([^A-Z]\.)|[,;:!?]|\)|\]) | (\(|\[)/ s/ /\n/&#39; \ | 9 fmt -w 60<br>
-j<br>
<br>
<br>
For batch processing probably something more sophisticated would<br>
be needed to leave various environments unchunked. But I don&#39;t use<br>
it that way, and just apply it to selections where I know its use<br>
makes sense. Usually these are areas where I have just been doing<br>
a lot of rewriting.<br>
<br>
There&#39;s no point in chunking up commented material, and sometimes<br>
it is actually convenient to have a place where I can keep things<br>
unchunked for reference.<br>
<br>
The original chunk command in Writer&#39;s Workbench [2], for troff not<br>
latex, was =C2=A0based on a parser for English, I think. I find I don&#39;t=
<br>
want that (because I write in other languages as well), and that<br>
even in English I don&#39;t need it (because the chunking based on<br>
interpunction is always fine with me, and where I care about the<br>
remaining cases, I prefer to do it myself; but see [3]).<br>
<br>
Mark.<br>
<br>
<br>
[1] <a href=3D"http://rhodesmill.org/brandon/2012/one-sentence-per-line/" t=
arget=3D"_blank">http://rhodesmill.org/brandon/2012/one-sentence-per-line/<=
/a><br>
<br>
[2] <a href=3D"http://man.cat-v.org/unix_WWB/1/chunk" target=3D"_blank">htt=
p://man.cat-v.org/unix_WWB/1/chunk</a><br>
<br>
[3] <a href=3D"https://github.com/waldir/semantic-linebreaker" target=3D"_b=
lank">https://github.com/waldir/semantic-linebreaker</a><br>
<br>
</blockquote></div><br></div></div>

--14dae9340ee796118b04d987ef87--