[9fans] sed question (OT)

9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed

* [9fans] sed question (OT)
@ 2009-10-29 15:41 Steve Simon
  2009-10-29 16:06 ` Lorenzo Bolla
                   ` (7 more replies)
  0 siblings, 8 replies; 18+ messages in thread
From: Steve Simon @ 2009-10-29 15:41 UTC (permalink / raw)
  To: 9fans

Sorry, not really the place for such questions but...

I always struggle with sed, awk is easy but sed makes my head hurt.

I am trying to capitalise the first tow words on each line (I could use awk
as well but I have to use sed so it seems churlish to start another process).

capitalising the first word on the line is easy enough:

			h
			s/^(.).*/\1/
			y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
			x
			s/^.(.*)/\1/
			x
			G
			s/\n//

Though there maye be a much easier/more elegant way to do this,
but for the 2nd word it gets much harder.

What I really want is sam's ability to select a letter and operate on it
rather than everything being line based as sed seems to be.

any neat solutions? (extra points awarded for use of the branch operator :-)

-Steve

^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
@ 2009-10-29 16:06 ` Lorenzo Bolla
  2009-10-29 16:33   ` Iruata Souza
  2009-10-29 16:09 ` W B Hacker
                   ` (6 subsequent siblings)
  7 siblings, 1 reply; 18+ messages in thread
From: Lorenzo Bolla @ 2009-10-29 16:06 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

[-- Attachment #1: Type: text/plain, Size: 1180 bytes --]

To capitalize the first letter of each line wouldn't this be enough?

s/^./\u&/

L.


On Thu, Oct 29, 2009 at 3:41 PM, Steve Simon <steve@quintile.net> wrote:

> Sorry, not really the place for such questions but...
>
> I always struggle with sed, awk is easy but sed makes my head hurt.
>
> I am trying to capitalise the first tow words on each line (I could use awk
> as well but I have to use sed so it seems churlish to start another
> process).
>
> capitalising the first word on the line is easy enough:
>
>                        h
>                        s/^(.).*/\1/
>
>  y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
>                        x
>                        s/^.(.*)/\1/
>                        x
>                        G
>                        s/\n//
>
> Though there maye be a much easier/more elegant way to do this,
> but for the 2nd word it gets much harder.
>
> What I really want is sam's ability to select a letter and operate on it
> rather than everything being line based as sed seems to be.
>
> any neat solutions? (extra points awarded for use of the branch operator
> :-)
>
> -Steve
>
>

[-- Attachment #2: Type: text/html, Size: 1589 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
  2009-10-29 16:06 ` Lorenzo Bolla
@ 2009-10-29 16:09 ` W B Hacker
  2009-10-29 18:52 ` Jason Catena
                   ` (5 subsequent siblings)
  7 siblings, 0 replies; 18+ messages in thread
From: W B Hacker @ 2009-10-29 16:09 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

Steve Simon wrote:
> Sorry, not really the place for such questions but...
>
> I always struggle with sed, awk is easy but sed makes my head hurt.
>
> I am trying to capitalise the first tow words on each line (I could use awk
> as well but I have to use sed so it seems churlish to start another process).
>
> capitalising the first word on the line is easy enough:
>
> 			h
> 			s/^(.).*/\1/
> 			y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
> 			x
> 			s/^.(.*)/\1/
> 			x
> 			G
> 			s/\n//
>
> Though there maye be a much easier/more elegant way to do this,
> but for the 2nd word it gets much harder.
>
> What I really want is sam's ability to select a letter and operate on it
> rather than everything being line based as sed seems to be.
>
> any neat solutions? (extra points awarded for use of the branch operator :-)
>
> -Steve
>
>

I'd be sore tempted to move the needful files into an environment where I could
use multiple passes of 'rpl' (or 'back in the day' BRIEF).

BFBI .. far less capable tools, perhaps - BUT by the time you've figured out how
to even *tell* awk or sed what to do, I'm working on some other task...

'If at first you don't succeed - cheat'

YMMV,

Bill



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 16:06 ` Lorenzo Bolla
@ 2009-10-29 16:33   ` Iruata Souza
  2009-10-29 16:42     ` Lorenzo Bolla
  0 siblings, 1 reply; 18+ messages in thread
From: Iruata Souza @ 2009-10-29 16:33 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

On Thu, Oct 29, 2009 at 2:06 PM, Lorenzo Bolla <lbolla@gmail.com> wrote:
> To capitalize the first letter of each line wouldn't this be enough?
> s/^./\u&/
>
> L.

% echo rwrong | sed 's/^./\u&/'
urwrong



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 16:33   ` Iruata Souza
@ 2009-10-29 16:42     ` Lorenzo Bolla
  0 siblings, 0 replies; 18+ messages in thread
From: Lorenzo Bolla @ 2009-10-29 16:42 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

[-- Attachment #1: Type: text/plain, Size: 374 bytes --]

I forgot the "9".
This works for GNU sed version 4.2.1
L.

On Thu, Oct 29, 2009 at 4:33 PM, Iruata Souza <iru.muzgo@gmail.com> wrote:

> On Thu, Oct 29, 2009 at 2:06 PM, Lorenzo Bolla <lbolla@gmail.com> wrote:
> > To capitalize the first letter of each line wouldn't this be enough?
> > s/^./\u&/
> >
> > L.
>
> % echo rwrong | sed 's/^./\u&/'
> urwrong
>
>

[-- Attachment #2: Type: text/html, Size: 755 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
  2009-10-29 16:06 ` Lorenzo Bolla
  2009-10-29 16:09 ` W B Hacker
@ 2009-10-29 18:52 ` Jason Catena
  2009-10-30 13:35 ` Eris Discordia
                   ` (4 subsequent siblings)
  7 siblings, 0 replies; 18+ messages in thread
From: Jason Catena @ 2009-10-29 18:52 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

> Sorry, not really the place for such questions but...

Try stackoverflow.com.  They delight in problems such as these.

> I am trying to capitalise the first tow words on each line

I store the original line with h, and then pull it back out repeatedly
with G to mangle it.
I got far enough to translate "first second ..." to "First s" with this:

h
s/^(.).*/\1/
y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
G
s/^.([^ ]+ ).*/\1/
s/^.([^ ]+)$/\1/
G
s/^.[^ ]+ (.).*/\1/
#y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
#3y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
s/\n//g

There's a couple problems.  (1) It doesn't handle the case with only
one word on a line, because it's hard to tell, later on, that I pulled
out the single word once already. (2) I'd like to put in one of the
commented-out y commands, but (2a) the first uppercases the entire
pattern space, and (2b) the second refers to line 3 of the entire
file, not line 3 of the pattern space.

> -Steve

Jason Catena

^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
                   ` (2 preceding siblings ...)
  2009-10-29 18:52 ` Jason Catena
@ 2009-10-30 13:35 ` Eris Discordia
  2009-10-30 13:39 ` Eris Discordia
                   ` (3 subsequent siblings)
  7 siblings, 0 replies; 18+ messages in thread
From: Eris Discordia @ 2009-10-30 13:35 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

Listing of file 'sedscr:'

> s/^/ /;
> s/$/aAbBcCdDeEfFgGhHiIjJkKlLmMnNoOpPqQrRsStTuUvVwWxXyYzZ/;
> s/ \([a-z]\)\(.*\1\)\(.\)/ \3\2\3/;
> s/ \([a-z]\)\(.*\1\)\(.\)/ \3\2\3/;
> s/.\{52\}$//;
> s/ //;

$ echo This is a test | sed -f sedscr
This Is a test
$ echo someone forgot to capitalize | sed -f sedscr
Someone Forgot to capitalize

This works with '/usr/bin/sed' from a FreeBSD 6.2-RELEASE installation.

Above sed script stolen from:

<http://dervish.wsisiz.edu.pl/~bse26236/batutil/help/sed/CAPITALI.HTM>

With a minor change: first three words to first two words.




--On Thursday, October 29, 2009 15:41 +0000 Steve Simon
<steve@quintile.net> wrote:

> Sorry, not really the place for such questions but...
>
> I always struggle with sed, awk is easy but sed makes my head hurt.
>
> I am trying to capitalise the first tow words on each line (I could use
> awk as well but I have to use sed so it seems churlish to start another
> process).
>
> capitalising the first word on the line is easy enough:
>
> 			h
> 			s/^(.).*/\1/
> 			y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
> 			x
> 			s/^.(.*)/\1/
> 			x
> 			G
> 			s/\n//
>
> Though there maye be a much easier/more elegant way to do this,
> but for the 2nd word it gets much harder.
>
> What I really want is sam's ability to select a letter and operate on it
> rather than everything being line based as sed seems to be.
>
> any neat solutions? (extra points awarded for use of the branch operator
> :-)
>
> -Steve
>







^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
                   ` (3 preceding siblings ...)
  2009-10-30 13:35 ` Eris Discordia
@ 2009-10-30 13:39 ` Eris Discordia
  2009-10-30 17:30   ` W B Hacker
  2009-10-30 15:29 ` [9fans] sed question (OT) dave.l
                   ` (2 subsequent siblings)
  7 siblings, 1 reply; 18+ messages in thread
From: Eris Discordia @ 2009-10-30 13:39 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

The script has a small "bug" one might say: it capitalizes the first two
words on a line that are _not_ already capitalized. If one of the first two
words is capitalized then the third will get capitalized.

--On Thursday, October 29, 2009 15:41 +0000 Steve Simon
<steve@quintile.net> wrote:

> Sorry, not really the place for such questions but...
>
> I always struggle with sed, awk is easy but sed makes my head hurt.
>
> I am trying to capitalise the first tow words on each line (I could use
> awk as well but I have to use sed so it seems churlish to start another
> process).
>
> capitalising the first word on the line is easy enough:
>
> 			h
> 			s/^(.).*/\1/
> 			y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
> 			x
> 			s/^.(.*)/\1/
> 			x
> 			G
> 			s/\n//
>
> Though there maye be a much easier/more elegant way to do this,
> but for the 2nd word it gets much harder.
>
> What I really want is sam's ability to select a letter and operate on it
> rather than everything being line based as sed seems to be.
>
> any neat solutions? (extra points awarded for use of the branch operator
> :-)
>
> -Steve
>







^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
                   ` (4 preceding siblings ...)
  2009-10-30 13:39 ` Eris Discordia
@ 2009-10-30 15:29 ` dave.l
  2009-10-30 20:53 ` Noah Evans
  2009-11-11 12:32 ` frankg
  7 siblings, 0 replies; 18+ messages in thread
From: dave.l @ 2009-10-30 15:29 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

You can do it, definitely.

Caveat: I'm in bed with a virus and the brain's on impulse power
so these are untested and may be highly suboptimal.

Is the input guaranteed to have 2 words on each line?
What are your definitions of words and blanks?

I know from your snippet that there's no leading blanks and no empty
lines.

Assuming there are 2 words on every line, something like:
h
s/[A-Za-z0-9_-]+(.).*/\1/
y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
G
s/(.)\n([A-Za-z0-9_-]+).(.*)/\2\1\3/

ought to roughly work after your fragment.

If >= 2 words per line isn't assumed:
h
t urnofflag
: urnofflag
s/[A-Za-z0-9_-]+[^ A-Za-z0-9_-]*(.).*/\1/
t for2
b cosnot2wds
: for2
y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
G
s/(.)\n([A-Za-z0-9_-]+[^ A-Za-z0-9_-]*).(.*)/\2\1\3/
b
: cosnot2wds
g

Bizarrely, within it's limitations (\n, \0, size limits), sed is, in
some sense, complete,
since you can store any number of things in the spaces (using  /(.*
\n)/ etc.) and branch conditionally.

Another insane possibility, since there are only 26 variations, is to
do:
	s/^a/A/
	s/^([A-Z][A-Za-z0-9]+[^ A-Za-z0-9_-]*)a/\1A/
	s/^b/B/
	s/^([A-Z][A-Za-z0-9]+[^ A-Za-z0-9_-]*)b/\1B/

You can of course, use sed to create the above script like so:
	echo abcdefghijklmnopqrstuvwxyz | sed ...
Filling in the ellipses is left as an exercise for the already addled
reader.

BTW: if you're shovelling a lot of this kind of muck,
it may, paradoxically, be easier to do it on the command line and use
your shell's variables for the repeated bits of regexps, commands etc.
The only caveats are that this technique will curdle your brain even
more than sed already does
and it may, oddly, be the exception to the rule that rc is more
elegant than sh, due to caret vs. double-quotes.

Apologies for grandstanding, but I used to do this sort of stuff for a
living.
I wrote a piece of training courseware for sed once which had far
worse excesses than the above as examples.
RFC-822 header-reassembly anyone?

I also used to get my intellectual rocks off on stuff like this until
I finally grew up (in my late 40s).

Dave.

SEE ALSO
	teco, assembler, qed.

On 29 Oct 2009, at 15:41, Steve Simon wrote:

> Sorry, not really the place for such questions but...
>
> I always struggle with sed, awk is easy but sed makes my head hurt.
>
> I am trying to capitalise the first tow words on each line (I could
> use awk
> as well but I have to use sed so it seems churlish to start another
> process).
>
> capitalising the first word on the line is easy enough:
>
> 			h
> 			s/^(.).*/\1/
> 			y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
> 			x
> 			s/^.(.*)/\1/
> 			x
> 			G
> 			s/\n//
>
> Though there maye be a much easier/more elegant way to do this,
> but for the 2nd word it gets much harder.
>
> What I really want is sam's ability to select a letter and operate
> on it
> rather than everything being line based as sed seems to be.
>
> any neat solutions? (extra points awarded for use of the branch
> operator :-)
>
> -Steve
>

^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-30 13:39 ` Eris Discordia
@ 2009-10-30 17:30   ` W B Hacker
  2009-10-30 17:39     ` [9fans] sed question (OT) (OT) (OT) Tim Newsham
  0 siblings, 1 reply; 18+ messages in thread
From: W B Hacker @ 2009-10-30 17:30 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

Eris Discordia wrote:
> The script has a small "bug" one might say: it capitalizes the first two
> words on a line that are _not_ already capitalized. If one of the first
> two words is capitalized then the third will get capitalized.

Call me a Dinosaur, but - so long as it is ASCII or EBCDIC it is relatively
trivial to implement that in hardware AND NOT have the issue of altering any but
the first two words AND NOT have issues where there is only one word or a
numeral or punctuation or hidden/control character rather than alpha.

Hint: Among other simple stuff, needs XOR capability.

'Dinosaur' 'coz the last time I did one of the key portions of it was converting
a Data Printer CT-1064 chaintrain from HP-3000 MKIII use to work with an S-100
Z-80. That capitalized *every* alpha character, but took just two 74-series IC's
to replace a pair of lookup-table PROMS.

One would need to add logic to detect space or newline, set/unset a few latches
- not a lot more.

Could have built it in less time than this thread has been running...

;-)


Bill
>
> --On Thursday, October 29, 2009 15:41 +0000 Steve Simon
> <steve@quintile.net> wrote:
>
>> Sorry, not really the place for such questions but...
>>
>> I always struggle with sed, awk is easy but sed makes my head hurt.
>>
>> I am trying to capitalise the first tow words on each line (I could use
>> awk as well but I have to use sed so it seems churlish to start another
>> process).
>>
>> capitalising the first word on the line is easy enough:
>>
>>             h
>>             s/^(.).*/\1/
>>             y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
>>             x
>>             s/^.(.*)/\1/
>>             x
>>             G
>>             s/\n//
>>
>> Though there maye be a much easier/more elegant way to do this,
>> but for the 2nd word it gets much harder.
>>
>> What I really want is sam's ability to select a letter and operate on it
>> rather than everything being line based as sed seems to be.
>>
>> any neat solutions? (extra points awarded for use of the branch operator
>> :-)
>>
>> -Steve
>>
>
>
>
>
>
>




^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT) (OT) (OT)
  2009-10-30 17:30   ` W B Hacker
@ 2009-10-30 17:39     ` Tim Newsham
  2009-10-30 18:14       ` [9fans] sed question (OT) (OT) (OT) (OT) (OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT) W B Hacker
  0 siblings, 1 reply; 18+ messages in thread
From: Tim Newsham @ 2009-10-30 17:39 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

> Call me a Dinosaur, but - so long as it is ASCII or EBCDIC it is relatively
> trivial to implement that in hardware AND NOT have the issue of altering any
> but the first two words AND NOT have issues where there is only one word or a
> numeral or punctuation or hidden/control character rather than alpha.

You should have added an extra "(OT)" to the subject line.
I'm adding a few more just to be fair.

> Could have built it in less time than this thread has been running...

then what have you been doing all this time?

> Bill

Tim Newsham
http://www.thenewsh.com/~newsham/



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT) (OT) (OT) (OT) (OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT)
  2009-10-30 17:39     ` [9fans] sed question (OT) (OT) (OT) Tim Newsham
@ 2009-10-30 18:14       ` W B Hacker
  0 siblings, 0 replies; 18+ messages in thread
From: W B Hacker @ 2009-10-30 18:14 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

Tim Newsham wrote:
>> Call me a Dinosaur, but - so long as it is ASCII or EBCDIC it is
>> relatively trivial to implement that in hardware AND NOT have the
>> issue of altering any but the first two words AND NOT have issues
>> where there is only one word or a numeral or punctuation or
>> hidden/control character rather than alpha.
>
> You should have added an extra "(OT)" to the subject line.
> I'm adding a few more just to be fair.
>
>> Could have built it in less time than this thread has been running...
>
> then what have you been doing all this time?
>
>> Bill
>
> Tim Newsham
> http://www.thenewsh.com/~newsham/
>
>

Honestly?

Trying to determine what a valid USE for capitalizing exactly the first 'n'
words on a line might be.

Especially as it calls for ONE or TWO but never THREE or more.

Document 'sideheads', maybe??

- but those may not be limited to 2 words.

The need is as puzzling as some of the solutions..

Bill



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
                   ` (5 preceding siblings ...)
  2009-10-30 15:29 ` [9fans] sed question (OT) dave.l
@ 2009-10-30 20:53 ` Noah Evans
  2009-11-11 12:32 ` frankg
  7 siblings, 0 replies; 18+ messages in thread
From: Noah Evans @ 2009-10-30 20:53 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

This kind of problem is character processing, which I would argue is
C's domain. You can massage awk and sed to do the job for you, but at
least for me it's conceptually simpler to just bang out the following
C program:

#include <u.h>
#include <libc.h>
#include <bio.h>

#define	isupper(r)	(L'A' <= (r) && (r) <= L'Z')
#define	islower(r)	(L'a' <= (r) && (r) <= L'z')
#define	isalpha(r)	(isupper(r) || islower(r))
#define	isspace(r)	((r) == L' ' || (r) == L'\t' \
			|| (0x0A <= (r) && (r) <= 0x0D))
#define	toupper(r)	((r)-'a'+'A')

void
usage(char *me)
{
	fprint(2, "%s: usage\n", me);
}

void
main(int argc, char **argv)
{
	Biobuf in, out;
	int c, waswhite, nwords;

	ARGBEGIN{
	default:
		usage(argv[0]);
	}ARGEND;
	Binit(&in, 0, OREAD);
	Binit(&out, 1, OWRITE);
	
	waswhite = 0;
	nwords = 0;
	while((c = Bgetc(&in)) != Beof){
		if(isalpha(c))
		if(waswhite)
		if(nwords < 2){
			if(islower(c))
				c = toupper(c);
			nwords++;
		}
		if(isspace(c))
			waswhite = 1;
		else
			waswhite = 0;
		if(c == '\n')
			nwords = 0;
		Bputc(&out, c);
	}
	exits(0);
}

Noah


On Thu, Oct 29, 2009 at 4:41 PM, Steve Simon <steve@quintile.net> wrote:
> Sorry, not really the place for such questions but...
>
> I always struggle with sed, awk is easy but sed makes my head hurt.
>
> I am trying to capitalise the first tow words on each line (I could use awk
> as well but I have to use sed so it seems churlish to start another process).
>
> capitalising the first word on the line is easy enough:
>
>                        h
>                        s/^(.).*/\1/
>                        y/abcdefghijklmnopqrstuvwxyz/ABCDEFGHIJKLMNOPQRSTUVWXYZ/
>                        x
>                        s/^.(.*)/\1/
>                        x
>                        G
>                        s/\n//
>
> Though there maye be a much easier/more elegant way to do this,
> but for the 2nd word it gets much harder.
>
> What I really want is sam's ability to select a letter and operate on it
> rather than everything being line based as sed seems to be.
>
> any neat solutions? (extra points awarded for use of the branch operator :-)
>
> -Steve
>
>



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
                   ` (6 preceding siblings ...)
  2009-10-30 20:53 ` Noah Evans
@ 2009-11-11 12:32 ` frankg
  7 siblings, 0 replies; 18+ messages in thread
From: frankg @ 2009-11-11 12:32 UTC (permalink / raw)
  To: 9fans

On Oct 30, 12:58Â pm, noah.ev...@gmail.com (Noah Evans) wrote:
> This kind of problem is character processing, which I would argue is
> C's domain. You can massage awk and sed to do the job for you, but at
> least for me it's conceptually simpler to just bang out the following
> C program:
>
> #include <u.h>
> #include <libc.h>
> #include <bio.h>
>
> #define isupper(r) Â  Â  Â (L'A' <= (r) && (r) <= L'Z')
> #define islower(r) Â  Â  Â (L'a' <= (r) && (r) <= L'z')
> #define isalpha(r) Â  Â  Â (isupper(r) || islower(r))
> #define isspace(r) Â  Â  Â ((r) == L' ' || (r) == L'\t' \
> Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  || (0x0A <= (r) && (r) <= 0x0D))
> #define toupper(r) Â  Â  Â ((r)-'a'+'A')
>
> void
> usage(char *me)
> {
> Â  Â  Â  Â  fprint(2, "%s: usage\n", me);
>
> }
>
> void
> main(int argc, char **argv)
> {
> Â  Â  Â  Â  Biobuf in, out;
> Â  Â  Â  Â  int c, waswhite, nwords;
>
> Â  Â  Â  Â  ARGBEGIN{
> Â  Â  Â  Â  default:
> Â  Â  Â  Â  Â  Â  Â  Â  usage(argv[0]);
> Â  Â  Â  Â  }ARGEND;
> Â  Â  Â  Â  Binit(&in, 0, OREAD);
> Â  Â  Â  Â  Binit(&out, 1, OWRITE);
>
> Â  Â  Â  Â  waswhite = 0;
> Â  Â  Â  Â  nwords = 0;
> Â  Â  Â  Â  while((c = Bgetc(&in)) != Beof){
> Â  Â  Â  Â  Â  Â  Â  Â  if(isalpha(c))
> Â  Â  Â  Â  Â  Â  Â  Â  if(waswhite)
> Â  Â  Â  Â  Â  Â  Â  Â  if(nwords < 2){
> Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  if(islower(c))
> Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  c = toupper(c);
> Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  nwords++;
> Â  Â  Â  Â  Â  Â  Â  Â  }
> Â  Â  Â  Â  Â  Â  Â  Â  if(isspace(c))
> Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  waswhite = 1;
> Â  Â  Â  Â  Â  Â  Â  Â  else
> Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  waswhite = 0;
> Â  Â  Â  Â  Â  Â  Â  Â  if(c == '\n')
> Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  nwords = 0;
> Â  Â  Â  Â  Â  Â  Â  Â  Bputc(&out, c);
> Â  Â  Â  Â  }
> Â  Â  Â  Â  exits(0);
>
> }
>
> Noah
>

Simple, and wrong. You need to initialize waswhite to 1, not 0.



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
       [not found] <<A3AADD7F-E09D-49F9-8A5B-3D6B720046A4@mac.com>
@ 2009-10-30 16:16 ` erik quanstrom
  0 siblings, 0 replies; 18+ messages in thread
From: erik quanstrom @ 2009-10-30 16:16 UTC (permalink / raw)
  To: 9fans

On Fri Oct 30 11:31:24 EDT 2009, dave.l@mac.com wrote:
> You can do it, definitely.
>

well played!

- erik



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
       [not found] <<d1c554290910290929p3980a256hf075042ca3a3917b@mail.gmail.com>
@ 2009-10-29 16:31 ` erik quanstrom
  0 siblings, 0 replies; 18+ messages in thread
From: erik quanstrom @ 2009-10-29 16:31 UTC (permalink / raw)
  To: 9fans

On Thu Oct 29 12:31:23 EDT 2009, iru.muzgo@gmail.com wrote:
> On Thu, Oct 29, 2009 at 2:08 PM, erik quanstrom <quanstro@quanstro.net> wrote:
> >> To capitalize the first letter of each line wouldn't this be enough?
> >>
> >> s/^./\u&/
> >
> > ; echo abc def | sed 's/^.\u&/'
> > sed: s command garbled: s/^.\u&/
> >
>
>  i guess you missed the second slash
>

now it is less helpful:

; echo abc def | sed 's/^./\u&/'
uabc def

- erik



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
  2009-10-29 16:08 ` erik quanstrom
@ 2009-10-29 16:29   ` Iruata Souza
  0 siblings, 0 replies; 18+ messages in thread
From: Iruata Souza @ 2009-10-29 16:29 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

On Thu, Oct 29, 2009 at 2:08 PM, erik quanstrom <quanstro@quanstro.net> wrote:
>> To capitalize the first letter of each line wouldn't this be enough?
>>
>> s/^./\u&/
>
> ; echo abc def | sed 's/^.\u&/'
> sed: s command garbled: s/^.\u&/
>

 i guess you missed the second slash



^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [9fans] sed question (OT)
       [not found] <<80c99e790910290906t36766978kcd38c9583392e038@mail.gmail.com>
@ 2009-10-29 16:08 ` erik quanstrom
  2009-10-29 16:29   ` Iruata Souza
  0 siblings, 1 reply; 18+ messages in thread
From: erik quanstrom @ 2009-10-29 16:08 UTC (permalink / raw)
  To: 9fans

> To capitalize the first letter of each line wouldn't this be enough?
>
> s/^./\u&/

; echo abc def | sed 's/^.\u&/'
sed: s command garbled: s/^.\u&/

- erik



^ permalink raw reply	[flat|nested] 18+ messages in thread

end of thread, other threads:[~2009-11-11 12:32 UTC | newest]

Thread overview: 18+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-10-29 15:41 [9fans] sed question (OT) Steve Simon
2009-10-29 16:06 ` Lorenzo Bolla
2009-10-29 16:33   ` Iruata Souza
2009-10-29 16:42     ` Lorenzo Bolla
2009-10-29 16:09 ` W B Hacker
2009-10-29 18:52 ` Jason Catena
2009-10-30 13:35 ` Eris Discordia
2009-10-30 13:39 ` Eris Discordia
2009-10-30 17:30   ` W B Hacker
2009-10-30 17:39     ` [9fans] sed question (OT) (OT) (OT) Tim Newsham
2009-10-30 18:14       ` [9fans] sed question (OT) (OT) (OT) (OT) (OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT)(OT) W B Hacker
2009-10-30 15:29 ` [9fans] sed question (OT) dave.l
2009-10-30 20:53 ` Noah Evans
2009-11-11 12:32 ` frankg
     [not found] <<80c99e790910290906t36766978kcd38c9583392e038@mail.gmail.com>
2009-10-29 16:08 ` erik quanstrom
2009-10-29 16:29   ` Iruata Souza
     [not found] <<d1c554290910290929p3980a256hf075042ca3a3917b@mail.gmail.com>
2009-10-29 16:31 ` erik quanstrom
     [not found] <<A3AADD7F-E09D-49F9-8A5B-3D6B720046A4@mac.com>
2009-10-30 16:16 ` erik quanstrom

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).