* [9fans] awk
@ 2002-11-07 6:46 Kenji Arisawa
2002-11-20 14:49 ` Kenji Arisawa
0 siblings, 1 reply; 13+ messages in thread
From: Kenji Arisawa @ 2002-11-07 6:46 UTC (permalink / raw)
To: 9fans
I tested some awk string functions to see whether
they handle UTF-8 text correctly.
Below is my test code:
#!/bin/rc
#
# Can awk function handle UTF strings ?
#
echo 'ベル:研究所' | awk '{
print $0 # ベル:研究所
print length($0) # 6
print index($0,":") # 3
print match($0,":.*"),RSTART, RLENGTH # 7 7 4
print substr($0,3) # :研究所
a=$0; sub(":.+", "alice", a); print a # ベルalice
}'
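The output is shown in the comment after `#' on each line.
Function `match' returns a byte position (RSTART is 7, the byte
offset of `:'), which is inconsistent with the others: index()
reports 3 and RLENGTH (4) counts characters. I believe this is a bug.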
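
For anyone who wants to check their own awk, here is a minimal sketch of a
self-test, written for a POSIX sh rather than rc. The function name
check_awk_match and its messages are made up for illustration; they are not
part of awk or of the test above. It does not care whether the awk counts
bytes or characters, only whether index(), match() and substr() agree with
each other about where `:' starts.

```shell
#!/bin/sh
# Hypothetical helper (not from the original post): report whether
# index(), match() and substr() use the same unit for positions.
check_awk_match() {
	printf '%s\n' "$1" | awk '{
		i = index($0, ":")      # position of ":" as index() counts it
		if (i == 0) {
			print "no colon in input"
			next
		}
		m = match($0, ":.*")    # also sets RSTART and RLENGTH
		# The matched text, cut out using the offsets match() reported.
		tail = substr($0, RSTART, RLENGTH)
		if (m == i && RSTART == i && tail == substr($0, i))
			print "consistent"
		else
			print "inconsistent: index=" i " RSTART=" RSTART " RLENGTH=" RLENGTH
	}'
}

check_awk_match 'ベル:研究所'
```

With the behaviour reported above (index 3, RSTART 7, RLENGTH 4) this should
take the "inconsistent" branch. An awk that counts bytes everywhere, or
characters everywhere, should take the "consistent" branch.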
Kenji Arisawa
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [9fans] awk
2002-11-07 6:46 [9fans] awk Kenji Arisawa
@ 2002-11-20 14:49 ` Kenji Arisawa
0 siblings, 0 replies; 13+ messages in thread
From: Kenji Arisawa @ 2002-11-20 14:49 UTC (permalink / raw)
To: 9fans
Hello,
I said:
> I tested some awk string functions to see whether
> they handle UTF-8 text correctly.
> Below is my test code:
> #!/bin/rc
> #
> # Can awk function handle UTF strings ?
> #
> echo 'ベル:研究所' | awk '{
> print $0 # ベル:研究所
> print length($0) # 6
> print index($0,":") # 3
> print match($0,":.*"),RSTART, RLENGTH # 7 7 4
> print substr($0,3) # :研究所
> a=$0; sub(":.+", "alice", a); print a # ベルalice
> }'
>
> Output is commented after `#' in each line.
> Function `match' returns a byte position that is inconsistent
> with the others. I believe this is a bug.
>
It seems this bug has been fixed in a recent update.
Thanks.
Kenji Arisawa
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2003-09-13 0:40 boyd
0 siblings, 0 replies; 13+ messages in thread
From: boyd @ 2003-09-13 0:40 UTC (permalink / raw)
To: 9fans
yes russ, the bwk and plan 9 sources are in 'sync':
brahma% pwd
/usr/boyd/src/awk/bwk
brahma% for (i in *.[chy]) { echo $i ; diff -b $i /n/sources/plan9/sys/src/cmd/awk | wc -l }
awk.h
73
awkgram.y
7
b.c
diff: cannot open /n/sources/plan9/sys/src/cmd/awk/b.c: '/n/sources/plan9/sys/src/cmd/awk/b.c' does not exist
0
lex.c
75
lib.c
120
main.c
37
maketab.c
38
missing95.c
diff: cannot open /n/sources/plan9/sys/src/cmd/awk/missing95.c: '.../missing95.c' does not exist
0
parse.c
14
proctab.c
4
proto.h
106
run.c
403
tran.c
106
ytab.c
diff: cannot open /n/sources/plan9/sys/src/cmd/awk/ytab.c: '/n/sources/plan9/sys/src/cmd/awk/ytab.c' does not exist
0
ytab.h
diff: cannot open /n/sources/plan9/sys/src/cmd/awk/ytab.h: '/n/sources/plan9/sys/src/cmd/awk/ytab.h' does not exist
0
QED
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [9fans] awk
@ 2002-11-07 9:56 okamoto
0 siblings, 0 replies; 13+ messages in thread
From: okamoto @ 2002-11-07 9:56 UTC (permalink / raw)
To: 9fans
I'm not insulting you, but...
As has been seen here recently, we seem to have few developers now.
Furthermore, this is an example of an application bug, and it is
closely tied to how consistently an application uses UTF-8.
Taking these facts into consideration, I think you had better report
a fix for it, because I believe you can do it. I suppose this is
not a serious bug, probably just something in the match function. I have
no idea how to fix it myself, though.
just my two cents,
Kenji
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [9fans] awk
2001-06-11 10:13 Laura Creighton
2001-06-11 12:27 ` Boyd Roberts
@ 2001-06-18 9:12 ` Rick O'shay
1 sibling, 0 replies; 13+ messages in thread
From: Rick O'shay @ 2001-06-18 9:12 UTC (permalink / raw)
To: 9fans
In article <200106111013.MAA14426@boris.cd.chalmers.se>, "Laura Creighton"
<lac@cd.chalmers.se> wrote:
> it is very cool, but these days I do all my old awk things in python.
> Laura
On linuces, you'll go back once you start using gawk-3.1.0. It has a very
highly abstracted, Plan-9zy syntactic model for socket programming. Once
gawk incorporates a multithreading model, I won't have to make any more
excuses for not wanting to know Perl.
-m
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [9fans] awk
2001-06-11 10:13 Laura Creighton
@ 2001-06-11 12:27 ` Boyd Roberts
2001-06-18 9:12 ` Rick O'shay
1 sibling, 0 replies; 13+ messages in thread
From: Boyd Roberts @ 2001-06-11 12:27 UTC (permalink / raw)
To: 9fans
i even persuaded it to 'parse' mime as one company's 'rat squad'
[les boeuf-carrotes] had to round up the odd scapegoat [tête de
turc].
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2001-06-11 10:13 Laura Creighton
2001-06-11 12:27 ` Boyd Roberts
2001-06-18 9:12 ` Rick O'shay
0 siblings, 2 replies; 13+ messages in thread
From: Laura Creighton @ 2001-06-11 10:13 UTC (permalink / raw)
To: matt; +Cc: lac, 9fans
it is very cool, but these days I do all my old awk things in python.
Laura
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2000-03-14 10:06 Arnold
0 siblings, 0 replies; 13+ messages in thread
From: Arnold @ 2000-03-14 10:06 UTC (permalink / raw)
In article <20000312225042.13487.qmail@g.bio.cse.psu.edu> you write:
>(Wow... I go on vacation, and the list springs to life.)
Hopefully NOT cause and effect, but one can never tell. :-)
Arnold
--
Arnold Robbins | If There's More Than One Way To Do It,
Guest account at Emory Math/CS | then why is there only one implementation
Email: arnold@skeeve.com | of perl?
| -- Arnold Robbins
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2000-03-13 6:25 Bengt
0 siblings, 0 replies; 13+ messages in thread
From: Bengt @ 2000-03-13 6:25 UTC (permalink / raw)
> From: bobf@plan9.bell-labs.com
> we are trying to port the new ksh, but it is a total
> mess so i don't know if we'll get it done in time,
> but we are trying.
Not that I care about ksh; you may leave it and concentrate on other, more important things,
so as to make 'it' on time.
However, I am _really_ interested in just what it is that you want to get done in time,
especially since there have been mentions before of a new Plan9 version...
Best Wishes, Bengt
===============================================================
Everything aforementioned should be regarded as totally private
opinions, and nothing else. bengt@softwell.se
``His great strength is that he is uncompromising. It would make
him physically ill to think of programming in C++.''
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2000-03-13 1:21 Vladimir
0 siblings, 0 replies; 13+ messages in thread
From: Vladimir @ 2000-03-13 1:21 UTC (permalink / raw)
--- bobf@plan9.bell-labs.com wrote:
> the plan 9 port of awk
> is useless without plan 9: you need plan 9 both to build
> it and to run it, so there is no reason to release other
> than as part of the system.
I still don't have the full plan9 - just the pcdist.
The precompiled binary should work - if you have the right
to make it available.
Thanks
Vladimir
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2000-03-12 23:29 bobf
0 siblings, 0 replies; 13+ messages in thread
From: bobf @ 2000-03-12 23:29 UTC (permalink / raw)
> Conversely, if I recall correctly, the version that shipped with the
> second edition wasn't the most recent. Ditto for ksh.
to answer vladimir's question, the plan 9 port of awk
is useless without plan 9: you need plan 9 both to build
it and to run it, so there is no reason to release other
than as part of the system.
the version of awk that we ship is always ported from
brian's most recent version in his home directory and
is therefore in sync with what brian is currently releasing.
we are trying to port the new ksh, but it is a total
mess so i don't know if we'll get it done in time,
but we are trying.
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2000-03-12 22:50 Scott
0 siblings, 0 replies; 13+ messages in thread
From: Scott @ 2000-03-12 22:50 UTC (permalink / raw)
(Wow... I go on vacation, and the list springs to life.)
> But, what about the Plan9 port of it?
Conversely, if I recall correctly, the version that shipped with the
second edition wasn't the most recent. Ditto for ksh.
Fixed in Brazil?
^ permalink raw reply [flat|nested] 13+ messages in thread
* [9fans] awk
@ 2000-03-12 21:50 Vladimir
0 siblings, 0 replies; 13+ messages in thread
From: Vladimir @ 2000-03-12 21:50 UTC (permalink / raw)
--- rob pike <rob@plan9.bell-labs.com> wrote:
> The source to awk has been available on Brian Kernighan's
> web page "since the web was invented", according to him.
>
> -rob
>
Yes it is.
But, what about the Plan9 port of it?
Given the licence, does anyone have the right to make it
publicly available?
Thank you
Vladimir.
^ permalink raw reply [flat|nested] 13+ messages in thread
end of thread, other threads:[~2003-09-13 0:40 UTC | newest]
Thread overview: 13+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2002-11-07 6:46 [9fans] awk Kenji Arisawa
2002-11-20 14:49 ` Kenji Arisawa
-- strict thread matches above, loose matches on Subject: below --
2003-09-13 0:40 boyd
2002-11-07 9:56 okamoto
2001-06-11 10:13 Laura Creighton
2001-06-11 12:27 ` Boyd Roberts
2001-06-18 9:12 ` Rick O'shay
2000-03-14 10:06 Arnold
2000-03-13 6:25 Bengt
2000-03-13 1:21 Vladimir
2000-03-12 23:29 bobf
2000-03-12 22:50 Scott
2000-03-12 21:50 Vladimir