From mboxrd@z Thu Jan 1 00:00:00 1970 Message-ID: <7871fcf50802261838k1312b9doc979c4a065c93e6b@mail.gmail.com> Date: Tue, 26 Feb 2008 21:38:40 -0500 From: "Joel C. Salomon" To: "Fans of the OS Plan 9 from Bell Labs" <9fans@cse.psu.edu> Subject: Re: [9fans] awk, not utf aware... In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <62de1c2c6c8cd64152312915825615eb@plan9.bell-labs.com> Topicbox-Message-UUID: 63d4b2de-ead3-11e9-9d60-3106f5b1d025 On Tue, Feb 26, 2008 at 4:21 PM, Pietro Gagliardi wrote: > And it's wonderful that the C standard defines a character literal as > so: But it leaves the meaning of a literal like 'abcd' up to the compiler. I did something very perverse -- but 'legal' -- in the compiler I started writing for class... Also recall that sizeof('c') == sizeof(int). I suspect, though, that literals like 'abcd' are left from the B (word-addressable, not byte-addressable) days. A quick check of /sys/src/cmd/cc/lex.c shows that kenc disallows such horrors. --Joel