I am aware of the prevalence of NUL-terminated strings, since I've coded in C in the past, that's why I wrote 'considerable bother to fix it'. Nevertheless, for a purpose such as argument passing, size + data is clearly better (easier to secure and more flexible) On Thu, Dec 10, 2015 at 2:59 PM, Nikolay Aleksandrovich Pavlov (ZyX) < kp-pav@yandex.ru> wrote: > 10.12.2015, 07:18, "D Gowers" : > > Ah, okay. That (commandline arguments not being able to contain NUL) > seems.. a bit anachronistic. But I guess it's never been enough of a > problem to warrant the considerable bother to fix it. Fair enough. > > This has nothing to do with the commandline itself. In some very earlier > days it was decided that strings will be NUL-terminated (in place of e.g. > being structs with size_t size and char *data) and this statement sneaked > into many parts of many standards. If you write C code you will have > problems when dealing with NUL-terminated string because every library > function that accepts something other then void* pointer with “generic > data” assumes that string should terminate with NUL. Projects like zsh or > almost every programming language have to write their own string > implementations: in zsh it is C strings with escaped characters, in most > other cases it is length+data pair. > > Since one of the functions having NUL convention is exec* function family > which is used to launch programs and another is main() function on the > other side that accepts NUL-terminated strings you cannot really do > anything to fix this: replacing one of the core conventions is *very* > expensive, especially since you must do this in a backward-compatible way. > > > On Thu, Dec 10, 2015 at 2:26 PM, Nikolay Aleksandrovich Pavlov (ZyX) < > kp-pav@yandex.ru> wrote: > >> 10.12.2015, 04:52, "D Gowers" : > >>> Test case: > >>> > >>> v=$(printf foo\\0bar);expr length "$v";expr length $v > >>> > >>> alternatively: > >>> > >>> v=foo$'\0'bar;expr length "$v";expr length $v > >>> > >>> In zsh, the values returned are 3 and 3. > >>> In dash and zsh, the values returned are 6 and 6. > >>> > >>> Both of those results are wrong, AFAICS (foo$'0'bar is 7 characters > long). > >>> But the zsh result is more severely wrong. I could understand the > bash/dash > >>> result, at least, as 'NULL characters are not counted towards length'. > >> > >> Both results are *right*. In both cases you ask the length of the > string and you get it. > >> > >> In dash (also posh, bash and busybox ash) zero byte is skipped when > storing. So length of the $v *is* six. You may question whether it is right > storing without zero byte, but the fact that all four shells have exactly > the same behaviour makes me think this is part of the POSIX standard. In > any case non-C strings are not on the list of features of these shells > unlike zsh (it also internally uses C NUL-terminated strings, but zero > bytes and some other characters are “metafied” (i.e. escaped) and > unmetafied when passed to the outer world e.g. by doing `echo $v` to pass > string to terminal). > >> > >> As I said in zsh zero byte is stored. But C strings which are the only > ones that can be arguments to any program are **NUL-terminated**. So what > you do is passing string "foo" because NUL terminates the string. You > cannot possibly get the answer you think is right here thus, unless you > reimplement `expr` as a zsh function. > >> > >>> > >>> In any case, it is easily demonstrated that the string is not 3 > characters > >>> long, by running 'echo "$V"' or 'print "$v"' or 'echo ${#v}' > >>> > >>> `zsh --version` = 'zsh 5.2 (x86_64-unknown-linux-gnu)' >