zsh-users
 help / color / mirror / code / Atom feed
From: nix@myproxylists.com
To: "Bart Schaefer" <schaefer@brasslantern.com>
Cc: zsh-users@zsh.org
Subject: Re: How to add string to end of each array element without loops
Date: Sat, 26 Mar 2011 16:04:28 +0200	[thread overview]
Message-ID: <dbcc1b5d31243610e6af2523d1750113.squirrel@gameframe.net> (raw)
In-Reply-To: <110326001415.ZM31882@torch.brasslantern.com>

> On Mar 26,  4:39am, nix@myproxylists.com wrote:
> }
> } Line 143: print -l ${(n)PROXIES} >> $Proxy_List causes segfault. I can't
> } see where the problem is, if you try IP range up to one million, it will
> } work good.
>
> Aha.  I'm able to reproduce this with
>
> typeset -A a
> a=({5000000..1} {1..5000000})
> print -l $a >> /dev/null
>
> What's actually failing is builtin.c line 283, the declaration
>
> 	VARARR(char *, argarr, argc + 1);
>
> This is attempting to allocate the entire argument list of "print"
> (which in my example is 5 million words) on the C function call
> stack.  Even with "unlimit stacksize" this is likely to overflow.
> The program merely doesn't discover that this has failed until a
> few lines later when it tries to use the first element of the array.
>

I actually tried to double the limit for stacksize 'limit stacksize 32'
but it did not fixed it. Looks like the limit is something like one
million for 'print -l $a >> /dev/null'.

> There's really nothing that can be done about this.  The whole shell
> paradigm of passing arguments as an argv[] array means that $a has
> to be expanded and then "print" called with the words by value.
>
> So what you have to do to work around this is NOT rely on "print -l"
> to insert the newlines, and instead insert them yourself:
>
> print ${(F)PROXIES} >> $Proxy_List
>
> This passes a single giant word with embedded newlines to "print" and
> thus avoids allocating all that space on the C stack.
>
> You're really past the design limits here of what a language with the
> semantic rules of a command shell is meant to deal with.  If "print"
> were not a builtin you'd have blown out the limits of argument passing
> before even getting as far as you did; even with the (F) trick, using
> /bin/echo will fail with "argument list too long".
>

./SUBNET
[+] Generating IP-addresses ... please wait
[+] 2072640 IP-addresses has been generated ...
[+] Generating proxy list ... please wait
[+] 6217920 proxies has been generated.

The (F} trick fixed it. Im quite confident that nearly none else expect me
need that big arrays. 'SUBNET' script is very fast, it's actually able to
beat in terms of speed solarwinds's advanced subnet calculator when it
comes to generating IPs out of CIDR :)

As I said earlier, my C programming skills are very limited but im getting
quite good with ZSH and PHP. Mixing two powerful scripting languages (or
programming languages) you name it, the results are quite powerful.

> I noticed you loaded the mapfile module even though you don't use it.
> It should work to do
>
> mapfile[$Proxy_list]=${(F)PROXIES}
>
> but whether that's actually faster than "print >>" I haven't checked.
>

I load other modules as well because most of my other tools are using
those modules. I tried also with mapfile but it did segfaulted as well.
Afterwards I noticed it's not even needed in 'SUBNET' script.

Thanks a bunch again. Looks like I have found my ZSH guru :)




      reply	other threads:[~2011-03-26 14:04 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-03-25 21:32 nix
2011-03-25 21:45 ` Mikael Magnusson
2011-03-25 21:47   ` nix
2011-03-25 22:03 ` Bart Schaefer
2011-03-26  1:37   ` nix
2011-03-26  2:10     ` Bart Schaefer
2011-03-26  2:39       ` nix
2011-03-26  7:14         ` Bart Schaefer
2011-03-26 14:04           ` nix [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=dbcc1b5d31243610e6af2523d1750113.squirrel@gameframe.net \
    --to=nix@myproxylists.com \
    --cc=schaefer@brasslantern.com \
    --cc=zsh-users@zsh.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/zsh/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).