caml-list - the Caml user's mailing list
 help / color / mirror / Atom feed
From: Neale Pickett <neale-caml@woozle.org>
To: caml-list@pauillac.inria.fr
Subject: Re: [Caml-list] Str.string_match raising Invalid_argument "String.sub" in gc
Date: 23 Aug 2001 09:06:58 -0700	[thread overview]
Message-ID: <w53k7zuzrod.fsf@woozle.org> (raw)
In-Reply-To: <20010823122114.A3873@cs.uu.nl>

Frank Atanassow writes:

> Ocaml does not purport to have no side-effects. It has plenty of
> side-effects.  You must be thinking of Haskell or Miranda.

That's probably half of my problem, then :-)

> I'm pretty sure there is no such optimization, but I'm not sure what
> you're talking about here. Anyway, if an optimization affected the
> behavior of a program, it would not be an optimization but rather an
> compiler bug.

Having slept on it, I think what I was experiencing might be linked with
the fact that the Str library is apparently non-reentrant and my
approach to using the regexp parts of Str.  What I ran into was, I
think, a bug in either the Str library or its documentation.

Originally, I was trying to do something like this:

# let string_lines =
    let sep = Str.regexp "^[ \t\n]*\\(.+\\)" in
    let rec f = function
      | [] -> []
      | s :: rest -> if (Str.string_match sep s 0) then
          (Str.matched_group 1 s) :: (f rest)
      else
          f rest
    in
    f
  in
  string_lines ["  hello"; "  dromedaries"];;
Uncaught exception: Invalid_argument "String.sub".

(Apologies if this is inelegant, I'm just starting out.)

Alain Frisch <frisch@clipper.ens.fr> points out:

> This is wrong; with the current OCaml implementation, the right
> operand of (::) is called first; so (Str.matched_group 1 s) is called
> after subsequent calls to Str.string_match, which is obviously
> incorrect.

I contest that this is obvious.  s is a different string each time f is
called, and so even though I do call Str.string_match multiple times,
it's with a different s.  The manual for the Str libary says only that I
must pass in the same s as was given to string_match, which implies that
s is somehow keyed to its matches.  It sounds as though I shouldn't do
the following:

  Str.string_match sep s 0;
  Str.string_match sep s' 0;
  print_string (Str.matched_group 1 s);

If this is the case, why does Str.matched_group even bother requiring
the original string?

I may be missing some crucial aspect to OCaml, and if so, I apologize
for this excercise in my own ignorance.  With my current understanding
of the language, though, it looks as though to use the regexp parts of
Str, I need to understand the underlying implementation of the library,
or at least know not to call string_match as above.  If the former, I
would consider this a bug; if the latter, it should just be added to the
documentation.  Either way, it's confusing.

> If I understand you correctly (but I don't think I do):

> # Str.split (Str.regexp "[ \t\n]+") "  abc def  ghi j";;
> - : string list = ["abc"; "def"; "ghi"; "j"]

This is, in fact, exactly what I was trying to do.  I wanted to code it
as a recursive function to show a friend the difference between
functional and procedural programming, got caught up in the exception,
and forgot what my original intent was.  Thank you!
-------------------
Bug reports: http://caml.inria.fr/bin/caml-bugs  FAQ: http://caml.inria.fr/FAQ/
To unsubscribe, mail caml-list-request@inria.fr  Archives: http://caml.inria.fr


  reply	other threads:[~2001-08-23 16:07 UTC|newest]

Thread overview: 39+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2001-08-22 18:53 neale-caml
2001-08-22 19:18 ` Alain Frisch
2001-08-22 20:41   ` Neale Pickett
2001-08-23 10:21     ` Frank Atanassow
2001-08-23 16:06       ` Neale Pickett [this message]
2001-08-23 16:25         ` Alain Frisch
2001-08-23 18:14           ` Neale Pickett
2001-08-22 20:23 ` Markus Mottl
2001-08-22 20:31   ` Miles Egan
2001-08-22 20:52     ` Michael Leary
2001-08-23  5:36       ` Jeremy Fincher
2001-08-22 22:06     ` Nicolas George
2001-08-23  7:08       ` [Caml-list] PCRE as standard (Was: Str.string_match raising Invalid_argument...) Florian Hars
2001-08-23 17:31       ` [Caml-list] Str.string_match raising Invalid_argument "String.sub" in gc Brian Rogoff
2001-08-23 18:08         ` [Caml-list] standard regex package Miles Egan
2001-08-23 19:28           ` Brian Rogoff
2001-08-23 19:49             ` Miles Egan
2001-08-23 19:51             ` Gerd Stolpmann
2001-08-23 21:12               ` Brian Rogoff
2001-08-23 21:27               ` Benjamin C. Pierce
2001-08-23 21:49                 ` Gerd Stolpmann
2001-08-23 22:11                   ` Miles Egan
2001-08-23 23:55                     ` Gerd Stolpmann
2001-08-24  9:03                       ` Claudio Sacerdoti Coen
2001-08-24  9:26                       ` Sven
2001-08-27 15:46                         ` [Caml-list] Package dependencies [Was: standard regex package] Ian Zimmerman
2001-08-27 20:50                           ` Gerd Stolpmann
2001-08-24  9:23                   ` [Caml-list] standard regex package Sven
2001-08-27 15:54                     ` Ian Zimmerman
2001-08-30  8:41                       ` Sven
2001-08-23 21:06             ` RE : " Lionel Fourquaux
2001-08-24  9:23               ` [Caml-list] dynamic loading and OS interface Xavier Leroy
2001-08-27 15:16             ` [Caml-list] standard regex package Ian Zimmerman
2001-08-27 15:35               ` Brian Rogoff
2001-08-24  9:13           ` Xavier Leroy
2001-08-24 10:16             ` Markus Mottl
2001-08-24 16:49             ` Miles Egan
     [not found]   ` <w533d6j1vxn.fsf@woozle.org>
     [not found]     ` <20010823112653.A7085@chopin.ai.univie.ac.at>
     [not found]       ` <w5366be7fd0.fsf_-_@woozle.org>
2001-08-23 20:01         ` [Caml-list] Re: [OFF-LIST] Str.string_match raising Invalid_argument "String.sub" in gc Markus Mottl
2001-08-23 20:31           ` Patrick M Doane

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=w53k7zuzrod.fsf@woozle.org \
    --to=neale-caml@woozle.org \
    --cc=caml-list@pauillac.inria.fr \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).