From: eijiro_sumii@anet.ne.jp
To: checker@d6.com
Cc: caml-list@inria.fr, gerd@gerd-stolpmann.de,
sumii@venus.is.s.u-tokyo.ac.jp
Subject: Re: substring match like "strstr"
Date: Tue, 12 Dec 2000 12:28:07 +0900
Message-ID: <20001212122807K.sumii@yl.is.s.u-tokyo.ac.jp>
In-Reply-To: <4.3.2.7.2.20001211103237.00c12100@shell16.ba.best.com>
> Any ideas why strstr blows the others away? What's the libc strstr
> look like?
I have no idea, unfortunately...
> I just looked in the MSVC source and it's a braindead while loop
> (copied below), so it's not like it's doing a fancy Boyer-Moore or
> anything.
I don't know anything about Sun's strstr, but I checked strstr in
GNU libc. It is quite a complicated program, but it looks like a
brute-force algorithm (that is, no Knuth-Morris-Pratt or anything
like that).
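For reference, a brute-force substring search can be sketched in C
roughly as follows. This is only an illustration of the general
technique, not the GNU libc or MSVC source; the name "naive_strstr"
is hypothetical.

```c
#include <stddef.h>

/* Hypothetical illustration of a brute-force search; this is NOT the
   GNU libc or MSVC implementation. Returns a pointer to the first
   occurrence of needle in haystack, or NULL if there is none. */
const char *naive_strstr(const char *haystack, const char *needle)
{
    if (*needle == '\0')
        return haystack;        /* an empty needle matches at the start */

    for (const char *start = haystack; *start != '\0'; start++) {
        const char *h = start;
        const char *n = needle;

        /* Compare the needle with the haystack at this offset. */
        while (*n != '\0' && *h == *n) {
            h++;
            n++;
        }
        if (*n == '\0')
            return start;       /* the whole needle matched */
    }
    return NULL;                /* no occurrence found */
}
```

With this sketch, naive_strstr("GATTACA", "TAC") returns a pointer to
"TACA". The worst case is proportional to the product of the two
lengths, since every starting offset may be compared against most of
the needle.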
> This is exactly the kind of problem on which I'd expect caml to come
> within 10% of c.
That was what I expected, too.
> I'd say I'd do the tests myself, but I don't have a bunch of gene
> sequences laying around. :)
I've put the current version of my application program at:
http://www.yl.is.s.u-tokyo.ac.jp/~sumii/tmp/hc.tar.gz
If you like, you're welcome to check it yourself, of course. :) You can
run it as "make ; time ./hc". The output should look like:
score = 0.348672
score = 0.391356
(snip)
score = 0.630415
The OCaml function "strstr" is in the file "strstr.ml".
> Okay, I'm curious, so I'll port the code to caml and include it
> below as well (as practice for myself). Can you try it in your test
> harness?
Sure, but please give me a little time. This is a kind of part-time
weekend job for me, and I can't tell when I'll have time to work on it
next...
Eijiro