caml-list - the Caml user's mailing list
 help / color / mirror / Atom feed
* [Caml-list] Efficient scanning of large strings from files
@ 2012-03-16 13:03 Philippe Veber
  2012-03-16 14:14 ` Edgar Friendly
                   ` (3 more replies)
  0 siblings, 4 replies; 15+ messages in thread
From: Philippe Veber @ 2012-03-16 13:03 UTC (permalink / raw)
  To: caml users

[-- Attachment #1: Type: text/plain, Size: 608 bytes --]

Dear camlers,

Say that you'd like to search a regexp on a file with lines so long that
you'd rather not load them entirely at once. If you can bound the size of a
match by k << length of a line, then you know that you can only keep a
small portion of the line in memory to search the regexp. Typically you'd
like to access substrings of size k from left to right. I guess such a
thing should involve buffered inputs and avoid copying strings as much as
possible. My question is as follows: has anybody written a library to
access these substrings gracefully and with decent performance?
Cheers,
  Philippe.

[-- Attachment #2: Type: text/html, Size: 651 bytes --]

^ permalink raw reply	[flat|nested] 15+ messages in thread

end of thread, other threads:[~2012-03-21  7:22 UTC | newest]

Thread overview: 15+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2012-03-16 13:03 [Caml-list] Efficient scanning of large strings from files Philippe Veber
2012-03-16 14:14 ` Edgar Friendly
2012-03-16 14:48   ` Philippe Veber
2012-03-16 17:02     ` Edgar Friendly
2012-03-19  9:08       ` Philippe Veber
2012-03-19 13:44         ` Edgar Friendly
2012-03-21  7:21           ` Philippe Veber
2012-03-16 17:23   ` Francois????Charles Matthieu????Berenger
2012-03-17 16:53     ` oliver
2012-03-19  9:08     ` Philippe Veber
2012-03-16 14:49 ` Jérémie Dimino
2012-03-18 21:11   ` Török Edwin
2012-03-19  9:11     ` Philippe Veber
2012-03-16 20:11 ` oliver
2012-03-18 23:56 ` oliver

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).