caml-list - the Caml user's mailing list
 help / color / mirror / Atom feed
From: Jacques Garrigue <garrigue@math.nagoya-u.ac.jp>
To: caml-list@inria.fr
Subject: Re: [Caml-list] mmap() and strings
Date: Thu, 09 Dec 2004 10:09:01 +0900 (JST)	[thread overview]
Message-ID: <20041209.100901.75479815.garrigue@math.nagoya-u.ac.jp> (raw)
In-Reply-To: <20041208205042.GD1840@ens-lyon.fr>

From: Julien Cristau <julien.cristau@ens-lyon.fr>
> I wrote:
> > > We thought we could use mmap(2), 
> > > but there seems to be no easy solution 
> > > to mmap() a memory region and treat it as a string in ocaml. 
> > 
> On 08/12/2004-21:18, Basile STARYNKEVITCH wrote:
> > Use Bigarray-s for that. They can mmap files (on Unix & Linux) and are
> > already in Ocaml 3.08
> > 
> Actually, i had a look at bigarrays, and it's one of the solutions I 
> considered. However, I'd like to keep strings as data structure, because 
> the operations I have to perform take a string as an argument, and not a 
> (char, Bigarray.int8_unsigned_elt, Bigarray.c_layout) Bigarray.Array1.t, 
> and it would be a pain to change all these functions (if I change them, 
> I'll probably bind mmap() and munmap() directly and call them with 
> MAP_ANONYMOUS, but I'd rather not do that).

I don't know exactly your goal, but if it is just that you don't want
to write a single line of C (and all the boilerplate), then you can
always do some magic (note that this is going to be very dark magic!)

The main problem is way string length is represented.
What you have to do is create a pseudo block header inside a bigarray.
The simplest way is to first create a string of the right size, and
then copy it byte by byte to the bigarray, starting with index (-4)
(for a 32-bit machine) and ending at ((len/4+1)*4) (the last by of the
last word of the string encodes part of the length), using
String.unsafe_get or String.unsafe_blit (more subtle).
Then you want to get a pointer at offset 4 in the string.
Not too hard either:
    (Obj.magic
       (!(snd (Obj.magic biga : Obj.t * int ref)) + 2)
       : string)

Now this has lots of dependencies on the behavior of the compiler and
how bigarrays are represented, but I believe this should work.
Not however that if you have problems with that, debugging can become
hairy, in this completely unsafe world.

Cheers,

Jacques Garrigue


  reply	other threads:[~2004-12-09  1:09 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-12-08 20:04 Julien Cristau
2004-12-08 20:24 ` [Caml-list] " Basile STARYNKEVITCH
2004-12-08 20:50   ` Julien Cristau
2004-12-09  1:09     ` Jacques Garrigue [this message]
2004-12-09  1:42       ` Jacques Garrigue
2004-12-09 10:32         ` David Baelde

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20041209.100901.75479815.garrigue@math.nagoya-u.ac.jp \
    --to=garrigue@math.nagoya-u.ac.jp \
    --cc=caml-list@inria.fr \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).