caml-list - the Caml user's mailing list
 help / color / mirror / Atom feed
From: "Milan Stanojević" <milanst@gmail.com>
To: Martin Jambon <martin.jambon@ens-lyon.org>
Cc: Jeremy Yallop <yallop@gmail.com>, Caml List <caml-list@inria.fr>
Subject: Re: [Caml-list] segfault in simple program with 4.02 native
Date: Sat, 6 Sep 2014 02:00:40 -0400	[thread overview]
Message-ID: <CAKR7PS_brNtfaYmL+tSFCt=DnRX3jaQ=+5x-98pp=5JqXpKXtg@mail.gmail.com> (raw)
In-Reply-To: <540AA0DB.1040202@ens-lyon.org>

Could you do a dirty trick where you define a record that is the same
as the one you have now except that is has mutable fields, then you do
your parsing like now and then at the end return a record with
immutable fields (using (Obj.magic mutable : immutable)? You just need
to make sure that your mutable record doesn't escape your code.

On Sat, Sep 6, 2014 at 1:51 AM, Martin Jambon
<martin.jambon@ens-lyon.org> wrote:
> On Fri 05 Sep 2014 05:12:44 PM PDT, Jeremy Yallop wrote:
>>
>> On 6 September 2014 00:39, Martin Jambon <martin.jambon@ens-lyon.org>
>> wrote:
>>>
>>> That code is generated by atdgen. What happens is that we have to either
>>> create an empty record when starting to parse a list of unordered JSON
>>> fields, or use a bunch `let <field name> = ref None in` for each field
>>> and
>>> create the record in the end. While the latter approach is not much more
>>> work to implement, the resulting code was found to be significantly
>>> slower.
>>>
>>> The reason why it's using `Obj.magic 0.0` is that it worked in all cases
>>> (and has been for the past 4 years). Obtaining a well-formed constant
>>> value
>>> for any type is not trivial, so this what we have.
>>>
>>> It's very possible that it's now broken with OCaml 4.02. First try a
>>> 'make
>>> test' from atdgen's source directory (https://github.com/mjambon/atdgen)
>>> and
>>> see if it passes.
>>
>>
>> It does seem to be broken, and the change in behaviour with 4.0.2 is
>> apparently due to improved constant propagation
>> (http://caml.inria.fr/mantis/view.php?id=5779).
>>
>> The compiler now takes more advantage of immutability to improve the
>> memory usage and performance of programs.  It's safe (or ought to be
>> safe) to assume that immutable record fields are never updated, so the
>> values used to initialize the fields can be propagated to other parts
>> of the program.  Here's a small example that shows the change in
>> behaviour between 4.01 and 4.02.
>>
>>     type t = { s : string }
>>     let x = { s =  "one" }
>>     let () = Obj.(set_field (repr x) 0 (repr "two"))
>>     let () = print_endline x.s
>>
>> Using OCaml 4.01 the third line overwrites the field 's'  and the
>> fourth line reads the updated field and prints "two".  Using OCaml
>> 4.02 the initial value of the field is propagated past the write to
>> the code in the fourth line, so the program prints "one".
>>
>> The code currently generated by atdgen assumes that it's safe to treat
>> fields as if they were mutable -- that is, it assumes that it's safe
>> to initialize a field with a value of the wrong type, so long as the
>> value is overwritten before the field is first read.  I don't think
>> such tricks were ever explicitly guaranteed to work, but they're now
>> much more likely to fail, leading to the dummy initial value being
>> accessed at an inappropriate type.
>
>
> Thanks for the explanation, Jeremy. I guess atdgen will have to use "option
> refs" after all unless someone has a better idea.
>
> ATD definition:
>
> type t = {
>  ?field0: foo option;
>  ~field1: string;
>  field2: int;
> }
>
> Generated OCaml code:
>
> let field0 = ref None in
> let field1 = ref "" in
> let field2 = ref None in
> ...
> (* parse json fields coming in an unknown order *)
> ...
> {
>  field0 = !field0;
>  field1 = !field1;
>  field2 = (match !field2 with None -> error ... | Some x - >x);
>
> }
>
>
> --
> Caml-list mailing list.  Subscription management and archives:
> https://sympa.inria.fr/sympa/arc/caml-list
> Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
> Bug reports: http://caml.inria.fr/bin/caml-bugs

  reply	other threads:[~2014-09-06  6:01 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-09-05 21:33 Ashish Agarwal
2014-09-05 21:50 ` Andy Ray
2014-09-05 21:56 ` Richard W.M. Jones
2014-09-05 22:01   ` Sebastien Mondet
2014-09-05 22:06   ` Ashish Agarwal
2014-09-05 22:13     ` Richard W.M. Jones
2014-09-05 22:18       ` Richard W.M. Jones
2014-09-05 22:36         ` Török Edwin
2014-09-05 22:39         ` Martin Jambon
2014-09-05 23:39           ` Ashish Agarwal
2014-09-05 23:59             ` Martin Jambon
2014-09-06  0:12           ` Jeremy Yallop
2014-09-06  5:51             ` Martin Jambon
2014-09-06  6:00               ` Milan Stanojević [this message]
2014-09-06  7:46                 ` Frédéric Bour
2014-09-06 19:15                   ` Martin Jambon
2014-09-06 19:08                 ` Martin Jambon
2014-09-06 20:31                   ` David MENTRÉ
2014-09-06 21:57                     ` Martin Jambon
2014-09-07  7:34                       ` David MENTRÉ
2014-09-07 18:47               ` Alain Frisch
2014-09-08  1:28                 ` Martin Jambon
2014-09-13 10:26                   ` Martin Jambon
2014-09-14  7:41                     ` Martin Jambon
2014-09-05 22:18       ` Christoph Höger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAKR7PS_brNtfaYmL+tSFCt=DnRX3jaQ=+5x-98pp=5JqXpKXtg@mail.gmail.com' \
    --to=milanst@gmail.com \
    --cc=caml-list@inria.fr \
    --cc=martin.jambon@ens-lyon.org \
    --cc=yallop@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).