In the case of 2), that's interesting because such a type of Some None is sort of antithetical to describing anything sensical. Not that it's not pragmatic or doesn't occur-I'm sure some functions get combined in ways that stuff like this crops up. But I think of the typing system as being badly exercised if something like this arises-but I really see your point in terms of implementation. Thanks for writing this.

I'm sure that ctypes would likely be able to answer exactly this problem because it boils down to C. But it was a general question about extensions to the typing system.

On Thu, Aug 20, 2015 at 5:10 AM, Goswin von Brederlow <goswin-v-b@web.de> wrote:
On Tue, Aug 18, 2015 at 09:44:04PM +0200, Gabriel Scherer wrote:
> We've discussed optimisations of ('a option) in the past. Certainly
> some things could be done, but it's unclear to me how much value there is in
> optimizing ('a option) specifically: what if, for example, we later
> understand that ('a, exn) result is the more general abstraction that
> we should have used instead, and rely on it heavily in libraries, will
> we de-optimize options and work on optimizing results?
>
> Note that your idea of "either a failure of a value" can be achieved,
> in some monomorphic cases (specifically when you know 'a and it has a
> product structure) by using a specific type declaration:
>
>   type my_struct =
>     | None
>     | Some of int * int array * string
>
> This will be represented as efficiently as the tuple (int * int array
> * string), yet it has a default case (or two, or another case with
> exceptions, whatever -- this is more flexible than just options). With
> inline records in 4.03 -- not yet released -- you will even be able to
> have some of the product structure mutable:
>
>   type my_struct =
>     | None
>     | Some of { mutable count : int; values : int array; name : string }

Except here you are talking about an already boxed value. The Some
part is encoded in the tag of the box you already have.

In the case of a pointer you have not box to put the Some tag on.

> On Tue, Aug 18, 2015 at 9:01 PM, Kenneth Adam Miller
> <kennethadammiller@gmail.com> wrote:
> > Well, it's not restricted to pointers - In general I would think that the
> > type annotation for Some | None would be left alone. I just used pointer as
> > an example because pointers exclude a value, 0x0, from the valid set. In
> > which case None is encoded as 0x0.
> >
> > Thanks for the bit about polymorphism in the context of what a compiler
> > would see - clients that do not see the hypothetical additional annotation
> > for that specific type to allow a format wouldn't have the augmented
> > operational needs to work on such an instance correctly. Got it!

There are 2 problems:

1) polymorphism

A function taking an 'a option could get 0, which it would easily
detect as 0, a pointer to some value or a pointer to box taged Some.
But there is no way to separate the last two cases. The pointer
doesn't even have to point to an ocaml value so dereferencing and
checking if it is a box with Some tag is not an option. And even if
you could any variant type with a boxed constructor will have a tag of
0, same as Some. Having different representation for pointer option
and other option doesn't work.

2) 'a option option

For 'a option None becomes 0x0 and Some pointer becomes pointer. But
what about 'a option option. Do you repeat the process? Then None and
Some None would both become 0x0 and can't be separated. And if you
don't repeat the process you break polymorphism (see 1).


What you can do is create a new type that behaves like

    type 'a ptr_option = NULL | 'a

but the type would have to be abstract and you would have to have a
function to convert it into an 'a option for pattern matching like

    match to_option ptr_opt with
    | None -> ...
    | Some ptr -> ...

and hope the optimizer eliminates the allocation and boxing.

Doesn't ocaml-ctypes already have such a type?

> > On Tue, Aug 18, 2015 at 2:57 PM, Hendrik Boom <hendrik@topoi.pooq.com>
> > wrote:
> >>
> >> On Tue, Aug 18, 2015 at 01:06:55PM -0400, Kenneth Adam Miller wrote:
> >> > I was wondering if cases where format control is possible in typing
> >> > constructs can allow things like restricting the implementation size
> >> > after
> >> > compilation of a specific variant type. Say, for instance that I wanted
> >> > to
> >> > have a malloc implementation instead of returning a Some 'a | None type
> >> > that compiles down to a boxed case of first a word and then the
> >> > subsequent
> >> > 'a instance, down to the 'a instance, where in the values of the word
> >> > enum
> >> > (or tag) are not present in the possibilities of the 'a instance.
> >> >
> >> > Maybe it sounds silly, but in really tight loops you want to squeeze for
> >> > efficiency. So I was wondering if maybe the same actual code be used
> >> > with
> >> > the same sanity of type checking, but some annotation provided at the
> >> > type
> >> > declaration to allow such optimization to take place.
> >>
> >> Let's see.  OCaml steals a bit to indicate whether a valus is a pointer
> >> or not, right?  Could that bit see duual usage for the option type?  So
> >> that if it's an optional pointer type, the bit is left off, and if it's
> >> an optional nonpointer type, it's turned on (and set to point to
> >> location zero, which the GC couls check for)?
> >>
> >> THe proble I see with this is if the 'a is passed to a generic function
> >> where iti isn't statically known where it's a pinter or not.  The
> >> conpiler will not know whether to test for absence or presence of the
> >> bit.
> >>
> >> -- hendrik

MfG
        Goswin

--
Caml-list mailing list.  Subscription management and archives:
https://sympa.inria.fr/sympa/arc/caml-list
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
Bug reports: http://caml.inria.fr/bin/caml-bugs