From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <rathereasy@gmail.com>
X-Original-To: caml-list@sympa.inria.fr
Delivered-To: caml-list@sympa.inria.fr
Received: from mail3-relais-sop.national.inria.fr (mail3-relais-sop.national.inria.fr [192.134.164.104])
	by sympa.inria.fr (Postfix) with ESMTPS id 369667EE51
	for <caml-list@sympa.inria.fr>; Mon, 29 Apr 2013 07:17:04 +0200 (CEST)
Received: from mail-ve0-f169.google.com ([209.85.128.169])
  by mail3-smtp-sop.national.inria.fr with ESMTP/TLS/RC4-SHA; 29 Apr 2013 07:17:03 +0200
Received: by mail-ve0-f169.google.com with SMTP id pa12so1307522veb.28
        for <caml-list@inria.fr>; Sun, 28 Apr 2013 22:17:01 -0700 (PDT)
MIME-Version: 1.0
X-Received: by 10.52.89.229 with SMTP id br5mr1193494vdb.24.1367212621852;
 Sun, 28 Apr 2013 22:17:01 -0700 (PDT)
Received: by 10.220.200.66 with HTTP; Sun, 28 Apr 2013 22:17:01 -0700 (PDT)
In-Reply-To: <00C57DF0-C6F0-4EDE-8607-2155F3A17146@math.nagoya-u.ac.jp>
References: <00C57DF0-C6F0-4EDE-8607-2155F3A17146@math.nagoya-u.ac.jp>
Date: Sun, 28 Apr 2013 22:17:01 -0700
Message-ID: <CAK0y-36uPAAXGanrhZKQOQAPsELjTFxM1Av820da-mU4kqTVfg@mail.gmail.com>
From: Jacques Le Normand <rathereasy@gmail.com>
To: Jacques Garrigue <garrigue@math.nagoya-u.ac.jp>
Cc: OCaML List Mailing <caml-list@inria.fr>
Content-Type: multipart/alternative; boundary=20cf307f3762a8a23604db78fe95
Subject: Re: [Caml-list] Request for feedback: A problem with injectivity
 and GADTs


--20cf307f3762a8a23604db78fe95
Content-Type: text/plain; charset=ISO-8859-1

Fully injective types are the norm, so it would have been nicer to have a
"non-injectivity" or "phantom type" annotation instead. Since phantom types
are seldom used, beginners would rarely meet the annotation and won't get
confused. It might even help beginners understand the concept of a phantom
type.
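
For readers unfamiliar with the term, here is a minimal sketch of a phantom
type. The module Tagged and its functions are hypothetical names invented
for this illustration. Inside the module, raw t and checked t are both just
string, so t does not depend on its parameter, which is exactly the
non-injective case discussed in the quoted message.

```ocaml
(* Hypothetical example: the parameter of [t] is phantom. It never
   appears in the representation and only records whether the string
   has been checked. *)
module Tagged : sig
  type 'a t
  type raw
  type checked
  val of_string : string -> raw t
  val check : raw t -> checked t option
  val to_string : checked t -> string
end = struct
  type 'a t = string   (* 'a is unused: [raw t] and [checked t] are equal here *)
  type raw
  type checked
  let of_string s = s
  let check s = if s <> "" then Some s else None
  let to_string s = s
end
```

Outside the module the two instances are kept apart by abstraction, so
to_string cannot be applied to a value that has not gone through check.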


On Sat, Apr 27, 2013 at 5:02 PM, Jacques Garrigue <
garrigue@math.nagoya-u.ac.jp> wrote:

> Some of you may already be aware of PR#5985
> http://caml.inria.fr/mantis/view.php?id=5985
>
> To explain briefly what is happening, a bug was found in the OCaml
> compiler, in both 4.00 and the development version, such that one can
> convert a value from any type to any other type, bypassing all checks.
> This is what we call a soundness bug, a hole in the type system.
>
> Such problems happen at times, and we are usually very fast to fix them.
> However, this time the problem is a bit more subtle, because fixing it
> straightforwardly may make it impossible to write a whole category of
> GADTs. Hence this request for feedback.
>
> Practical description of the problem:
>
> It is quite usual to write type witnesses using a GADT:
>
> type _ ty =
>   | Int : int ty
>   | Pair : 'a ty * 'b ty -> ('a * 'b) ty
>   | Hashtbl : 'a ty * 'b ty -> ('a, 'b) Hashtbl.t ty
>
> This looks fine, but now suppose that we write the following code:
>
> module F (X : sig type 'a t end) = struct
>    type _ ty =
>     | Int : int ty
>     | Pair : 'a ty * 'b ty -> ('a * 'b) ty
>     | T : 'a ty -> 'a X.t ty
> end
>
> The idea is to create a type witness with the
> type constructor X.t instead of Hashtbl.t.
> Now I can use it this way:
>
> module U = struct type 'a t = unit end
> module M = F(U)
> let w = M.T M.Int
> (* val w : int U.t M.ty *)
> let w' = match w with M.T x -> x | _ -> assert false
> (* val w' : '_a M.ty = M.Int *)
>
> What this means is that we can now give an arbitrary type
> to a witness for int. You can easily use it to build a function
> from int to any type:
>
> let f : type a. a M.ty -> int -> a =
>   fun w x -> match w with M.Int -> x | _ -> assert false;;
> let g : int -> float = f w';;
>
> If we look at the source of the problem, it lies in the fact that U.t is
> not injective.
> I.e. when we define a GADT, we assume that all the type variables in
> the return type can be determined from the type parameter.
> That is, from (int U.t M.ty) we assume that the type of the argument of T
> is (int M.ty).
> However, since U.t is not injective, int U.t is actually equal to any type
> 'a U.t, and we can convert.
> Note that, for known types, injectivity was already checked.
> I.e., the following definition is not accepted:
>
> type _ ty =
>   | Int : int ty
>   | Pair : 'a ty * 'b ty -> ('a * 'b) ty
>   | T : 'a ty -> 'a U.t ty
>
> However, abstract types were assumed to be injective.
> Here we see that you cannot take this for granted.
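
The non-injectivity of U.t can be checked directly. This is only a small
sketch: U is the module from the quoted message, while retag, eq and same
are hypothetical names added for the illustration.

```ocaml
(* U is taken from the quoted message: its parameter is ignored. *)
module U = struct type 'a t = unit end

(* Both types expand to unit, so the identity function converts
   between them without any cast. *)
let retag : int U.t -> string U.t = fun x -> x

(* A standard equality witness; [Refl] is only accepted when the
   two type arguments are equal. *)
type (_, _) eq = Refl : ('a, 'a) eq

(* Accepted by the type checker: int U.t and string U.t are the same
   type, although int and string are not. *)
let same : (int U.t, string U.t) eq = Refl
```

Since int U.t = string U.t does not imply int = string, nothing about the
parameter can be recovered from an instance of U.t.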
>
> The problem is of course not limited to type witnesses, and not even to
> GADTs: types with constrained parameters (introduced at the very beginning
> of OCaml) can also trigger it.
> And you don't need a functor to trigger it: once you know that two types
> are equal in some scope, there are many ways to leak that information.
>
> =============================
> Here comes the request for feedback:
>
> The fix is simple enough: we should track injectivity, and assume that
> abstract types are not injective by default.
> However, this also means that the first type I defined above (using
> Hashtbl.t) will be refused.
>
> How many of you are using GADTs in this way, and how dependent are you on
> abstract types?
>
>
> If we need to be able to define GADTs relying on the injectivity of some
> abstract types, a straightforward solution is to add injectivity
> annotations on type parameters, the same way it was done for variance:
>
>   type #'a t
>
> would mean that t is injective.
> This is implemented in branches/non-vanishing, in the subversion
> repository.
>         svn checkout http://caml.inria.fr/svn/branches/non-vanishing
> Note that in that branch Hashtbl.t in the standard library is marked as
> injective.
> This introduces some new syntax, and libraries have to be modified to
> benefit, so the question is: do we really need it?
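
As a rough sketch of what such an annotation buys, here is the functor from
the quoted message with an injective parameter. Note the assumptions: the
syntax proposed above is #'a, whereas this sketch uses the !'a spelling
that OCaml 4.12 and later accept, so it will not build on the
non-vanishing branch or on older compilers. The names INJ, L, eq and eq_ty
are hypothetical and not part of the proposal.

```ocaml
(* Assumes OCaml >= 4.12, where injectivity is written !'a
   (the quoted proposal writes it #'a). *)
type (_, _) eq = Refl : ('a, 'a) eq

(* Hypothetical signature: X.t must be injective in its parameter. *)
module type INJ = sig type !'a t end

module F (X : INJ) = struct
  type _ ty =
    | Int : int ty
    | T : 'a ty -> 'a X.t ty

  (* Compare two witnesses and, when they agree, return a proof
     that the types they describe are equal. *)
  let rec eq_ty : type a b. a ty -> b ty -> (a, b) eq option =
    fun a b ->
      match a, b with
      | Int, Int -> Some Refl
      | T a', T b' ->
        (match eq_ty a' b' with
         | Some Refl -> Some Refl
         | None -> None)
      | _ -> None
end

(* list is injective, so this module matches INJ. *)
module L = struct type 'a t = 'a list end
module M = F (L)

(* A module such as [struct type 'a t = unit end] does not match INJ,
   so F cannot be applied to it. *)
```

With this signature the problematic application F(U) is ruled out at the
functor application, while abstract types declared injective can still be
used in the return type of a GADT constructor.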
>
>
> Jacques Garrigue
>
> P.S.
> If you are curious about other difficulties of GADTs, there is also
> branches/abstract-new, which introduces a notion of new types, either
> concrete or abstract, which are guaranteed to be distinct from each other
> and from any other concrete type. This is useful when checking the
> exhaustiveness of pattern-matching, as we can then discard impossible
> branches. That branch is completely experimental.


--20cf307f3762a8a23604db78fe95--