caml-list - the Caml user's mailing list
 help / color / mirror / Atom feed
From: Peter Zotov <whitequark@whitequark.org>
To: Malcolm Matalka <mmatalka@gmail.com>
Cc: caml-list <caml-list@inria.fr>, info@gerd-stolpmann.de
Subject: Re: [Caml-list] [ANN] ppx_protobuf
Date: Mon, 05 May 2014 02:21:09 +0400	[thread overview]
Message-ID: <fbb0c47eec050f267acabfa089db8862@whitequark.org> (raw)
In-Reply-To: <87bnvd374f.fsf@gmail.com>

On 2014-05-04 19:18, Malcolm Matalka wrote:
> In my fantasy scenario you could annotate the accessor functions in a
> module.

I have just found this article:
http://cedeela.fr/universal-serialization-and-deserialization.html

Hopefully it can give some inspiration on how to implement such
a serialization library.

> 
> Peter Zotov <whitequark@whitequark.org> writes:
> 
>> On 2014-05-04 08:49, Malcolm Matalka wrote:
>>> Not exactly. I don't mean I want a functor, I just used that style to
>>> express that I think it would be best if these sort of things worked 
>>> on
>>> a module-to-module level rather than type.  That way I can separate 
>>> out
>>> the data type and it's business logic from its encoding/decoding 
>>> logic.
>>> I want to decouple a type definition from all of the transformations
>>> that can be done on the type.  Everything an still happen at a
>>> preprocessor point, but I just want it to happen on a module level.
>> 
>> Still not a good idea. Consider the annotations like @key and 
>> @encoding:
>> where would you specify them? If right on the type signature, then 
>> what
>> is the point of separation?
>> 
>>> 
>>> 
>>> Peter Zotov <whitequark@whitequark.org> writes:
>>> 
>>>> On 2014-05-03 22:46, Malcolm Matalka wrote:
>>>>> The idea I mean is more to do this at the module level than the 
>>>>> type
>>>>> level, like a functor.  So rather than defining protobuf for a type
>>>>> definition, define it for a module, and have some convention for 
>>>>> how to
>>>>> pick out setter/getter functions.  Then create a new module from 
>>>>> that.
>>>> 
>>>> Oh! You want a functor which would be able to examine the structure
>>>> of the module that was passed to it.
>>>> 
>>>> It's probably technically feasible (you need a syntactic extension
>>>> which would essentially serialize the module that will be passed), 
>>>> but
>>>> it is a really horrible solution:
>>>> 
>>>>   * You won't be able to report some interesting errors (such as
>>>>     incorrect annotations... [@key -1] until runtime.
>>>>   * It will be really slow, because the implementation of the 
>>>> functor
>>>>     will have to traverse the lists of fields dynamically and invoke
>>>>     accessors one by one. My current implementation directly pattern
>>>>     matches the input.
>>>>   * It is just really complicated and does too much at runtime.
>>>> 
>>>>> 
>>>>> For example of the top of my head:
>>>>> 
>>>>> module Foo = sig
>>>>>    type t
>>>>>    val set_x : t -> int -> t
>>>>>    val get_x : t -> int
>>>>> end
>>>>> 
>>>>> Then I can do:
>>>>> 
>>>>> module Foo_protobuf = Protobuf.Make(Foo)
>>>>> 
>>>>> In this case I stole how most people to functors to make it clear 
>>>>> the
>>>>> translation is actually module to module.
>>>>> 
>>>>> The reason I prefer this is because I can also do:
>>>>> 
>>>>> module Foo_xml = Xml.Make(Foo)
>>>>> module Foo_json = Json.Make(Foo)
>>>>> 
>>>>> By separating the mechanism for creating the decoders from the type
>>>>> definition, I can add decoders for any type I want without 
>>>>> disturbing
>>>>> the original definition.  This feels more right to me.  But I have 
>>>>> no
>>>>> idea how to do it.
>>>>> 
>>>>> 
>>>>> Peter Zotov <whitequark@whitequark.org> writes:
>>>>> 
>>>>>> On 2014-05-03 20:08, Malcolm Matalka wrote:
>>>>>>> Nice, great work!
>>>>>>> 
>>>>>>> I'm not actually a huge fan of mixing type definitions and the 
>>>>>>> protocols
>>>>>>> they can be encoded/decoded from.  How hard would it be to take a 
>>>>>>> module
>>>>>>> definition accessors on a type and produce a new module with
>>>>>>> encode/decode functions?  That way I could create JSON, XML, 
>>>>>>> Protobufs,
>>>>>>> etc modules from one module.
>>>>>> 
>>>>>> Do you suggest generating the following signature instead of the 
>>>>>> current
>>>>>> one?
>>>>>> 
>>>>>> type t = ... [@@protobuf]
>>>>>> module Protobuf_t : sig
>>>>>>   val decode : Protobuf.Decoder.t -> t
>>>>>>   val encode : Protobuf.Encoder.t -> t -> unit
>>>>>> end
>>>>>> 
>>>>>> This would be similar to what deriving currently does.
>>>>>> 
>>>>>> In principle, this is not a complex change. It would add just a 
>>>>>> few lines
>>>>>> to ppx_protobuf.
>>>>>> 
>>>>>> However, I don't like it conceptually. I think the flat signature 
>>>>>> is
>>>>>> more natural, it mimics what one would usually write by hand 
>>>>>> without
>>>>>> introducing too much deep nesting of modules. You may notice how
>>>>>> ppx_protobuf doesn't generate the signature items for you; this is
>>>>>> because ppx_protobuf is a mere implementation detail, a convenient
>>>>>> way to generate the serializer/deserializer.
>>>>>> 
>>>>>> I'm not going to oppose addition of such a mode for two reasons:
>>>>>>   * I don't like fighting over minute details.
>>>>>>   * More importantly, deriving, when rewritten with ppx in mind,
>>>>>>     will surely contain this mode for compatibility. ppx_protobuf
>>>>>>     will be (ideally) rewritten over deriving some day.
>>>>>> 
>>>>>> I will happily merge a PR adding such a mode to ppx_protobuf.
>>>>>> 
>>>>>>> 
>>>>>>> Just an idea!
>>>>>>> 
>>>>>>> Peter Zotov <whitequark@whitequark.org> writes:
>>>>>>> 
>>>>>>>> Greetings.
>>>>>>>> 
>>>>>>>> I have just released the first version of ppx_protobuf, a 
>>>>>>>> complete
>>>>>>>> Protocol Buffers implementation. Unlike Google's implementation,
>>>>>>>> ppx_protobuf derives the message structure directly from OCaml 
>>>>>>>> type
>>>>>>>> definitions, which allows a much more seamless integration with
>>>>>>>> OCaml's types. In particular, ppx_protobuf natively supports
>>>>>>>> sum types, while maintaining full backwards compatibility with
>>>>>>>> protoc.
>>>>>>>> 
>>>>>>>> ppx_protobuf uses the extension points API, and thus requires
>>>>>>>> a recent (>= 2014-04-29) 4.02 (trunk) compiler. It also requires
>>>>>>>> an unreleased version of ppx_tools. It is probably easiest
>>>>>>>> to install both from the source repositories[1][2].
>>>>>>>> 
>>>>>>>> The API is extensively documented at [3].
>>>>>>>> 
>>>>>>>> [1]: https://github.com/whitequark/ocaml-ppx_protobuf.git
>>>>>>>> [2]: https://github.com/alainfrisch/ppx_tools.git
>>>>>>>> [3]: 
>>>>>>>> https://github.com/whitequark/ocaml-ppx_protobuf/blob/master/README.md
>>>>>>>> 
>>>>>>>> --
>>>>>>>>   WBR, Peter Zotov.

-- 
Peter Zotov
sip:whitequark@sipnet.ru

  reply	other threads:[~2014-05-04 22:21 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-05-02 14:29 Peter Zotov
2014-05-03 16:08 ` Malcolm Matalka
2014-05-03 16:24   ` Peter Zotov
2014-05-03 18:46     ` Malcolm Matalka
2014-05-03 18:52       ` Peter Zotov
2014-05-04  4:49         ` Malcolm Matalka
2014-05-04  8:55           ` Peter Zotov
2014-05-04 15:18             ` Malcolm Matalka
2014-05-04 22:21               ` Peter Zotov [this message]
2014-05-04 22:38                 ` Daniel Bünzli
2014-05-04 20:34             ` Gerd Stolpmann
2014-05-06  4:29 ` Alain Frisch
2014-05-06  4:59   ` Peter Zotov
2014-05-06  7:33     ` Alain Frisch
2014-05-06 10:42   ` Malcolm Matalka

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=fbb0c47eec050f267acabfa089db8862@whitequark.org \
    --to=whitequark@whitequark.org \
    --cc=caml-list@inria.fr \
    --cc=info@gerd-stolpmann.de \
    --cc=mmatalka@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).