caml-list - the Caml user's mailing list
* Re: [Caml-list] Re: Serialisation of PXP DTDs
@ 2008-10-23 18:41 Dario Teixeira
  2008-10-23 18:58 ` Markus Mottl
  0 siblings, 1 reply; 21+ messages in thread
From: Dario Teixeira @ 2008-10-23 18:41 UTC (permalink / raw)
  To: Markus Mottl; +Cc: caml-list

Hi,

> Bin-prot is settled in its design.  We heavily rely on it here
> at Jane Street and store TBs of data in it so there is no way
> it's going to change.  I would say it is future-proof.

Thanks for the clarification, Markus, and I will take a closer
look at bin-prot.  One question, however: is it possible to use
*both* the sexplib and bin-prot syntax extensions on the same
structure?  That way convenience for the developer is preserved,
and users can choose which side of the performance vs readability
trade-off they prefer.

Cheers,
Dario

* Re: Serialisation of PXP DTDs
@ 2008-10-22 23:05 Sylvain Le Gall
  2008-10-23 15:34 ` [Caml-list] " Dario Teixeira
  0 siblings, 1 reply; 21+ messages in thread
From: Sylvain Le Gall @ 2008-10-22 23:05 UTC (permalink / raw)
  To: caml-list

On 22-10-2008, Dario Teixeira <darioteixeira@yahoo.com> wrote:
> Hi,
>
> I am using PXP to parse the MathML2 DTD.  This is a fairly large DTD,
> which even on a fast machine takes several seconds to parse.  I am
> therefore looking at ways to serialise a parsed DTD, in such a way
> that it can be reused by other processes.
>
> Does PXP already offer primitives for (un)serialising DTDs?  (I couldn't
> find any).  Note that using Marshal is out of the question, because DTDs
> are stored as objects, and we all know that objects cannot be serialised
> across process boundaries.  But are there alternative solutions I'm
> overlooking?
>
> On a more general but related note, I think we should start an OSP
> discussion about standardising serialisation methods.  The rationale
> should be obvious.  Myself, I am partial to Sexplib, since it is
> reasonably fast, very simple to use, human-readable, and future-proof.
> I reckon that bin-prot could also be considered, as long as at some
> point the binary format is "set in stone", or at least deserialisers
> are always backwards compatible.  Any other opinions?
>

You already seem to have some ideas. Before starting any discussion
on this topic, the best thing is to implement and benchmark the
different solutions (at least partially).

Sexplib, bin-prot, JSON and Marshal need to be compared on a real example.

You seem to need this for a particular task. Could you implement the
different approaches on your particular example and give us some
figures on performance, ease of use and ease of implementation?
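
A minimal sketch of what such a comparison could look like, using only
the standard library. Everything in it is an assumption made for
illustration: the element_decl record is a hypothetical stand-in for a
parsed DTD (it is not PXP's representation), the line-based text format
is a toy that merely plays the role of a readable serialiser, and the
data size is arbitrary. Sexplib, bin-prot or a JSON library would be
plugged in as further candidates in the same way.

```ocaml
(* Hypothetical stand-in for a parsed DTD: a list of element declarations. *)
type element_decl = { name : string; attributes : (string * string) list }

(* Synthetic data set; the contents and size are arbitrary. *)
let sample n =
  List.init n (fun i ->
    { name = Printf.sprintf "elem%d" i;
      attributes = [ ("id", "CDATA"); ("class", "NMTOKENS") ] })

(* Candidate 1: Marshal from the standard library. *)
let marshal_encode (d : element_decl list) : string = Marshal.to_string d []
let marshal_decode (s : string) : element_decl list = Marshal.from_string s 0

(* Candidate 2: a toy text format, one declaration per line, fields
   separated by tabs, attributes written as key=value.  It assumes that
   names and values contain no tab, newline or '=' characters. *)
let text_encode (d : element_decl list) : string =
  let line e =
    String.concat "\t"
      (e.name :: List.map (fun (k, v) -> k ^ "=" ^ v) e.attributes) in
  String.concat "\n" (List.map line d)

let text_decode (s : string) : element_decl list =
  let attr field =
    let i = String.index field '=' in
    (String.sub field 0 i,
     String.sub field (i + 1) (String.length field - i - 1)) in
  let decl line =
    match String.split_on_char '\t' line with
    | [] -> invalid_arg "text_decode: empty line"
    | name :: fields -> { name; attributes = List.map attr fields } in
  if s = "" then [] else List.map decl (String.split_on_char '\n' s)

(* Run f on x, print the CPU time it took, and return its result. *)
let time label f x =
  let t0 = Sys.time () in
  let y = f x in
  Printf.printf "%-16s %.4f s\n" label (Sys.time () -. t0);
  y

let () =
  let data = sample 50_000 in
  let m = time "marshal encode" marshal_encode data in
  let data_m = time "marshal decode" marshal_decode m in
  let t = time "text encode" text_encode data in
  let data_t = time "text decode" text_decode t in
  (* Both candidates must round-trip the data unchanged. *)
  assert (data_m = data && data_t = data);
  Printf.printf "sizes: marshal %d bytes, text %d bytes\n"
    (String.length m) (String.length t)
```

Timings from a single run like this are only indicative; a serious
comparison would repeat each measurement and use the real data.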

Without these numbers, I think an OSP discussion is pointless.

(But with these numbers, at least on a small example, if your use case
is not easy, I think an OSP discussion will be very interesting.)

Regards,
Sylvain Le Gall



end of thread, other threads:[~2008-10-26 19:47 UTC | newest]

Thread overview: 21+ messages
2008-10-23 18:41 [Caml-list] Re: Serialisation of PXP DTDs Dario Teixeira
2008-10-23 18:58 ` Markus Mottl
2008-10-23 20:04   ` Dario Teixeira
  -- strict thread matches above, loose matches on Subject: below --
2008-10-22 23:05 Sylvain Le Gall
2008-10-23 15:34 ` [Caml-list] " Dario Teixeira
2008-10-23 16:37   ` Stefano Zacchiroli
2008-10-23 16:53     ` Markus Mottl
2008-10-23 19:26     ` Dario Teixeira
2008-10-23 21:05       ` Mauricio Fernandez
2008-10-23 22:18         ` Gerd Stolpmann
2008-10-23 22:50           ` Mauricio Fernandez
2008-10-23 22:21         ` Dario Teixeira
2008-10-23 23:36           ` Mauricio Fernandez
2008-10-24  9:11             ` Mikkel Fahnøe Jørgensen
2008-10-24 14:03               ` Markus Mottl
2008-10-25 18:58                 ` Mauricio Fernandez
2008-10-26 18:15                   ` Markus Mottl
2008-10-26 19:47                     ` Mauricio Fernandez
2008-10-24 21:39               ` Mauricio Fernandez
2008-10-24 22:27                 ` Mikkel Fahnøe Jørgensen
2008-10-25 19:19                   ` Mauricio Fernandez
2008-10-23 16:46   ` Markus Mottl
