caml-list - the Caml user's mailing list
 help / color / mirror / Atom feed
* Serialisation of PXP DTDs
@ 2008-10-22 20:11 Dario Teixeira
  2008-10-22 23:05 ` Sylvain Le Gall
  2008-10-23 14:55 ` [Caml-list] " Gerd Stolpmann
  0 siblings, 2 replies; 24+ messages in thread
From: Dario Teixeira @ 2008-10-22 20:11 UTC (permalink / raw)
  To: caml-list

Hi,

I am using PXP to parse the MathML2 DTD.  This is a fairly large DTD,
which even on a fast machine takes several seconds to parse.  I am
therefore looking at ways to serialise a parsed DTD, in a such a way
that it can be reused by other processes.

Does PXP already offer primitives for (un)serialising DTDs?  (I couldn't
find any).  Note that using Marshal is out of the question, because DTDs
are stored as objects, and we all know that objects cannot be serialised
across process boundaries.  But are there alternative solutions I'm
overlooking?

On a more general but related note, I think we should start an OSP
discussion about standardising serialisation methods.  The rationale
should be obvious.  Myself, I am partial to Sexplib, since it is
reasonably fast, very simple to use, human-readable, and future-proof.
I reckon that bin-prot could also be considered, as long as at some
point the binary format is "set in stone", or at least deserialisers
are always backwards compatible.  Any other opinions?

Thanks for your time!
Cheers,
Dario Teixeira






^ permalink raw reply	[flat|nested] 24+ messages in thread
* Re: [Caml-list] Re: Serialisation of PXP DTDs
@ 2008-10-23 18:41 Dario Teixeira
  2008-10-23 18:58 ` Markus Mottl
  0 siblings, 1 reply; 24+ messages in thread
From: Dario Teixeira @ 2008-10-23 18:41 UTC (permalink / raw)
  To: Markus Mottl; +Cc: caml-list

Hi,

> Bin-prot is settled in its design.  We heavily rely on it here
> at Jane Street and store TBs of data in it so there is no way
> it's going to change.  I would say it is future-proof.

Thanks for the clarification, Markus, and I will take a closer
look at bin-prot.  One question, however: is it possible to use
*both* the sexplib and bin-prot syntax extensions on the same
structure?  That way convenience for the developer is preserved,
and users can choose which side of the performance vs readability
trade-off they prefer.

Cheers,
Dario






^ permalink raw reply	[flat|nested] 24+ messages in thread

end of thread, other threads:[~2008-10-26 19:47 UTC | newest]

Thread overview: 24+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2008-10-22 20:11 Serialisation of PXP DTDs Dario Teixeira
2008-10-22 23:05 ` Sylvain Le Gall
2008-10-23 15:34   ` [Caml-list] " Dario Teixeira
2008-10-23 16:37     ` Stefano Zacchiroli
2008-10-23 16:53       ` Markus Mottl
2008-10-23 19:26       ` Dario Teixeira
2008-10-23 21:05         ` Mauricio Fernandez
2008-10-23 22:18           ` Gerd Stolpmann
2008-10-23 22:50             ` Mauricio Fernandez
2008-10-23 22:21           ` Dario Teixeira
2008-10-23 23:36             ` Mauricio Fernandez
2008-10-24  9:11               ` Mikkel Fahnøe Jørgensen
2008-10-24 14:03                 ` Markus Mottl
2008-10-25 18:58                   ` Mauricio Fernandez
2008-10-26 18:15                     ` Markus Mottl
2008-10-26 19:47                       ` Mauricio Fernandez
2008-10-24 21:39                 ` Mauricio Fernandez
2008-10-24 22:27                   ` Mikkel Fahnøe Jørgensen
2008-10-25 19:19                     ` Mauricio Fernandez
2008-10-23 16:46     ` Markus Mottl
2008-10-23 14:55 ` [Caml-list] " Gerd Stolpmann
2008-10-23 18:41 [Caml-list] " Dario Teixeira
2008-10-23 18:58 ` Markus Mottl
2008-10-23 20:04   ` Dario Teixeira

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).