From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from mail3-relais-sop.national.inria.fr (mail3-relais-sop.national.inria.fr [192.134.164.104]) by yquem.inria.fr (Postfix) with ESMTP id 7D03EBC57 for ; Mon, 12 Apr 2010 16:36:13 +0200 (CEST) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AsoBACfNwkuCcYARmWdsb2JhbACPX4toAQECAQgLCgcRIrtHhQwEg0I X-IronPort-AV: E=Sophos;i="4.52,190,1270418400"; d="scan'208";a="48356556" Received: from coriana6.cis.mcmaster.ca ([130.113.128.17]) by mail3-smtp-sop.national.inria.fr with ESMTP; 12 Apr 2010 16:35:54 +0200 Received: from Gorash7.UTS.McMaster.CA (Gorash7.UTS.mcmaster.ca [130.113.196.61]) by coriana6.cis.mcmaster.ca (8.13.7/8.13.7) with ESMTP id o3CEZkbU026895 for ; Mon, 12 Apr 2010 10:35:52 -0400 (EDT) Received: from cgpsrv2.cis.mcmaster.ca (univmail.CIS.McMaster.CA [130.113.64.46]) by Gorash7.UTS.McMaster.CA (8.13.7/8.13.7) with ESMTP id o3CEZbql019798 for ; Mon, 12 Apr 2010 10:35:37 -0400 Received: from [129.97.169.240] (account carette@univmail.cis.mcmaster.ca [129.97.169.240] verified) by cgpsrv2.cis.mcmaster.ca (CommuniGate Pro SMTP 5.2.12) with ESMTPSA id 298007396 for caml-list@inria.fr; Mon, 12 Apr 2010 10:35:37 -0400 Message-ID: <4BC32FAC.8060703@mcmaster.ca> Date: Mon, 12 Apr 2010 10:35:24 -0400 From: Jacques Carette User-Agent: Thunderbird 2.0.0.24 (Windows/20100228) MIME-Version: 1.0 To: caml-list@inria.fr Subject: Unicode solutions Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-PMX-Version-Mac: 5.5.9.395186, Antispam-Engine: 2.7.2.376379, Antispam-Data: 2010.4.12.142415 X-PerlMx-Spam: Gauge=IIIIIIII, Probability=8%, Report=' BODY_SIZE_1000_LESS 0, BODY_SIZE_2000_LESS 0, BODY_SIZE_5000_LESS 0, BODY_SIZE_600_699 0, BODY_SIZE_7000_LESS 0, TO_NO_NAME 0, __CT 0, __CTE 0, __CT_TEXT_PLAIN 0, __HAS_MSGID 0, __MIME_TEXT_ONLY 0, __MIME_VERSION 0, __MOZILLA_MSGID 0, __SANE_MSGID 0, __USER_AGENT 0' X-Spam-Flag: NO X-Spam: no; 0.00; camomile:01 lex:01 lex:01 modules:02 caml:02 parse:02 parse:02 mcmaster:02 back-end:02 seems:03 seems:03 module:03 jacques:03 jacques:03 library:03 I need to be able to lex/parse and pretty-print unicode with caml. I am aware of several 'solutions' for each part, but I would choose to use whatever library seamlessly allows me to both parse and build pretty-printers for unicode-based input. For example, it is unclear to me that module Print in Batteries integrates well with UTF8.t - but that could just be a documentation issue. Camomile seems to be good for the representation part, but does not seem to offer lex/parse and pretty-print modules. It seems like ulex + dypgen works for the front-end part. Back-end? [It's not clear to me that ulex + menhir works] Jacques