From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from mail2-relais-roc.national.inria.fr (mail2-relais-roc.national.inria.fr [192.134.164.83]) by yquem.inria.fr (Postfix) with ESMTP id 98172BC37 for ; Sat, 14 Nov 2009 19:08:54 +0100 (CET) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ArkAAAqB/kpDww+RmWdsb2JhbACEco0hgQaJBQEBAQEBCAsKBxOIM51VKIE5hWKIawEGBAGBK4I4VASBbQ X-IronPort-AV: E=Sophos;i="4.44,742,1249250400"; d="scan'208";a="36745134" Received: from web111503.mail.gq1.yahoo.com ([67.195.15.145]) by mail2-smtp-roc.national.inria.fr with SMTP; 14 Nov 2009 19:08:53 +0100 Received: (qmail 12234 invoked by uid 60001); 14 Nov 2009 18:08:52 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1258222132; bh=fTsTITCif3jTHj322pHpGgg2HMtqzpPJ781r++hZ7wc=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=Q554W1PCaKzdsm6gfoq12I3wJUIjp0leT99gE9OrbFL/DEJUX5+YsFrdN6bmnkyS44GVJdvbD8nGcs6hSvOIeUd5wE9zsEBtu6HhHToo+rm4RmYmLbBjDB3puY9ED6IoMACtGR2v4BqQvYF6ZGwWvRnE+xoV8CaDhUiDK/Si9aY= DomainKey-Signature:a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=lUnb3AdqUIwyzqzKShjgFVzvHChzmkqd3W+U070Yuqj8ipcKwUV1hsiBAjopUekkn80ESFAP/tLlxNXe7utDe9xqGqZarSsdKxsTpSvZDNf+AwEO/ugD+4UASzjfgagEuT/y2t0kangrf0FcZdtKcTYlrfMULBZsSTq/8iaka9Y=; Message-ID: <411864.11130.qm@web111503.mail.gq1.yahoo.com> X-YMail-OSG: 5pxy4AgVM1kUBAF.mVRpgAVEnePg7HZ1dLS6fcGgTj13wR8Ml2hEP9hPilQ5ip60NcHWpXfjQBfX6pXHCQ1YmfHFRG0c9EPKoHfVgzUWHZSguKiMj2_ttlUgEuZwgaqFtzucAXJGTl0EALN.i4lRafaZyew3Wp3mRMN3v2kHL_ds5w739_EdW38u0ferwzuXRsuI9jTXCwZnq8KknFHZ.R4ww5ftLp5_WR7wDVVQb.hxVCzApWN3kt3mQmr4N7sNB35ZIasi0s_QgH3.0LQgOphbKD2n2hYmqsYE.it0xHl3DHH4y8C2R5Lm1GmrU4xQpyIMoHs91ywN8sxFIY0ipuy6_lC3gQcMA3MejqBp6hV4uvtMPmgHQkzG2GKDnR_i4MlMHvnrf4OncD11goSWo5CvC98mTpnhvexLVi8CAowC7vZmtIKAwmnlLZ0dz.KvyyGUmjBOdQxW4chEPV4yfmmofoP3d56mJMNILd9SsYExv42uNArkFdFJQBgcB9s- Received: from [213.205.71.58] by web111503.mail.gq1.yahoo.com via HTTP; Sat, 14 Nov 2009 10:08:52 PST X-Mailer: YahooMailClassic/8.1.6 YahooMailWebService/0.7.361.4 Date: Sat, 14 Nov 2009 10:08:52 -0800 (PST) From: Dario Teixeira Subject: Re: [Caml-list] The lexer hack To: caml-list@yquem.inria.fr, Micha In-Reply-To: <200911141720.13847.micha-1@fantasymail.de> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam: no; 0.00; lexer:01 lexer:01 tokens:01 parser:01 parser:01 viewcvs:01 lib:01 cheers:01 caml-list:01 grammar:02 implemented:02 root:04 hack:04 meantime:05 simplify:05 Hi,=0A=0A> if the lexer cannot decide it on the tokens seen, a packrat=0A> = parser (like Aurochs) may be a better choice, since in a PEG=0A> there is n= o seperate lexer, it's all one grammar, so you don't=0A> have this problem.= =0A=0ABut does Aurochs also handle UTF8 streams?=0A=0AIn the meantime I've = implemented the parser using Ulex/Menhir=0Awith the "dummy action" trick I = mentioned before. It allowed=0Ame to simplify the tokenizer tremendously, = though it's still=0Apresent:=0A=0Ahttps://forge.ocamlcore.org/plugins/scmsv= n/viewcvs.php/trunk/lambdoc/src/lib/lambdoc_read_lambtex/?root=3Dlambdoc=0A= =0ACheers,=0ADario Teixeira=0A=0A=0A=0A