From mboxrd@z Thu Jan 1 00:00:00 1970 Received: (from majordomo@localhost) by pauillac.inria.fr (8.7.6/8.7.3) id VAA08503; Sat, 20 Sep 2003 21:45:58 +0200 (MET DST) Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id VAA25770 for ; Sat, 20 Sep 2003 21:45:57 +0200 (MET DST) Received: from nef.ens.fr (nef.ens.fr [129.199.96.32]) by concorde.inria.fr (8.11.1/8.11.1) with ESMTP id h8KJjuH07372 for ; Sat, 20 Sep 2003 21:45:56 +0200 (MET DST) Received: from clipper.ens.fr (clipper-gw.ens.fr [129.199.1.22]) by nef.ens.fr (8.12.10/1.01.28121999) with ESMTP id h8KJjuv1077259 for ; Sat, 20 Sep 2003 21:45:56 +0200 (CEST) Received: from localhost (frisch@localhost) by clipper.ens.fr (8.12.3/jb-1.1) id h8KJjrLC016670 for ; Sat, 20 Sep 2003 21:45:53 +0200 (MET DST) X-Authentication-Warning: clipper.ens.fr: frisch owned process doing -bs Date: Sat, 20 Sep 2003 21:45:53 +0200 (MET DST) From: Alain.Frisch@ens.fr X-X-Sender: frisch@clipper.ens.fr Reply-To: Alain.Frisch@ens.fr To: Caml list Subject: [Caml-list] ulex: lexer generator for Unicode Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Scanned: by amavisd-milter (http://amavis.org/) X-Loop: caml-list@inria.fr X-Spam: no; 0.00; alain:01 frisch:01 camlp:01 cduce:01 0.1.:01 alain:01 unicode:01 unicode:01 ocaml:01 caml:01 tar:01 syntax:02 lexbuf:03 tarball:05 embedded:05 Sender: owner-caml-list@pauillac.inria.fr Precedence: bulk Hello list, I started working on a lexer generator for Unicode. The architecture is similar to ocamllex, except that ulex lexers use a new kind of lexbuf that holds Unicode code points. It is possible to inject in these lexbufs any Unicode stream (for the moment, adapters are provided only for Latin1/utf8 streams/strings/channels, but you can also pass Unicode code points as integers). Lexer specifications are embedded in OCaml code, and parsed with a Camlp4 syntax extension. As several people showed interest in Unicode support for Caml, I thought I could make a preliminary release to collect feedback on the design of ulex. Here is the tarball: http://www.cduce.org/download/ulex-0.1.tar.gz -- Alain ------------------- To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/ Beginner's list: http://groups.yahoo.com/group/ocaml_beginners