From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Original-To: caml-list@sympa.inria.fr Delivered-To: caml-list@sympa.inria.fr Received: from mail3-relais-sop.national.inria.fr (mail3-relais-sop.national.inria.fr [192.134.164.104]) by sympa.inria.fr (Postfix) with ESMTPS id D95CE7FD3C for ; Sun, 22 Nov 2015 08:58:42 +0100 (CET) IronPort-PHdr: 9a23:TcWEAhSf0GPQ0UpPV94vm87bytpsv+yvbD5Q0YIujvd0So/mwa64bBaN2/xhgRfzUJnB7Loc0qyN4/2mBzNLu83e+Fk5M7VyFDY9wf0MmAIhBMPXQWbaF9XNKxIAIcJZSVV+9Gu6O0UGUOz3ZlnVv2HgpWVKQka3CwN5K6zPF5LIiIzvjqbpq8CVPVoD3mHiKZpJbzyI7izp/vEMhoVjLqtjgjDomVBvP9ps+GVzOFiIlAz97MrjtLRq8iBXpu5zv5UYCfayV+0CQLdZFDUrNXwurI2u7EGbDFjH2nxJfH8flJVPAhPyxhbwWYu55iXgv+x23yiBe8fxR7cocT+v5uFlUkm7pj0AMmsQ9n3QjIRahalboRTp8xV534nSaoieNPt4VrLUfdQdA2FGW5ACBGR6HoqgYt5XXKI6NuFCoty4/gNWoA== Authentication-Results: mail3-smtp-sop.national.inria.fr; spf=None smtp.pra=antonbachin@yahoo.com; spf=Pass smtp.mailfrom=antonbachin@yahoo.com; spf=None smtp.helo=postmaster@nm2-vm1.bullet.mail.ne1.yahoo.com Received-SPF: None (mail3-smtp-sop.national.inria.fr: no sender authenticity information available from domain of antonbachin@yahoo.com) identity=pra; client-ip=98.138.91.33; receiver=mail3-smtp-sop.national.inria.fr; envelope-from="antonbachin@yahoo.com"; x-sender="antonbachin@yahoo.com"; x-conformance=sidf_compatible Received-SPF: Pass (mail3-smtp-sop.national.inria.fr: domain of antonbachin@yahoo.com designates 98.138.91.33 as permitted sender) identity=mailfrom; client-ip=98.138.91.33; receiver=mail3-smtp-sop.national.inria.fr; envelope-from="antonbachin@yahoo.com"; x-sender="antonbachin@yahoo.com"; x-conformance=sidf_compatible; x-record-type="v=spf1" Received-SPF: None (mail3-smtp-sop.national.inria.fr: no sender authenticity information available from domain of postmaster@nm2-vm1.bullet.mail.ne1.yahoo.com) identity=helo; client-ip=98.138.91.33; receiver=mail3-smtp-sop.national.inria.fr; envelope-from="antonbachin@yahoo.com"; x-sender="postmaster@nm2-vm1.bullet.mail.ne1.yahoo.com"; x-conformance=sidf_compatible X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: A0DdAAAXdVFWliFbimJegi6CTxCtU4FPhTiIDYQNhg8CgSQ8EAEBAQEBAQEBEAEBAQEHDQkJHzCCLYIIAQEDASMdATgBBAsGBRoCJgICRRIGiCsBAwoIrwGBaoJ5hSYBIycDhHkBAQEBAQEBAQEBAQEBAQEBAQEBAQESBgwCAXKFU4IQgm6EWYMcL4EVBY4UiDyNMRSJCSCTDjiCLyMdgV9ThSsBAQE X-IPAS-Result: A0DdAAAXdVFWliFbimJegi6CTxCtU4FPhTiIDYQNhg8CgSQ8EAEBAQEBAQEBEAEBAQEHDQkJHzCCLYIIAQEDASMdATgBBAsGBRoCJgICRRIGiCsBAwoIrwGBaoJ5hSYBIycDhHkBAQEBAQEBAQEBAQEBAQEBAQEBAQESBgwCAXKFU4IQgm6EWYMcL4EVBY4UiDyNMRSJCSCTDjiCLyMdgV9ThSsBAQE X-IronPort-AV: E=Sophos;i="5.20,331,1444687200"; d="scan'208";a="154686841" Received: from nm2-vm1.bullet.mail.ne1.yahoo.com ([98.138.91.33]) by mail3-smtp-sop.national.inria.fr with ESMTP/TLS/RC4-SHA; 22 Nov 2015 08:58:41 +0100 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s2048; t=1448179119; bh=Hc+/bPajp5o2hSme4nAH+myQSkklsqwxJw1DUWrQ8Us=; h=Subject:From:In-Reply-To:Date:Cc:References:To:From:Subject; b=jP19j5/OpjfpooOPaF/FUppC5EcvdmXZzu2/H9uGp9ofdpbFfTx7Ze1GkrEz78VeWPNo5yjaE49zOkZIQswSZMFgvqRR+5Arv7HjJVbF75qRMw4KxtwdXDFo30aRmHUVoiszdO6AZo09TwBpYU5wj1dp8OWHnIo2YvoD4CpxrHXUtIgXicZaa8vpQGSy/a9+gwg5eAekKw818JzdqRLBdrvnKC6mYm8NBzF+APzFPsjeK+f0Z/ERYcFo8Fmai4tSd0VaXk9RAqKVD6pKS7Cz6az4Qe5yhqleNJH+BHqDaaCVe6BZg/Fxx+4siiHv3DPMmRlQxG0DVo6nnDJnriwL8A== Received: from [98.138.226.178] by nm2.bullet.mail.ne1.yahoo.com with NNFMP; 22 Nov 2015 07:58:39 -0000 Received: from [98.138.226.125] by tm13.bullet.mail.ne1.yahoo.com with NNFMP; 22 Nov 2015 07:58:39 -0000 Received: from [127.0.0.1] by smtp204.mail.ne1.yahoo.com with NNFMP; 22 Nov 2015 07:58:39 -0000 X-Yahoo-Newman-Id: 46344.33775.bm@smtp204.mail.ne1.yahoo.com X-Yahoo-Newman-Property: ymail-3 X-YMail-OSG: dA0q6z4VM1kGX5RCQM7wqgBS9mY3hBlwsPldkpOn.1I8K5I xaO1FrztCz4SEw84LXcLPRRBOmZoPkqSjUAMLh507QK9nDqgdDxZ6jV7P0tH hFD5njU5kMPFXTGdHrY4MwmSHMj0uNdb9_kvud88s_uAXD6uA3O1e4a4cDuk zDZy57Yr7IMDWihebMZHvx8u0K7pZooGK3ox6cHHnYgEGQZfsiBcGxLBbW9Y X3dgebqsSuabczE76zH.xh_AozVA1cvNLIjob1ex4Id1jSxbMQprQefTP7_c Hdo5oQ2LhyrVacKzYfdznZZgljuGiH6fCPu39YLycX4G1eaWxFg0TxQDWT1r cj.lJRTWlnlyOXkM_gYE75bA_9h0vwA_gzLt.2LdO6o5YoVT3IHmLcW5KTN7 z4x1PVlAMazOwNccjjyBI17LhZlmWZp4_YmadaR9KN7rpGvk80cSkp7_NWan 5rE5WzayQkwMoYDUvzFM9ekjtgyDaUsdtKuRVg8BGT07QtN0yONtwxO26Qjb lKShGK3oopzplyMhOx4ZQoFO.QDzALNo- X-Yahoo-SMTP: ddtAESaswBBsjSthz_dVP91gr2NDfymF Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2070.6\)) From: Anton Bachin In-Reply-To: <564AF405.10400@cea.fr> Date: Sun, 22 Nov 2015 01:58:37 -0600 Cc: caml-list@inria.fr X-Mao-Original-Outgoing-Id: 469871917.329135-f6fbf74af14edfce78023edb77d5339e Content-Transfer-Encoding: quoted-printable Message-Id: <1A569326-8749-4332-A88C-719165728F20@yahoo.com> References: <4824377F-4045-4D47-9BAB-E06B0C939988@yahoo.com> <564AF405.10400@cea.fr> To: =?utf-8?Q?Fran=C3=A7ois_Bobot?= X-Mailer: Apple Mail (2.2070.6) Subject: Re: [Caml-list] [ANN] Lambda Soup - HTML scraping and rewriting with CSS selectors Hello Fran=C3=A7ois, First, I have to apologize. It seems my mail client thought all of Caml-lis= t is spam. I have been disappointed that the list is so quiet. Turns out I shoul= d be disappointed at my mail client instead. > Does the types are as powerful as the one in tyxml? Is it possible to have > this kind of selectors on top of tyxml? The types are not as powerful as the ones in tyxml, in the sense of being precise. Lambda Soup only distinguishes between "definitely elements", "definitely documents", and "anything, including an element, or document, o= r any other node". It doesn't know about element types or any constraints on their composition. It does, however, seem to be possible to have the selectors on top of tyxml= . I am not very familiar with tyxml, but I do see types Xml.elt and Xml.econten= t, and value content. It may be possible to either use those directly as the internal DOM representation of Lambda Soup, and build traversals over them,= and it is almost certainly possible to convert them to Lambda Soup's current internal representation, the same way Lambda Soup currently converts Nethtm= l=E2=80=99s parse trees. If there is interest in this, perhaps with some study Lambda Soup can be modified to use tyxml =E2=80=93 perhaps functorized. Regards, Anton