From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/5082 Path: news.gmane.org!not-for-mail From: =?ISO-8859-1?Q?Pablo_Rodr=EDguez?= Newsgroups: gmane.text.pandoc Subject: HTML attributes not being stripped off Date: Sun, 11 Nov 2012 12:19:15 +0100 Message-ID: <509F89B3.4070403@web.de> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 X-Trace: ger.gmane.org 1352632761 20117 80.91.229.3 (11 Nov 2012 11:19:21 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 11 Nov 2012 11:19:21 +0000 (UTC) To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBCD6NVO5UINBBOET72CAKGQEVAG3LJA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Sun Nov 11 12:19:31 2012 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-wi0-f186.google.com ([209.85.212.186]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1TXVZa-00010K-J9 for gtp-pandoc-discuss@m.gmane.org; Sun, 11 Nov 2012 12:19:30 +0100 Original-Received: by mail-wi0-f186.google.com with SMTP id hn6sf647894wib.3 for ; Sun, 11 Nov 2012 03:19:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=x-beenthere:received-spf:message-id:date:from:user-agent :mime-version:to:subject:x-provags-id:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type; bh=gY74h1ar8dkLZs3j89CkShVoXA/noI4hSug6nAlZJK0=; b=AHn87LKDOyBlnjv+EpA2ZtY4XDrCE6XONfmQRtRXBRHgcfusEtIGuOOOSu4VINdqmj 4fV29wc9ygoUH2EJosXRb8pc9pTyY0gIAGfWiz6Lh/4rJIVMzkwkRQJ+0JOfjnKUVrts Fmhpn8Cl8cvtvmF65zAFn2LxoGahGKcLUW8i33PaJgy+uzr8l0t2HOp9jJwKfitXvoan DfDkET1ouqcjdFhhXU5Az+vc0cwDUT7JgsS14zstQEtY69eN3HW+CSMFHpLqN0DdWYjR Agsddj60lQO3CzZfuP9lFEQCtw7yH5Uscg4zEtlVszpJHlgJt/r1xADDA20trhjuLmYw ZGtg== Original-Received: by 10.180.78.229 with SMTP id e5mr973079wix.4.1352632761238; Sun, 11 Nov 2012 03:19:21 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.14.176.133 with SMTP id b5ls3691627eem.1.gmail; Sun, 11 Nov 2012 03:19:19 -0800 (PST) Original-Received: by 10.14.215.136 with SMTP id e8mr17173809eep.6.1352632759761; Sun, 11 Nov 2012 03:19:19 -0800 (PST) Original-Received: by 10.14.215.136 with SMTP id e8mr17173808eep.6.1352632759753; Sun, 11 Nov 2012 03:19:19 -0800 (PST) Original-Received: from mout.web.de (mout.web.de. [212.227.17.11]) by gmr-mx.google.com with ESMTP id z47si958317eel.0.2012.11.11.03.19.19; Sun, 11 Nov 2012 03:19:19 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of oinos-S0/GAf8tV78@public.gmane.org designates 212.227.17.11 as permitted sender) client-ip=212.227.17.11; Original-Received: from [192.168.1.33] ([83.60.73.155]) by smtp.web.de (mrweb002) with ESMTPSA (Nemesis) id 0LfAbI-1T0PnT1AVM-00pN4v for ; Sun, 11 Nov 2012 12:19:19 +0100 User-Agent: Mozilla/5.0 (X11; Linux i686; rv:15.0) Gecko/20120911 Thunderbird/15.0.1 X-Provags-ID: V02:K0:o2JCuCou1wL5iQEJzT3kgMfm4fdXszqIOdYzPfNnyda TqTssXAPWYPwusRmQpCwXs3yqjbLPphLfbGHqslAzeSP9q9Vnz x2vHdr7JtRCa7Y348WiWyOfnX2WtlLK30NRJRDqrBfPmmWoqgI 7EuGtTxKD6XWNIe0z8Dpwp18OfRirgSQqYFDbUcaXU1Ol3gk+y Wt496Tv6bP/0IMEO+QISQ== X-Original-Sender: oinos-S0/GAf8tV78@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: best guess record for domain of oinos-S0/GAf8tV78@public.gmane.org designates 212.227.17.11 as permitted sender) smtp.mail=oinos-S0/GAf8tV78@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-Subscribe: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:5082 Archived-At: Hi John, I'm using pandoc mainly to generate ePub files. I used textile first as source language, but it isn't fully implemented by pandoc and textile itself has issues with multiparagraph elements. It seems HTML is probably a much better option for pandoc as source language, although I have to forget footnotes. There is no way to have it all. But pandoc strips almost all attributes from HTML elements. A minimal sample:
  1. Well there is no other way to tag lingua latina.

  2. Or even classes or ids.

    .
Would it be possible that there is an option that doesn't strip off attributes from HTML code? BTW, when converting from HTML to another HTML code, at least id, class and lang attributes shouldn't be stripped off by default. Many thanks for your help, Pablo -- http://www.ousia.tk