From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/32922 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: John MacFarlane Newsgroups: gmane.text.pandoc Subject: Re: Getting pandoc to convert Github Markdown documents with HTML tags to PDF Date: Wed, 5 Jul 2023 11:41:38 -0700 Message-ID: References: <529BC174-779A-4D98-BCC9-F59AEAAC2B9D@gmail.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3696.120.41.1.3\)) Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="4997"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDW7ZIEHTIIBBZPSS2SQMGQE6LDYH3Q-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Wed Jul 05 20:41:45 2023 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-io1-f55.google.com ([209.85.166.55]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1qH7Rh-00019b-LU for gtp-pandoc-discuss@m.gmane-mx.org; Wed, 05 Jul 2023 20:41:45 +0200 Original-Received: by mail-io1-f55.google.com with SMTP id ca18e2360f4ac-7775a282e25sf286003439f.0 for ; Wed, 05 Jul 2023 11:41:45 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1688582504; cv=pass; d=google.com; s=arc-20160816; b=N4p0AvsXPxZZJV4MyFvGh3tBj1AtUsE0zHliOF/y3gojC43I7xg6icODfToEGKZ1IF WD1T1pDYKOPy5eY6t+3d8+7qhtOGFHGxioLu/ZmJbW6bzsLbGeD0Xhx+VJRUKTr8sYOR 2dNzYx2zsjeOAfdnGao8Tr50xyppfsg8HJOj2yQc0LY86a4yLwL0JRHReMfC56Ytcn4p 5U7iuFwWm10PQ8vXjPtLdv8Z5maUklJaXClMvYUEZRRFef1aJzNCnATcXslIOOUdJtxW 3aqZ2If56mZ/7NshYTnobSzRzFkmD9OpgE/Lj2m+0ylqFsUzNefgs5nqCW9ciE/sql2r U1YQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:message-id:in-reply-to:to :references:date:subject:mime-version:content-transfer-encoding:from :sender:dkim-signature:dkim-signature; bh=+obYXAxOyoojnnUAUo7IInhEJKPHyJIWVd+vcK0LTsM=; fh=A7KGSvm30SBY9b2v+N53j+lkchMNZtkZbRzF4WqsV70=; b=XGoD1Xc8d377vmdxzJnsFVfpvYz8jLyyjOEdBcfgCEbEeTEO1pB7qKd1KAT0lezBso Z0tSgT1VtStMuUyAx2C5/6cvtQ+wN+Ak86UaAiJzzgkMSZNTe/uh1pMgvTN2T3tWU8Bn VNHksfO/RMqTmzgPXI566DpHyoI4Yn7sfTQt5RSYXXcKyVFqJdV3mHLgada7HXmoy2tJ FOy3I9+t8Ej7lS1uOrPeEO+q3zZeZDWXWoXJqoPh5UfCJlbu4CbT2KliX0Dkfwp9lgMT qcA4sqczYpB6moAghCt2S83AUE3+aQtjPCETYOLB4MWOhXPkmKQMV5a91jp08s56axhg VfgQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20221208 header.b=S+Ywycae; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::533 as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20221208; t=1688582504; x=1691174504; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:sender:from:to:cc:subject:date :message-id:reply-to; bh=+obYXAxOyoojnnUAUo7IInhEJKPHyJIWVd+vcK0LTsM=; b=Jl/offdu9r1i9B8qTt9cbuNSwJK+9hEQWFqC8gyR8JFklCgaxYUkffbhxCnN0sfs07 7JT+NeL2XqaAChp+89hUuY4JSTFG2hw1mk7jI8C8EebvJhYVQmSlHnQBqTs8xSdPwvc6 rWLyIPeJ+xyrhpywbT4kQKmcjJ+Kww+ojD8YIpQ+AzCGQ2ZPTtglPsvX/Ade7heO/adL f3+hjVA3gxSqbrv+VXs782P2krktmivpV5B0NXeTp2ATpbt2myE8+CD8m0fiOeczNYFh j940tzBDfZwoRxImd+3lfq9tCAUuAF9AgNzBL DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20221208; t=1688582504; x=1691174504; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:from:to:cc:subject:date:message-id :reply-to; bh=+obYXAxOyoojnnUAUo7IInhEJKPHyJIWVd+vcK0LTsM=; b=RXyRS1K8nnd8ss9735xMzCGAOoQi1+SJfwCql1Hc8Qd4o45/0nEXoG73MsOjuJmu6j CiVHdvExZmMpWkdYP6UOYhS2vJg9vmYI2z7DDaomKXvFirhTlteCOcXAMYYDS09U8E6D E2b5L2pgfySONAeSA6KWC10pIaKQZdnFIOa0J9vg0qpgTZ+g0Nl4BlyqDUVlkA2UKVc0 9ioCDhTQ73RRkSsLX7i3TyVpp2oxgo9y7gSayZUGsrgvv7LsLCymoUV6gsOEVa63JrsX ForGf+8PksXvmRMiTWhYeND21zlhXGQCw2QlcwbIMmeYossTIKY X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1688582504; x=1691174504; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:x-beenthere:x-gm-message-state :sender:from:to:cc:subject:date:message-id:reply-to; bh=+obYXAxOyoojnnUAUo7IInhEJKPHyJIWVd+vcK0LTsM=; b=dfjin0UPQ4oXKGNnPhenm1gNMUAi7rLGw+ZpIbNoDwwVKQ05qt/7hObcMEPrcU3Cpr MpE7JSI1RhPImdaGKkP/WZZqqPwYONwYEslHxx/mstctS6moXXall2IeJOaDtSYFVQp2 76nvu3RNR97x7Fqo//aZilkoGWjQGnc8NwkCIjt3S/CagBWBXUIFlUDO5LmRgmL91t8d LdAODpWjgo+0vvWZQphcWuTmuwdegdZjJ/mTAIi2mT5dcKEwbcMOldBTcN Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: ABy/qLaAsyYd8t8VPU3gRxtSRt0c4BTHOcrmn6FC6L/4rKHPmpD0w7IZ Hk039QhC+KaSfGnpkTqErvA= X-Google-Smtp-Source: APBJJlGO2Avoj6Hh0E98ZhXjTZRH+jZzaGOxU0/VgpouRPwTaqdnZ071Z6S0I4NeoZWzG+a0fWJg5g== X-Received: by 2002:a92:c608:0:b0:345:de70:ae61 with SMTP id p8-20020a92c608000000b00345de70ae61mr17776213ilm.25.1688582504380; Wed, 05 Jul 2023 11:41:44 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a92:c642:0:b0:342:ff1:3d84 with SMTP id 2-20020a92c642000000b003420ff13d84ls2119800ill.0.-pod-prod-04-us; Wed, 05 Jul 2023 11:41:41 -0700 (PDT) X-Received: by 2002:a6b:f006:0:b0:783:37b2:4b37 with SMTP id w6-20020a6bf006000000b0078337b24b37mr4276ioc.3.1688582501433; Wed, 05 Jul 2023 11:41:41 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1688582501; cv=none; d=google.com; s=arc-20160816; b=gfRWj4NXuZlcAXVatnbFmy15oTmlF+88eNCVDXLGyQMYE1jCJVcCBJ2Ph2cIf9fpMp Cxs/DIxZiFg5y2BBd8XX42axq7wgb2CyXZVub5qVobYu7IiD0A16hH49ue2FJ4mCjwGw +cBIshvG0GWg11utXX9CqnPDFkuImOS6JALdEViIcecHJmu/bVC64wQJguI/FifasODk humlh5antsO7idWVNpLpa0zTEyLT0/v98VBXtOw41N9Z1jazyyHO7gZFYlxc0yj1t/E1 sFZWnW9GpOK3mkVy2/Ep5IsCt0XCvt4Ra/d40oQ9MpwafCzRhxZbdIoZqWCf3wURM2PM A64w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=message-id:in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:dkim-signature; bh=htU3lyIcx/gkfWmXmJUYPxI8WcaCXAn4Q4Pp1HyS3Vc=; fh=m01AhCNo7xUywHldCVYouaJypLlN7JgtNYbImzBf4N4=; b=rHd9Me34m9d6RK4uFx/SPxJnTLmF+MuR0Ti3DSozaN0Zhp1tQjwnhXZSiqP11E1P9z o48gNa0yzrsSAz6sTjPezwcmYn8e4UenRbJd6kjqtpGvk1wmARGWi6ZMC1xPrwExGc8z nwieCLehaV2ccI0bdVG0ZqVndKfOA3vjvf2B1R7d5W2HH8o7u5tDmCgDAZGfo/jh+Sb0 TRb9LNQIIccWPNxMBUA3X6xMsKAx7ro08JHFeVADiV/agLnJlA0NWvQroKYFPme72HBy hll55ElMRHf+aLPg3hKXf0UHzJNTAWHCKZHetVUnitGGDKcHJyZxMIKqpkcITdULKuyP em0A== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20221208 header.b=S+Ywycae; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::533 as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Original-Received: from mail-pg1-x533.google.com (mail-pg1-x533.google.com. [2607:f8b0:4864:20::533]) by gmr-mx.google.com with ESMTPS id bm10-20020a05663842ca00b0042af5dea7cbsi1083523jab.3.2023.07.05.11.41.41 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 05 Jul 2023 11:41:41 -0700 (PDT) Received-SPF: pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::533 as permitted sender) client-ip=2607:f8b0:4864:20::533; Original-Received: by mail-pg1-x533.google.com with SMTP id 41be03b00d2f7-51452556acdso3966835a12.2 for ; Wed, 05 Jul 2023 11:41:41 -0700 (PDT) X-Received: by 2002:a05:6a20:841a:b0:12f:bca:c2c6 with SMTP id c26-20020a056a20841a00b0012f0bcac2c6mr5725552pzd.35.1688582500381; Wed, 05 Jul 2023 11:41:40 -0700 (PDT) Original-Received: from smtpclient.apple ([2601:644:4701:23f0:4c6f:8f1d:18b:8c65]) by smtp.gmail.com with ESMTPSA id o11-20020a170902bccb00b001a0448731c2sm19296127pls.47.2023.07.05.11.41.39 for (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Wed, 05 Jul 2023 11:41:39 -0700 (PDT) In-Reply-To: X-Mailer: Apple Mail (2.3696.120.41.1.3) X-Original-Sender: fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20221208 header.b=S+Ywycae; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::533 as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:32922 Archived-At: — and ℋ will be parsed as unicode characters and these will be p= assed through to the HTML. You can check the intermediate HTML file (again it will be printed with --v= erbose) to confirm this. It may be that the program that is being invoked to go from HTML -> PDF (wk= htmltopdf ?) doesn't handle these characters properly. You could try adding the `--ascii` option which will force entities to be u= sed. > On Jul 4, 2023, at 4:07 PM, Luveh Keraph <1.41421-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote: >=20 > Thanks. I invoked pandoc -f gfm MyDoc. -o MyDoc.pdf and in the resulting = PDF document the subscripts are still ignored. When running it with --verbo= se in the resulting output I saw numerous instances of=20 >=20 > [INFO] Not rendering RawInline (Format "html") "" > [INFO] Not rendering RawInline (Format "html") "" >=20 > However, when I added -t html5 to the invocation the diagnostics above di= sappear, and the subscripts are indeed present in the converted PDF file. T= hanks for the tip - it has indeed improved things. Now it is still the case= that things like — or ℋ are ignored by pandoc. Any suggestions = on how to get pandoc to process them?=20 >=20 > I am using the following: >=20 > pandoc 3.1.4 > Features: +server +lua > Scripting engine: Lua 5.4 >=20 >=20 >=20 >=20 >=20 >=20 > On Tue, Jul 4, 2023 at 3:50=E2=80=AFPM John MacFarlane wrote: > HTML tags should be passed through to HTML formats. >=20 > Have you looked at the intermediate HTML produced? You can use --verbose= to see it. >=20 > This seems to work fine: >=20 > % pandoc -t html5 > _A__m_ >

Am

>=20 > PS. You probably want to use -f gfm if you're targeting GitHub Markdown. >=20 > Pandoc version? >=20 >=20 >=20 > > On Jul 3, >=20 > > 2023, at 3:41 PM, Luveh Keraph <1.41421-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote: > >=20 > > I have a Github Markdown document that contains HTML tags - mostly to d= o with special characters (e.g. ℋ) and stuff to place pictures where I= want in the page. The thing is, pandoc seems to ignore the HTML tags. Is t= his a limitation intrinsic to pandoc, or is there any way to get pandoc to = process such tags and produce the right output?=20 > >=20 > > The pandoc invocation that I am currently using for converting my Githu= b Markdown documents to PDF is > >=20 > > $ pandoc --resource-path=3D/home/abc/Repos.wiki -t html5 --pdf-engine= =3Dwkhtmltopdf --metadata pagetitle=3D"MyDoc.md" --css github.css -o MyDoc.= pdf > >=20 > > The default invocation pandoc MyDoc.md -o MyDoc.pdf is not dealing with= images properly (in that it sometimes rearranges surrounding paragraphs th= e wrong way) and it seems to be unable to deal with expressions like _A__m_
, in that the and directives seem to be ignored. > >=20 > > --=20 > > You received this message because you are subscribed to the Google Grou= ps "pandoc-discuss" group. > > To unsubscribe from this group and stop receiving emails from it, send = an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > > To view this discussion on the web visit https://groups.google.com/d/ms= gid/pandoc-discuss/b1dae07b-11d1-4c98-8fcf-369f2b23a54cn%40googlegroups.com= . >=20 > --=20 > You received this message because you are subscribed to the Google Groups= "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an= email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/529BC174-779A-4D98-BCC9-F59AEAAC2B9D%40gmail.com. >=20 > --=20 > You received this message because you are subscribed to the Google Groups= "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an= email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/CAFy1yb2op3Aq%3DP4L7xpNwPBBHtopKMx%2BurWz%2B-VQ%2B5Mh0CM%3= DhQ%40mail.gmail.com. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/F4D52E47-33F8-4A2C-9A56-679BD5240ABD%40gmail.com.