From mboxrd@z Thu Jan 1 00:00:00 1970
From: John MacFarlane
Newsgroups: gmane.text.pandoc
Subject: Re: Getting pandoc to convert Github Markdown documents with HTML tags to PDF
Date: Wed, 5 Jul 2023 12:47:24 -0700
Message-ID: <2EAB1263-5AFF-41B5-A875-ABB40CACE349@gmail.com>
References: <529BC174-779A-4D98-BCC9-F59AEAAC2B9D@gmail.com>
Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3731.600.7\))
Content-Type: multipart/alternative; boundary="Apple-Mail=_2DA25CC3-4A22-4626-98AB-9C32D4076491"
X-Mailer: Apple Mail (2.3731.600.7)
Precedence: list
Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
Xref: news.gmane.io gmane.text.pandoc:32924

--Apple-Mail=_2DA25CC3-4A22-4626-98AB-9C32D4076491
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset="UTF-8"

That's a font issue. You may need to specify a font with --pdf-engine-opt
or try another PDF engine that works with HTML.

> On Jul 5, 2023, at 12:16 PM, Luveh Keraph <1.41421-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
>
> Thanks. What I see for &Hscr; is that a non-printable character is generated. When using --verbose --ascii -t html5 it appears as &#x210B; in the resulting file, and just an empty space (as far as I can see) in the PDF file.
>
> On Wed, Jul 5, 2023 at 12:41 PM John MacFarlane <fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
>> &mdash; and &Hscr; will be parsed as Unicode characters and these will be passed through to the HTML.
>> You can check the intermediate HTML file (again it will be printed with --verbose) to confirm this.
>> It may be that the program that is being invoked to go from HTML -> PDF (wkhtmltopdf ?) doesn't handle these characters properly.
>> You could try adding the `--ascii` option which will force entities to be used.
>>
>> > On Jul 4, 2023, at 4:07 PM, Luveh Keraph <1.41421-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
>> >
>> > Thanks. I invoked pandoc -f gfm MyDoc. -o MyDoc.pdf and in the resulting PDF document the subscripts are still ignored. When running it with --verbose, I saw numerous instances of the following in the output:
>> >
>> > [INFO] Not rendering RawInline (Format "html") "</sub>"
>> > [INFO] Not rendering RawInline (Format "html") "<sub>"
>> >
>> > However, when I added -t html5 to the invocation the diagnostics above disappeared, and the subscripts are indeed present in the converted PDF file. Thanks for the tip - it has indeed improved things. It is still the case that things like &mdash; or &Hscr; are ignored by pandoc. Any suggestions on how to get pandoc to process them?
>> >
>> > I am using the following:
>> >
>> > pandoc 3.1.4
>> > Features: +server +lua
>> > Scripting engine: Lua 5.4
>> >
>> > On Tue, Jul 4, 2023 at 3:50 PM John MacFarlane <fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
>> > HTML tags should be passed through to HTML formats.
>> >
>> > Have you looked at the intermediate HTML produced? You can use --verbose to see it.
>> >
>> > This seems to work fine:
>> >
>> > % pandoc -t html5
>> > _A_<sub>_m_</sub>
>> > <p><em>A</em><sub><em>m</em></sub></p>
>> >
>> > PS. You probably want to use -f gfm if you're targeting GitHub Markdown.
>> >
>> > Pandoc version?
>> >
>> > > On Jul 3, 2023, at 3:41 PM, Luveh Keraph <1.41421-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
>> > >
>> > > I have a GitHub Markdown document that contains HTML tags - mostly to do with special characters (e.g. &Hscr;) and stuff to place pictures where I want in the page. The thing is, pandoc seems to ignore the HTML tags. Is this a limitation intrinsic to pandoc, or is there any way to get pandoc to process such tags and produce the right output?
>> > >
>> > > The pandoc invocation that I am currently using for converting my GitHub Markdown documents to PDF is
>> > >
>> > >  $ pandoc --resource-path=/home/abc/Repos.wiki -t html5 --pdf-engine=wkhtmltopdf --metadata pagetitle="MyDoc.md" --css github.css -o MyDoc.pdf
>> > >
>> > > The default invocation pandoc MyDoc.md -o MyDoc.pdf is not dealing with images properly (in that it sometimes rearranges surrounding paragraphs the wrong way) and it seems to be unable to deal with expressions like _A_<sub>_m_</sub>, in that the <sub> and </sub> directives seem to be ignored.
>> > >
>> > > --
>> > > You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
>> > > To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
>> > > To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/b1dae07b-11d1-4c98-8fcf-369f2b23a54cn%40googlegroups.com.

--
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/2EAB1263-5AFF-41B5-A875-ABB40CACE349%40gmail.com.

--Apple-Mail=_2DA25CC3-4A22-4626-98AB-9C32D4076491--
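
The entity handling discussed in this thread can be checked without running pandoc. The following is a minimal sketch that uses only Python's standard library (Python is an assumption of this note; it is not used anywhere in the thread), and the helper name `describe_entity` is hypothetical. It decodes a named HTML entity to its Unicode character, reports the character's name, and builds the hexadecimal numeric reference, which is the form reported above for `--ascii` output. It says nothing about whether a given PDF engine's font actually contains the glyph.

```python
import html
import unicodedata


def describe_entity(entity):
    """Return (character, Unicode name, hex numeric reference) for a named HTML entity.

    Hypothetical helper for illustration only; it does not call pandoc.
    """
    char = html.unescape(entity)
    if len(char) != 1:
        # html.unescape leaves unknown entities unchanged, so the result
        # is longer than one character when the name is not recognised.
        raise ValueError(f"{entity!r} did not decode to a single character")
    return char, unicodedata.name(char), f"&#x{ord(char):X};"


if __name__ == "__main__":
    for entity in ("&mdash;", "&Hscr;"):
        char, name, numeric = describe_entity(entity)
        print(f"{entity} -> U+{ord(char):04X} {name} -> {numeric}")
```

For `&Hscr;` this gives U+210B (SCRIPT CAPITAL H) and the reference `&#x210B;`, matching what was reported in the intermediate HTML. Since the character is valid in that HTML and only the PDF shows a blank, a missing glyph in the font used by the PDF engine is a plausible cause.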