From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/31038 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: PhilMac Newsgroups: gmane.text.pandoc Subject: Pandoc adding backslashes and grave accents when converting to md Date: Tue, 19 Jul 2022 22:03:40 -0700 (PDT) Message-ID: <0e8fb0aa-d500-4152-ab39-a4314ab5d27dn@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_1414_104891774.1658293420196" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="5167"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBCNPLQHPYMLBBLMZ32LAMGQEKBVIG3I-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Wed Jul 20 07:03:45 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-yb1-f191.google.com ([209.85.219.191]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1oE1s9-00018t-3O for gtp-pandoc-discuss@m.gmane-mx.org; Wed, 20 Jul 2022 07:03:45 +0200 Original-Received: by mail-yb1-f191.google.com with SMTP id t10-20020a5b07ca000000b0066ec1bb6e2csf12437465ybq.14 for ; Tue, 19 Jul 2022 22:03:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=sender:date:from:to:message-id:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=c/gnSYN86yega7NAsgrx7eEDE7kiZwoHzrBiIlLToDI=; b=rmGS/YS2he+eU8+V9A0kE5ApKFb/WfhwMFtV9Nomg1neTncsy1DMxyFqcM4w/N/B8k mcC/x/se152mBZMbsQiQXKXXpcFpqsSjmTApuWAtJhNjWlS5pb9dO8sLK/yuAE76gKJw dZTd/X163sxz6cLcwUtYrr2NDDwQnjNGSaO49h1CfSwp3TjRoE6WPQfBNFudgtnYrfey CJbvSdiA/s+r9Je1pEvvk2ZNDYEgihPKcJEPOjZvlKC/mAs/y7wvQI46FANak0GtKSuU DnTizjPZBDm/OPoBvrq8ibKIXW9OAxibt9A/e3P8U08knJfnnhWkvJM5XnzOODLCQJ4H iBgA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=sender:x-gm-message-state:date:from:to:message-id:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=c/gnSYN86yega7NAsgrx7eEDE7kiZwoHzrBiIlLToDI=; b=MUWOgqrs4H5/kg6Srl6VFR+tO/xXSSH1IP7p+km91yRazaIBkYWCis82hO+XX3yNwg SMa3CwCOICnyf0I4PtH2Kcna1pg18hYmwlZNcowyWZhsdXluGqaDUN6Wyq+FDh0bo0ru i1OSWg3SpEJcqVbe9EA9lKOaav7UQkFB+QmPBKGNG9dr8Rr9Ydg37GOiTw2y0u1OXigI xz/fPgAop68oQRYR90yZOeGYWYzcZYw2CKFH/5ZEj1GhlbKDTt/vCF+i2oM2pt7T/9Ej B92Jo85xPGgmmfi1KbUca+/nh3h1rfHxez8zgSvVwLJOfBI9RvdzTSNIEMoKH27ABttx sIJQ== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AJIora9WjhapCHXJhKITK9zc7KZVYQRxhrR04czkjKpzuRfbdfpGAEeI WU9r3hdfa2HgI06OhOzS2Tk= X-Google-Smtp-Source: AGRyM1uGK7mOxfYNY+iAoG0w6CTGHhMB8cR9Qn4DJJ4q3mQvJ37xj+P6KaUSK2V0NlUjCp2iu2pSSQ== X-Received: by 2002:a81:1ec4:0:b0:31d:e31f:1b6e with SMTP id e187-20020a811ec4000000b0031de31f1b6emr30035633ywe.11.1658293424032; Tue, 19 Jul 2022 22:03:44 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a25:2782:0:b0:670:98c6:b898 with SMTP id n124-20020a252782000000b0067098c6b898ls92553ybn.4.-pod-prod-gmail; Tue, 19 Jul 2022 22:03:41 -0700 (PDT) X-Received: by 2002:a25:bcb:0:b0:670:7a72:e8bf with SMTP id 194-20020a250bcb000000b006707a72e8bfmr8074275ybl.580.1658293420839; Tue, 19 Jul 2022 22:03:40 -0700 (PDT) X-Original-Sender: philmac-97jfqw80gc6171pxa8y+qA@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:31038 Archived-At: ------=_Part_1414_104891774.1658293420196 Content-Type: multipart/alternative; boundary="----=_Part_1415_810079645.1658293420196" ------=_Part_1415_810079645.1658293420196 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable I'm getting backslashes and grave accents in my files when converting to md= =20 from rtf. (These are files I previously exported from Apple's Pages app to= =20 RTF.) The backslashes come at the ends of lines and also before multiple=20 periods (such as an ellipsis), and the grave accents surround characters=20 including smart quotes and em-dashes and also appear at the start of most= =20 (but not all) lines in the document. The command I'm using to go from rtf= =20 to md is: for f in *.rtf; do pandoc --wrap=3Dnone "$f" -s -o "${f%.rtf}.md"; done When I export the same files from Pages to docx and then convert to md, I= =20 don't get the grave accents, but I do get some backslashes at the ends of= =20 lines. I also get *[=E2=80=9C]{dir=3D"rtl"}* where there's a left double qu= otation=20 mark and *[=E2=80=99]{dir=3D"rtl"}* where there's a smart apostrophe (i.e. = a right=20 single quotation mark). This code is something in HTML=20 = =20 to do with scripts such as Arabic that are read right-to-left=E2=80=94I'm c= lueless=20 as to what that has to do my documents, which use only English and were=20 never in HTML. The command I'm using to go from docx to md is: for f in *.docx; do pandoc --wrap=3Dnone -t markdown-smart "$f" -s -o=20 "${f%.docx}.md"; done (I have to use *-t markdown-smart*, or the smart quotes aren't preserved.= =20 But I have a similar issue if I leave it out: I get ["]{dir=3D"rtl"} and=20 *[']{dir=3D"rtl"}**.)* Does anyone have any thoughts on what might be going on? Clearly there are= =20 issues with these files=E2=80=94though not ones that are apparent in Pages= =E2=80=94but I=20 have no idea what the issues are. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/0e8fb0aa-d500-4152-ab39-a4314ab5d27dn%40googlegroups.com. ------=_Part_1415_810079645.1658293420196 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable I'm getting backslashes and grave accents in my files when converting to md= from rtf. (These are files I previously exported from Apple's Pages app to= RTF.) The backslashes come at the ends of lines and also before multiple p= eriods (such as an ellipsis), and the grave accents surround characters inc= luding smart quotes and em-dashes and also appear at the start of most (but= not all) lines in the document. The command I'm using to go from rtf to md=  is:

for f in *.rtf; do pandoc --wr= ap=3Dnone "$f" -s -o "${f%.rtf}.md"; done

When I export the same files from Pages to docx and then convert to md, = I don't get the grave accents, but I do get some backslashes at the ends of= lines. I also get [=E2=80=9C]{dir=3D"= rtl"} where there's a left double quotation= mark and [=E2=80=99]{dir=3D"rtl"= } where there's a smart apostrophe (i.e. a = right single quotation mark). This code is something in HTML t= o do with scripts such as Arabic that are read right-to-left=E2=80=94I'm cl= ueless as to what that has to do my documents, which use only English and w= ere never in HTML. The command I'm using to go from docx to md = is:

for f in *.docx; do pandoc --wr= ap=3Dnone -t markdown-smart "$f" -s -o "${f%.docx}.md"; done

(I have to use -t markdown-smart, or the smart quotes are= n't preserved. But I have a similar issue if I leave it out: I get ["]{dir=3D"rtl"= } and [']{dir=3D"rtl"}.)

<= div>
<= font face=3D"Arial">Does anyone have any thoughts on what might b= e going on? Clearly there are issues with these files=E2=80=94though n= ot ones that are apparent in Pages=E2=80=94but I have no idea what the issu= es are.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d= /msgid/pandoc-discuss/0e8fb0aa-d500-4152-ab39-a4314ab5d27dn%40googlegroups.= com.
------=_Part_1415_810079645.1658293420196-- ------=_Part_1414_104891774.1658293420196--