* Converting HTML with break tags to Markdown leaves slashes in the output -- ?
@ 2016-04-28 22:19 Paul Potts
From: Paul Potts @ 2016-04-28 22:19 UTC
To: pandoc-discuss
I have some text extracted from a Blogger blog post via a convoluted
process into an HTML file, and I'm then translating that to Markdown.
Blogger's editor inserted various breaks (not really with my consent)
instead of using paragraph tags; a fragment of the extracted HTML looks
like this:
The goal is to be an independent and quite possibly contrarian reviewer.
Check back soon!<br />
<br />
If I translate this using pandoc --from html --to markdown, I get
backslashes left in the output:
The goal is to be an independent and quite possibly contrarian reviewer.
Check back soon!\
\
Using --to markdown_strict removes these. That's a workaround, but I am just
confused as to whether preserving the backslashes is intentional, and why.
Thanks,
Paul
* Re: Converting HTML with break tags to Markdown leaves slashes in the output -- ?
@ 2016-04-29 2:20 John MACFARLANE
From: John MACFARLANE @ 2016-04-29 2:20 UTC
To: pandoc-discuss
Backslash + newline is an alternative pandoc markdown syntax for
a line break within a paragraph.
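
In the pandoc manual this syntax appears as the escaped_line_breaks
extension, which is enabled for pandoc's own markdown but not for
markdown_strict; that would explain why the strict writer does not emit
the backslashes, and it should be possible to switch the extension off
with --to markdown-escaped_line_breaks. If the Markdown has already been
generated, a post-processing pass is another option. The following is only
a minimal sketch under stated assumptions, not something from this thread:
the function name soften_hard_breaks is hypothetical, and the choice to
treat a doubled break (the "<br /><br />" pattern from Blogger) as a
paragraph break is an assumption about what is wanted.

```python
import re


def soften_hard_breaks(markdown: str) -> str:
    """Rewrite backslash-newline hard breaks in pandoc Markdown text.

    Hypothetical helper, not part of pandoc. A run of two or more
    consecutive breaks becomes a blank line (a paragraph break); a single
    break becomes the two-trailing-spaces form.

    Caveat: this is plain text substitution, so it would also rewrite
    backslash-terminated lines inside code blocks, and it does not
    recognise an escaped backslash at the end of a line.
    """
    # A break followed by one or more break-only lines -> blank line.
    markdown = re.sub(r"\\\n(?:[ \t]*\\\n)+", "\n\n", markdown)
    # Any remaining single break -> two spaces before the newline.
    return re.sub(r"\\\n", "  \n", markdown)


if __name__ == "__main__":
    sample = "Check back soon!\\\n\\\nNext paragraph.\n"
    print(soften_hard_breaks(sample), end="")
```

Run on the fragment quoted in the question, the doubled break collapses
into an ordinary blank line between paragraphs, while an isolated break
elsewhere in the text is kept as a hard line break.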