public inbox archive for pandoc-discuss@googlegroups.com
 help / color / mirror / Atom feed
* Converting HTML with break tags to Markdown leaves slashes in the output -- ?
@ 2016-04-28 22:19 Paul Potts
       [not found] ` <f045de5f-c008-4608-914b-9ad9f87b6fbc-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  0 siblings, 1 reply; 2+ messages in thread
From: Paul Potts @ 2016-04-28 22:19 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 1328 bytes --]

I have some text extracted from a Blogger blog post via a convoluted 
process into an HTML file, and I'm then translating that to Markdown.

Blogger's editor inserted various breaks (not really with my consent) 
instead of using paragraph tags; a fragment of the extracted HTML looks 
like this:

The goal is to be an independent and quite possibly contrarian reviewer. 
Check back soon!<br />
<br />

If I translate this using pandoc --from html --to markdown I get slashes 
left in the output.

The goal is to be an independent and quite possibly contrarian reviewer. 
Check back soon!\
\ 

Using --to markdown_strict removes these. That's a workaround but I am just 
confused as to whether preserving the slashes are intentional and why.

Thanks,

Paul

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/f045de5f-c008-4608-914b-9ad9f87b6fbc%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[-- Attachment #1.2: Type: text/html, Size: 2198 bytes --]

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: Converting HTML with break tags to Markdown leaves slashes in the output -- ?
       [not found] ` <f045de5f-c008-4608-914b-9ad9f87b6fbc-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
@ 2016-04-29  2:20   ` John MACFARLANE
  0 siblings, 0 replies; 2+ messages in thread
From: John MACFARLANE @ 2016-04-29  2:20 UTC (permalink / raw)
  To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw

Backslash + newline is an alternative pandoc markdown syntax for
a line break within a paragraph.

+++ Paul Potts [Apr 28 16 15:19 ]:
>   I have some text extracted from a Blogger blog post via a convoluted
>   process into an HTML file, and I'm then translating that to Markdown.
>   Blogger's editor inserted various breaks (not really with my consent)
>   instead of using paragraph tags; a fragment of the extracted HTML looks
>   like this:
>   The goal is to be an independent and quite possibly contrarian
>   reviewer. Check back soon!<br />
>   <br />
>   If I translate this using pandoc --from html --to markdown I get
>   slashes left in the output.
>   The goal is to be an independent and quite possibly contrarian
>   reviewer. Check back soon!\
>   \
>   Using --to markdown_strict removes these. That's a workaround but I am
>   just confused as to whether preserving the slashes are intentional and
>   why.
>   Thanks,
>   Paul
>
>   --
>   You received this message because you are subscribed to the Google
>   Groups "pandoc-discuss" group.
>   To unsubscribe from this group and stop receiving emails from it, send
>   an email to [1]pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
>   To post to this group, send email to
>   [2]pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
>   To view this discussion on the web visit
>   [3]https://groups.google.com/d/msgid/pandoc-discuss/f045de5f-c008-4608-
>   914b-9ad9f87b6fbc%40googlegroups.com.
>   For more options, visit [4]https://groups.google.com/d/optout.
>
>References
>
>   1. mailto:pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
>   2. mailto:pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
>   3. https://groups.google.com/d/msgid/pandoc-discuss/f045de5f-c008-4608-914b-9ad9f87b6fbc-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org?utm_medium=email&utm_source=footer
>   4. https://groups.google.com/d/optout


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2016-04-29  2:20 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-04-28 22:19 Converting HTML with break tags to Markdown leaves slashes in the output -- ? Paul Potts
     [not found] ` <f045de5f-c008-4608-914b-9ad9f87b6fbc-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2016-04-29  2:20   ` John MACFARLANE

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).