public inbox archive for pandoc-discuss@googlegroups.com
From: John MacFarlane <jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org>
To: totyped-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org,
	pandoc-discuss
	<pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
Subject: Re: Problem When Transcribing Greek HTML Letters to LaTex
Date: Thu, 11 Apr 2019 21:54:22 -0700
Message-ID: <m2y34fvn5t.fsf@johnmacfarlane.net>
In-Reply-To: <885bafeb-28cc-4f7f-ad1f-f7708ed8fb4d-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>

totyped-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org writes:

> I see, thank you very much for your help!
> So you basically replaced every `<` with `&lt;`, correct?

All the < except the ones that are parts of HTML tags.
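
A minimal Python sketch of that kind of escaping, assuming a real tag
always looks like <name ...>, </name> or <name/> with an ASCII name.
The function name escape_stray_lt and the regular expression are
illustrative assumptions, not something pandoc provides:

```python
import re

# Assumption: a "real" tag is <name ...>, </name> or <name/> with an ASCII
# name. Anything else that starts with "<" is treated as literal text.
TAG = re.compile(r"</?[A-Za-z][A-Za-z0-9]*(?:\s[^<>]*)?/?>")


def escape_stray_lt(html: str) -> str:
    """Replace every '<' that does not open a recognised tag with '&lt;'."""
    out = []
    pos = 0
    for m in TAG.finditer(html):
        # Text between two tags: escape any '<' found there.
        out.append(html[pos:m.start()].replace("<", "&lt;"))
        # The tag itself is copied through unchanged.
        out.append(m.group(0))
        pos = m.end()
    # Trailing text after the last tag.
    out.append(html[pos:].replace("<", "&lt;"))
    return "".join(out)


if __name__ == "__main__":
    sample = "<i><λv<sub>1</sub>,w> + <v<sub>2</sub>,w></i>"
    print(escape_stray_lt(sample))
    # prints: <i>&lt;λv<sub>1</sub>,w> + &lt;v<sub>2</sub>,w></i>
```

This heuristic would still mistake literal text such as <v> for a tag,
so a whitelist of known tag names may be safer for real input.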

> Also: do you understand why the string is rendered perfectly if I
> just type it as is into my Stack Overflow question,
> https://stackoverflow.com/q/55642183/6114310?

StackOverflow is parsing as Markdown, which allows raw HTML.
And it's more forgiving of invalid HTML, apparently,
than pandoc's HTML parser.

Interestingly, pandoc does the same thing. If you read
markdown, it allows this:

% pandoc -f markdown -t latex
<i><λv<sub>1</sub>,w> + <v<sub>2</sub>,w></i>
\textless λv1,w\textgreater{} + \textless v2,w\textgreater{}

Here the emphasis disappears, though, because the HTML
tag <i> is parsed as raw HTML, which won't be passed
through to LaTeX:

% pandoc -f markdown -t native
<i><λv<sub>1</sub>,w> + <v<sub>2</sub>,w></i>
[Para [RawInline (Format "html") "<i>",Str "<\955v",RawInline (Format "html") "<sub>",Str "1",RawInline (Format "html") "</sub>",Str ",w>",Space,Str "+",Space,Str "<v",RawInline (Format "html") "<sub>",Str "2",RawInline (Format "html") "</sub>",Str ",w>",RawInline (Format "html") "</i>"]]

But if you read HTML, it doesn't like the unescaped <.

% pandoc -f html -t latex
<i><λv<sub>1</sub>,w> + <v<sub>2</sub>,w></i>
\emph{1,w\textgreater{} + 2,w\textgreater{}}
