Piping pandoc's output through a simple text filter using Perl's text match/line range operator should do the trick by not printing lines on and between the begin/end HTML markers.

````perl
#!/usr/bin/env perl

use 5.010001;
use strict;
use warnings;
use utf8;
use open qw[ :utf8 :std ];

while(<>) {  # loop on STDIN
    if ( /^\#\+BEGIN_HTML/ .. /^\#\+END_HTML/ ) {
        # skip if on/between the fences
    }
    else {
        print $_;
    }
}

__END__
````

Den 30 aug 2017 21:36 skrev "Uwe Brauer" <oub-vOKZqXPyTgQ@public.gmane.org.es>:

Hi

I just converted
https://www.theguardian.com/politics/2017/aug/30/may-to-press-japan-on-its-eu-trade-deal-in-hopes-of-a-model-for-uk
To org mode using pandoc (pandoc 1.19 in Kubuntu 14.04)

But the file containts lines such as

#+BEGIN_HTML
  <div itemprop="publisher" itemtype="https://schema.org/Organization">
#+END_HTML

#+BEGIN_HTML
  <div itemprop="logo" itemtype="https://schema.org/ImageObject">
#+END_HTML


Couldn't they just be ignored in the conversion process since they don't provide much help for org
mode.

Thanks

Uwe Brauer

--
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe@googlegroups.com.
To post to this group, send email to pandoc-discuss@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/877exkx2zw.fsf%40mat.ucm.es.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/CAFC_yuTLQ3aj-j6CHgQekOPt-1oZgxp52QWR6X2rRjXunB7K5A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.