Can't you use an editor with grep, searching for something like the pattern <meta.*^/> (with appropriate escapes of course).

dr. Hans van der Meer


On 16 May 2022, at 17:08, Pablo Rodriguez via ntg-context <ntg-context@ntg.nl> wrote:

Dear list,

I would like to feed
https://seumasjeltzz.github.io/LinguaeGraecaePerSeIllustrata/001.html as
XML input for ConTeXt.

The problem is that (as many other XML files that I haven’t generated
myself) some <meta> and <link> tags aren’t closed, such as in:

 <meta charset="utf-8">
 <link href="https://fonts/css?greek" rel="stylesheet">
 <link href="style.css" rel="stylesheet">

So, all that I get is the following message:

 invalid xml file - parsed text

Unsuccessfully I have tried the following:

 \xmlsetsetup{#1}{html/head/(meta|link)}{-}

Is there no way to make ConTeXt more tolerant, so that it is able to
ignore those tags?

Many thanks for your help,

Pablo
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://context.aanhet.net
archive  : https://bitbucket.org/phg/context-mirror/commits/
wiki     : http://contextgarden.net
___________________________________________________________________________________