From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/114971 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Hans van der Meer via ntg-context Newsgroups: gmane.comp.tex.context Subject: Re: ignore not closed tags in XML input Date: Mon, 16 May 2022 17:30:04 +0200 Message-ID: <996C11BF-338C-4764-8A65-00B544EBF391@ziggo.nl> References: Reply-To: mailing list for ConTeXt users Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3696.80.82.1.1\)) Content-Type: multipart/mixed; boundary="===============4481542665017589450==" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="33587"; mail-complaints-to="usenet@ciao.gmane.io" Cc: Hans van der Meer To: NTG ConTeXt Original-X-From: ntg-context-bounces@ntg.nl Mon May 16 17:54:44 2022 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane-mx.org Original-Received: from zapf.boekplan.nl ([5.39.185.232] helo=zapf.ntg.nl) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1nqd3U-0008X7-5a for gctc-ntg-context-518@m.gmane-mx.org; Mon, 16 May 2022 17:54:44 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by zapf.ntg.nl (Postfix) with ESMTP id 1BA0F280E25; Mon, 16 May 2022 17:54:12 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at zapf.boekplan.nl Original-Received: from zapf.ntg.nl ([127.0.0.1]) by localhost (zapf.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id FXtrjDqpbQY2; Mon, 16 May 2022 17:54:10 +0200 (CEST) Original-Received: from zapf.ntg.nl (localhost [127.0.0.1]) by zapf.ntg.nl (Postfix) with ESMTP id 5996328783C; Mon, 16 May 2022 17:54:10 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by zapf.ntg.nl (Postfix) with ESMTP id 05778280E25 for ; Mon, 16 May 2022 17:54:09 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at zapf.boekplan.nl Original-Received: from zapf.ntg.nl ([127.0.0.1]) by localhost (zapf.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 0dszYF28R9Nz for ; Mon, 16 May 2022 17:54:07 +0200 (CEST) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=212.54.42.168; helo=smtpq5.tb.mail.iss.as9143.net; envelope-from=havdmeer@ziggo.nl; receiver= X-Greylist: delayed 1440 seconds by postgrey-1.36 at zapf.ntg.nl; Mon, 16 May 2022 17:54:07 CEST Original-Received: from smtpq5.tb.mail.iss.as9143.net (smtpq5.tb.mail.iss.as9143.net [212.54.42.168]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by zapf.ntg.nl (Postfix) with ESMTPS id 4EEE0280438 for ; Mon, 16 May 2022 17:54:07 +0200 (CEST) Original-Received: from [212.54.42.105] (helo=smtp1.tb.mail.iss.as9143.net) by smtpq5.tb.mail.iss.as9143.net with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nqcff-0002HL-23 for ntg-context@ntg.nl; Mon, 16 May 2022 17:30:07 +0200 Original-Received: from smtpclient.apple ([84.106.134.200]) by smtp1.tb.mail.iss.as9143.net with ESMTPA id qcfenDvQcO6CXqcfen5rDU; Mon, 16 May 2022 17:30:07 +0200 X-Env-Mailfrom: havdmeer@ziggo.nl X-Env-Rcptto: ntg-context@ntg.nl X-SourceIP: 84.106.134.200 X-CNFS-Analysis: v=2.4 cv=cPTzD3SN c=1 sm=1 tr=0 ts=62826dff cx=a_exe a=wCstmS+ZHA3zSJXjQC+ubA==:117 a=wCstmS+ZHA3zSJXjQC+ubA==:17 a=MiNTnEJAAAAA:8 a=SQs2FS0bAAAA:8 a=YEMqx4UAAAAA:8 a=ACQCx6kCAAAA:8 a=xtERp6CFAAAA:8 a=a3nu-2BBAAAA:8 a=wbugXQIw9nuoB9w96ucA:9 a=QEXdDO2ut3YA:10 a=6xhFQYlxQlUA:10 a=psbNkZ0eai-DNqSbUOsA:9 a=v2CuewCKQawsXwza:21 a=_W_S_7VecoQA:10 a=LmrbSfiT3hecnSZifb5M:22 a=ZR5RGohv4dFzgcZAq_X4:22 a=V0662LiR8DSfwiDagK97:22 a=Sab0UneHBzlWrQDlOuxD:22 a=ekCXXmE-vB8RPiJ3MEZb:22 X-Authenticated-Sender: havdmeer@ziggo.nl DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=ziggo.nl; s=202002corplgsmtpnl; t=1652715007; bh=nqE8w/Bd9wlCFcH7i5dds1ArPckwxVUhCaFZGe6kXag=; h=From:Subject:Date:References:To:In-Reply-To; b=VXOlhB6+mUEKH6trUoRMiehq2fU8nsmJ7Ch8SBBDfqO+XJxjEKrTdO5y31CBtesJ8 9YjdXFUevPDmwK46vbB1aulh79yxvqNK4WoqpjceO8Cuj68Hn/ThA4gcfkY2TUp/fp 4uL9aS4TSIk1XOr4sDh0WuvdYllQe4zJqpcak/mUY0ZA0JDBMBlbBVCNMvjzJ5GBFl 7VU9DCGsGHaEVyg6Yso4iq8EDSiRXen5sukQG844H57ciIUQ8f9NUc/NjfmftRZ1uN uvEGIpbVqjjMrZHkvTDhAuoCipmFAu7Rfr29l/akKPP8xn9bBeUxD4gFtd3n4RjH1d CzdKeR0D2Aemg== In-Reply-To: X-Mailer: Apple Mail (2.3696.80.82.1.1) X-CMAE-Envelope: MS4xfC/TfiBcLdhm/3blVHtBbshBzthNrhtmZpuIbOreRKcdHFp1EVgoRgTUI+4JRfyQSs7rH5ce8eL0Yghcg4aHRvZuhi/SufocUuCjd1zxX7uqMIHrXoTq A9sWqLl7LbxUz3Q4q9YJiW6saWrZ5PLzZcgEumOB9929WfW2+bKrJw9YjGj5bgzAkU7+y8u7ZZhY9w== X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.26 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: ntg-context-bounces@ntg.nl Original-Sender: "ntg-context" Xref: news.gmane.io gmane.comp.tex.context:114971 Archived-At: --===============4481542665017589450== Content-Type: multipart/alternative; boundary="Apple-Mail=_EA088013-92C8-414F-81EF-8FA6E29C34D3" --Apple-Mail=_EA088013-92C8-414F-81EF-8FA6E29C34D3 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 Can't you use an editor with grep, searching for something like the = pattern (with appropriate escapes of course). dr. Hans van der Meer > On 16 May 2022, at 17:08, Pablo Rodriguez via ntg-context = wrote: >=20 > Dear list, >=20 > I would like to feed > https://seumasjeltzz.github.io/LinguaeGraecaePerSeIllustrata/001.html = as > XML input for ConTeXt. >=20 > The problem is that (as many other XML files that I haven=E2=80=99t = generated > myself) some and tags aren=E2=80=99t closed, such as in: >=20 > > > >=20 > So, all that I get is the following message: >=20 > invalid xml file - parsed text >=20 > Unsuccessfully I have tried the following: >=20 > \xmlsetsetup{#1}{html/head/(meta|link)}{-} >=20 > Is there no way to make ConTeXt more tolerant, so that it is able to > ignore those tags? >=20 > Many thanks for your help, >=20 > Pablo > = __________________________________________________________________________= _________ > If your question is of interest to others as well, please add an entry = to the Wiki! >=20 > maillist : ntg-context@ntg.nl / = http://www.ntg.nl/mailman/listinfo/ntg-context > webpage : http://www.pragma-ade.nl / http://context.aanhet.net > archive : https://bitbucket.org/phg/context-mirror/commits/ > wiki : http://contextgarden.net > = __________________________________________________________________________= _________ --Apple-Mail=_EA088013-92C8-414F-81EF-8FA6E29C34D3 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=utf-8 Can't= you use an editor with grep, searching for something like the pattern = <meta.*^/> (with appropriate escapes of course).

dr. Hans van der Meer


On 16 May 2022, at 17:08, Pablo Rodriguez via ntg-context = <ntg-context@ntg.nl> wrote:

Dear = list,

I would like to feed
https://seumasjeltzz.github.io/LinguaeGraecaePerSeIllustrata/00= 1.html as
XML input for ConTeXt.

The problem is that (as many other XML files that I haven=E2=80= =99t generated
myself) some <meta> and <link> = tags aren=E2=80=99t closed, such as in:

=  <meta charset=3D"utf-8">
 <link = href=3D"https://fonts/css?greek" rel=3D"stylesheet">
 <link href=3D"style.css" rel=3D"stylesheet">

So, all that I get is the following = message:

 invalid xml file - parsed = text

Unsuccessfully I have tried the = following:

=  \xmlsetsetup{#1}{html/head/(meta|link)}{-}

Is there no way to make ConTeXt more tolerant, so that it is = able to
ignore those tags?

Many= thanks for your help,

Pablo
_______________________________________________________________= ____________________
If your question is of interest to = others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://context.aanhet.net
archive  : = https://bitbucket.org/phg/context-mirror/commits/
wiki     : http://contextgarden.net
_______________________________________________________________= ____________________

= --Apple-Mail=_EA088013-92C8-414F-81EF-8FA6E29C34D3-- --===============4481542665017589450== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX18KSWYgeW91ciBxdWVzdGlvbiBpcyBvZiBpbnRlcmVz dCB0byBvdGhlcnMgYXMgd2VsbCwgcGxlYXNlIGFkZCBhbiBlbnRyeSB0byB0aGUgV2lraSEKCm1h aWxsaXN0IDogbnRnLWNvbnRleHRAbnRnLm5sIC8gaHR0cDovL3d3dy5udGcubmwvbWFpbG1hbi9s aXN0aW5mby9udGctY29udGV4dAp3ZWJwYWdlICA6IGh0dHA6Ly93d3cucHJhZ21hLWFkZS5ubCAv IGh0dHA6Ly9jb250ZXh0LmFhbmhldC5uZXQKYXJjaGl2ZSAgOiBodHRwczovL2JpdGJ1Y2tldC5v cmcvcGhnL2NvbnRleHQtbWlycm9yL2NvbW1pdHMvCndpa2kgICAgIDogaHR0cDovL2NvbnRleHRn YXJkZW4ubmV0Cl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fCg== --===============4481542665017589450==--