From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/114347 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Karl Pettersson via ntg-context Newsgroups: gmane.comp.tex.context Subject: Re: XMP metadata schema yields invalid PDF/A Date: Sat, 5 Feb 2022 09:48:16 +0100 Message-ID: References: <1f7be313-e921-1fdf-f87c-33d3899e397a@xs4all.nl> Reply-To: mailing list for ConTeXt users Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="ImGF4TvkZCC4MtNF" Content-Transfer-Encoding: 8bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="20365"; mail-complaints-to="usenet@ciao.gmane.io" Cc: Karl Pettersson To: mailing list for ConTeXt users Original-X-From: ntg-context-bounces@ntg.nl Sat Feb 05 09:48:54 2022 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane-mx.org Original-Received: from zapf.boekplan.nl ([5.39.185.232] helo=zapf.ntg.nl) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1nGGkY-0004ui-9M for gctc-ntg-context-518@m.gmane-mx.org; Sat, 05 Feb 2022 09:48:54 +0100 Original-Received: from localhost (localhost [127.0.0.1]) by zapf.ntg.nl (Postfix) with ESMTP id 85D922A281C; Sat, 5 Feb 2022 09:48:27 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at zapf.boekplan.nl Original-Received: from zapf.ntg.nl ([127.0.0.1]) by localhost (zapf.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id RifYw2EUnYAD; Sat, 5 Feb 2022 09:48:25 +0100 (CET) Original-Received: from zapf.ntg.nl (localhost [127.0.0.1]) by zapf.ntg.nl (Postfix) with ESMTP id 0B8992A280D; Sat, 5 Feb 2022 09:48:25 +0100 (CET) Original-Received: from localhost (localhost [127.0.0.1]) by zapf.ntg.nl (Postfix) with ESMTP id 8F24F2A280D for ; Sat, 5 Feb 2022 09:48:23 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at zapf.boekplan.nl Original-Received: from zapf.ntg.nl ([127.0.0.1]) by localhost (zapf.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id GP6e_DFN5thP for ; Sat, 5 Feb 2022 09:48:21 +0100 (CET) Received-SPF: None (mailfrom) identity=mailfrom; client-ip=93.188.3.37; helo=smtp.outgoing.loopia.se; envelope-from=karl.pettersson@klpn.se; receiver= Original-Received: from smtp.outgoing.loopia.se (smtp.outgoing.loopia.se [93.188.3.37]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) (No client certificate requested) by zapf.ntg.nl (Postfix) with ESMTPS id 7431D2A280C for ; Sat, 5 Feb 2022 09:48:21 +0100 (CET) Original-Received: from s807.loopia.se (localhost [127.0.0.1]) by s807.loopia.se (Postfix) with ESMTP id 131952469B8D for ; Sat, 5 Feb 2022 09:48:20 +0100 (CET) Original-Received: from s899.loopia.se (unknown [172.22.191.5]) by s807.loopia.se (Postfix) with ESMTP id 042F02F505E5; Sat, 5 Feb 2022 09:48:20 +0100 (CET) Original-Received: from s470.loopia.se (unknown [172.22.191.6]) by s899.loopia.se (Postfix) with ESMTP id 00D792C9676F; Sat, 5 Feb 2022 09:48:19 +0100 (CET) X-Virus-Scanned: amavisd-new at amavis.loopia.se Original-Received: from s645.loopia.se ([172.22.191.6]) by s470.loopia.se (s470.loopia.se [172.22.190.10]) (amavisd-new, port 10024) with LMTP id qbXai5B8bbA0; Sat, 5 Feb 2022 09:48:17 +0100 (CET) X-Loopia-Auth: user X-Loopia-User: karl.pettersson@klpn.se X-Loopia-Originating-IP: IPv6:2a00:801:74d:9f61:51ef:ff36:4ae7:9ba8 Original-Received: from localhost (unknown [IPv6:2a00:801:74d:9f61:51ef:ff36:4ae7:9ba8]) (Authenticated sender: karl.pettersson@klpn.se) by s645.loopia.se (Postfix) with ESMTPSA id BA271157BA81; Sat, 5 Feb 2022 09:48:17 +0100 (CET) Content-Disposition: inline In-Reply-To: X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.26 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: ntg-context-bounces@ntg.nl Original-Sender: "ntg-context" Xref: news.gmane.io gmane.comp.tex.context:114347 Archived-At: --ImGF4TvkZCC4MtNF Content-Type: text/plain; charset=us-ascii Content-Disposition: inline On Sat, Feb 05, 2022 at 12:37:44AM +0100, luigi scarso via ntg-context wrote: > On Fri, Feb 4, 2022 at 11:11 PM Karl Pettersson via ntg-context < > ntg-context@ntg.nl> wrote: > > > On Fri, Feb 04, 2022 at 10:25:27PM +0100, Hans Hagen via ntg-context wrote: > > > On 2/4/2022 7:29 PM, Karl Pettersson via ntg-context wrote: > > > > Hi > > > > > > > > PDF/A files generated using ConTeXt fail validation with veraPDF, and > > > > the reason seems to be that the dc:description metadata is defined with > > > > the wrong type in the embedded XMP extension schema. > > > > > > > > > > https://tex.stackexchange.com/questions/632380/generate-pdf-a-with-context > > > > > > > > https://github.com/veraPDF/veraPDF-library/issues/1224 > > > > > > > > I can reproduce the problem using TeX Live 2021 (MkIV 2021.03.05). The > > > > definition seems to be controlled by this code. > > > > > > > > > > https://source.contextgarden.net/tex/context/base/mkiv/lpdf-pua.xml?search=rdf#l81 > > > so "dc:description" is not permitted? it is mentioned in > > > > > > > > https://www.dublincore.org/specifications/dublin-core/dcmi-terms/#http://purl.org/dc/elements/1.1/description > > > > > > (btw it never failed before) > > > > The description element is permitted, but the problem seems to be that > > its valueType is defined as Text in the embedded schema, while the > > metadata element in the PDF has an embedded > xml:lang="x-default"> structure. > > > > > > https://github.com/veraPDF/veraPDF-library/issues/1224#issuecomment-1029932963 > > > > (I suppose the reference to "title" in the issue comment should be > > "description".) > > > > Validating with veraPDF <1.20 does not raise the error. The validation > > seems to have changed for redefined types in that version. > > > > > > https://github.com/veraPDF/veraPDF-library/blob/integration/RELEASENOTES.md#validation > > > I am missing something here... true > FOO > makes a valid pdf 3a with verapdf 1.20.1. > But dc:description is like dc:title, so where we are redefining > dc:description as Text ? > > -- > luigi Attach metadata xml for the non-validating and validating example (extraced with `pdfinfo -meta | xmllint format -`) in the Github issue. Here is a reference to the description element. Note that only description seems to be redefined, not title. https://source.contextgarden.net/tex/context/base/mkiv/lpdf-pua.xml#l81 > ___________________________________________________________________________________ > If your question is of interest to others as well, please add an entry to the Wiki! > > maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context > webpage : http://www.pragma-ade.nl / http://context.aanhet.net > archive : https://bitbucket.org/phg/context-mirror/commits/ > wiki : http://contextgarden.net > ___________________________________________________________________________________ -- Karl Pettersson Uppsala, Sweden https://static-dust.klpn.se/ --ImGF4TvkZCC4MtNF Content-Type: text/xml; charset=utf-8 Content-Disposition: attachment; filename="test.xml" Content-Transfer-Encoding: 8bit application/pdf AUTHOR TITLE out | 2022-02-02T21:34:48+01:00 out 2022-02-02 21:34 www.pragma-ade.com contextgarden.net 2019.03.21 21:39 tug.org 1.10 7127 5.3 linux-64 2022-02-02T21:34:48+01:00 LuaTeX 1.10 7127 + ConTeXt MkIV 2019.03.21 21:39 2022-02-02T21:34:48+01:00 2022-02-02T21:34:48+01:00 LuaTeX-1.10 False uuid:77baf08a-41c6-87cb-47d0-807f764f5064 uuid:fe6c773e-42b0-864d-4c63-da5c57ec1a40 http://ns.adobe.com/pdf/1.3/ pdf Adobe PDF Schema internal A name object indicating whether the document has been modified to include trapping information Trapped Text http://purl.org/dc/elements/1.1/ pdf Dubline Core Schema internal Subject in Document Properties description Text http://ns.adobe.com/pdfx/1.3/ pdfx PDF/X ID Schema external Name of the ConTeXt job ConTeXt.Jobname Text external Time stamp of ConTeXt version ConTeXt.Time Text external ConTeXt website ConTeXt.Url Text external ConTeXt support ConTeXt.Support Text external ConTeXt version ConTeXt.Version Text external ConTeXt LMTX version ConTeXt.LMTX Text external TeX support TeX.Support Text external LuaTeX version LuaTeX.Version Text external LuaTeX functionality LuaTeX.Functionality Text external LuaTeX Lua version LuaTeX.LuaVersion Text external LuaTeX platform LuaTeX.Platform Text external Document identifier ID Text http://ns.adobe.com/xap/1.0/mm/ xmpMM XMP Media Management Schema internal UUID based identifier for specific incarnation of a document InstanceID URI http://www.aiim.org/pdfa/ns/id/ pdfaid PDF/A ID Schema internal Part of PDF/A standard part Integer internal Amendment of PDF/A standard amd Text internal Conformance level of PDF/A standard conformance Text 3 A --ImGF4TvkZCC4MtNF Content-Type: text/xml; charset=utf-8 Content-Disposition: attachment; filename="test-fixed.xml" Content-Transfer-Encoding: 8bit application/pdf AUTHOR TITLE out | 2022-02-02T21:34:48+01:00 out 2022-02-02 21:34 www.pragma-ade.com contextgarden.net 2019.03.21 21:39 tug.org 1.10 7127 5.3 linux-64 2022-02-02T21:34:48+01:00 LuaTeX 1.10 7127 + ConTeXt MkIV 2019.03.21 21:39 2022-02-04T15:11:05+03:00 2022-02-04T15:11:05+03:00 LuaTeX-1.10 Unknown uuid:77baf08a-41c6-87cb-47d0-807f764f5064 uuid:00ca627f-3d6c-46bd-8c42-793d2c04eb4d 3 A http://ns.adobe.com/pdf/1.3/ pdf Adobe PDF Schema internal A name object indicating whether the document has been modified to include trapping information Trapped Text http://ns.adobe.com/pdfx/1.3/ pdfx PDF/X ID Schema external Name of the ConTeXt job ConTeXt.Jobname Text external Time stamp of ConTeXt version ConTeXt.Time Text external ConTeXt website ConTeXt.Url Text external ConTeXt support ConTeXt.Support Text external ConTeXt version ConTeXt.Version Text external ConTeXt LMTX version ConTeXt.LMTX Text external TeX support TeX.Support Text external LuaTeX version LuaTeX.Version Text external LuaTeX functionality LuaTeX.Functionality Text external LuaTeX Lua version LuaTeX.LuaVersion Text external LuaTeX platform LuaTeX.Platform Text external Document identifier ID Text http://ns.adobe.com/xap/1.0/mm/ xmpMM XMP Media Management Schema internal UUID based identifier for specific incarnation of a document InstanceID URI http://www.aiim.org/pdfa/ns/id/ pdfaid PDF/A ID Schema internal Part of PDF/A standard part Integer internal Amendment of PDF/A standard amd Text internal Conformance level of PDF/A standard conformance Text --ImGF4TvkZCC4MtNF Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX18KSWYgeW91ciBxdWVzdGlvbiBpcyBvZiBpbnRlcmVz dCB0byBvdGhlcnMgYXMgd2VsbCwgcGxlYXNlIGFkZCBhbiBlbnRyeSB0byB0aGUgV2lraSEKCm1h aWxsaXN0IDogbnRnLWNvbnRleHRAbnRnLm5sIC8gaHR0cDovL3d3dy5udGcubmwvbWFpbG1hbi9s aXN0aW5mby9udGctY29udGV4dAp3ZWJwYWdlICA6IGh0dHA6Ly93d3cucHJhZ21hLWFkZS5ubCAv IGh0dHA6Ly9jb250ZXh0LmFhbmhldC5uZXQKYXJjaGl2ZSAgOiBodHRwczovL2JpdGJ1Y2tldC5v cmcvcGhnL2NvbnRleHQtbWlycm9yL2NvbW1pdHMvCndpa2kgICAgIDogaHR0cDovL2NvbnRleHRn YXJkZW4ubmV0Cl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fCg== --ImGF4TvkZCC4MtNF--