ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
* [NTG-context] TDM Reservation Protocol (TDMRep)
@ 2025-04-30  5:49 Steffen Wolfrum
  2025-04-30  8:56 ` [NTG-context] " Gottschalt, Johannes
  2025-04-30 16:32 ` Pablo Rodriguez via ntg-context
  0 siblings, 2 replies; 14+ messages in thread
From: Steffen Wolfrum @ 2025-04-30  5:49 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Hi,

we just saw the Accessibility Directive, now there is another demand: 
the TDM protocol (concerning the reuse of protected works for AI training purposes).

Is it possible to implement these TDM Metadata when producing PDF files with ConTeXt?

This technique provides a way to embed the rightsholder's choice into a PDF file.
(See «§6.6 TDM Metadata in PDF files»: https://www.w3.org/community/reports/tdmrep/CG-FINAL-tdmrep-20240510/#sec-pdf)

Steffen

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30  5:49 [NTG-context] TDM Reservation Protocol (TDMRep) Steffen Wolfrum
@ 2025-04-30  8:56 ` Gottschalt, Johannes
  2025-04-30 12:55   ` Steffen Wolfrum
  2025-04-30 16:32 ` Pablo Rodriguez via ntg-context
  1 sibling, 1 reply; 14+ messages in thread
From: Gottschalt, Johannes @ 2025-04-30  8:56 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Would https://wiki.contextgarden.net/Command/setupbackend work if you include the requisite information in your XMP file?

Best
J

-----Ursprüngliche Nachricht-----
Von: Steffen Wolfrum <context@st.estfiles.de> 
Gesendet: Mittwoch, 30. April 2025 07:49
An: mailing list for ConTeXt users <ntg-context@ntg.nl>
Betreff: [NTG-context] TDM Reservation Protocol (TDMRep)

Hi,

we just saw the Accessibility Directive, now there is another demand: 
the TDM protocol (concerning the reuse of protected works for AI training purposes).

Is it possible to implement these TDM Metadata when producing PDF files with ConTeXt?

This technique provides a way to embed the rightsholder's choice into a PDF file.
(See «§6.6 TDM Metadata in PDF files»: https://www.w3.org/community/reports/tdmrep/CG-FINAL-tdmrep-20240510/#sec-pdf)

Steffen

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror) archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30  8:56 ` [NTG-context] " Gottschalt, Johannes
@ 2025-04-30 12:55   ` Steffen Wolfrum
  2025-04-30 22:02     ` Rik Kabel
  0 siblings, 1 reply; 14+ messages in thread
From: Steffen Wolfrum @ 2025-04-30 12:55 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Do you mean by adding to my document  \setupbackend[xmpfile=…] ?

And if so, how do I actually write a xmpfile?

Steffen


> Am 30.04.2025 um 10:56 schrieb Gottschalt, Johannes <gottschalt@sub.uni-goettingen.de>:
> 
> Would https://wiki.contextgarden.net/Command/setupbackend work if you include the requisite information in your XMP file?
> 
> Best
> J
> 
> -----Ursprüngliche Nachricht-----
> Von: Steffen Wolfrum <context@st.estfiles.de> 
> Gesendet: Mittwoch, 30. April 2025 07:49
> An: mailing list for ConTeXt users <ntg-context@ntg.nl>
> Betreff: [NTG-context] TDM Reservation Protocol (TDMRep)
> 
> Hi,
> 
> we just saw the Accessibility Directive, now there is another demand: 
> the TDM protocol (concerning the reuse of protected works for AI training purposes).
> 
> Is it possible to implement these TDM Metadata when producing PDF files with ConTeXt?
> 
> This technique provides a way to embed the rightsholder's choice into a PDF file.
> (See «§6.6 TDM Metadata in PDF files»: https://www.w3.org/community/reports/tdmrep/CG-FINAL-tdmrep-20240510/#sec-pdf)
> 
> Steffen
> 
> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
> 
> maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
> webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror) archive  : https://github.com/contextgarden/context
> wiki     : https://wiki.contextgarden.net
> ___________________________________________________________________________________
> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
> 
> maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
> webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
> archive  : https://github.com/contextgarden/context
> wiki     : https://wiki.contextgarden.net
> ___________________________________________________________________________________

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30  5:49 [NTG-context] TDM Reservation Protocol (TDMRep) Steffen Wolfrum
  2025-04-30  8:56 ` [NTG-context] " Gottschalt, Johannes
@ 2025-04-30 16:32 ` Pablo Rodriguez via ntg-context
  2025-04-30 21:42   ` Bruce Horrocks
  2025-05-01 11:34   ` Steffen Wolfrum
  1 sibling, 2 replies; 14+ messages in thread
From: Pablo Rodriguez via ntg-context @ 2025-04-30 16:32 UTC (permalink / raw)
  To: ntg-context; +Cc: Pablo Rodriguez

On 4/30/25 07:49, Steffen Wolfrum wrote:
> Hi,
> 
> we just saw the Accessibility Directive, now there is another
> demand: the TDM protocol (concerning the reuse of protected works
> for AI training purposes).
Hi Steffen,

I have no doubt your suggestion or request is well intended.

But do you really think this may be worth the hassle?

I mean, from Llama is suspected that it has been trained with more than
80TB of unauthorised copies of books (from shadow libraries
https://en.wikipedia.org/wiki/Anna%27s_Archive#Meta_lawsuit,
https://en.wikipedia.org/wiki/Library_Genesis#Alleged_Meta_AI_training).

Do you really think that reservations will be honoured?

Even when they honour reservations, research purposes seem to be granted
without possible reservations (probably also to commercial entities).

As for the record, Google was taken to court in Spain by an individual
because of not complying with their robots.txt (not allowing Google to
crawl their content).

This went to the Spanish Supreme Court, who ruled that this was a case
of «ius usus innocui» (right to harmless use; not exactly fair right).

This was a commercial entity (well, an emporium), making a full copy of
a copyrighted work and the author has (or had before this ruling) the
right to control distribution of their work.

Sorry for being straight, but this is similar to data protection. As
long as we give our data (just by using your phones or using internet),
our data are collected, analyzed and traded in order to influence our
behaviour.

I’m afraid that this is part of sharing documents on the internet.

Just my two cents,

Pablo
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30 16:32 ` Pablo Rodriguez via ntg-context
@ 2025-04-30 21:42   ` Bruce Horrocks
  2025-05-01 10:27     ` Pablo Rodriguez via ntg-context
  2025-05-01 11:34   ` Steffen Wolfrum
  1 sibling, 1 reply; 14+ messages in thread
From: Bruce Horrocks @ 2025-04-30 21:42 UTC (permalink / raw)
  To: ntg-context mailing list; +Cc: Pablo Rodriguez



> On 30 Apr 2025, at 17:32, Pablo Rodriguez via ntg-context <ntg-context@ntg.nl> wrote:
> 
> But do you really think this may be worth the hassle?

If we're going to spend effort to counter AI crawlers then I think I would prefer to have an option to poison files.

Recently, YouTuber F4mi, who creates some excellent deep dives on obscure technology, detailed her efforts "to poison any AI summarisers that were trying to steal my content to make slop."
<https://www.youtube.com/watch?v=NEDFUjqA1s8>

Essentially she creates large amounts of nonsense text as YouTube subtitles but sets them to appear off-screen. The real subtitles appear on screen so normal users don't notice but unless the bot spends the effort to work out whether text is visible or not they will slurp up the nonsense and hopefully poison their models.

Perhaps ConTeXt could introduce something similar where invisible pages or invisible layers of nonsense are auto-generated but can't be seen on screen or when a document is printed?

In another group we shared recipes for cows eggs because we feel AI engines aren't aware of the health benefits of a cows egg for breakfast.

Regards,
—
Bruce Horrocks
Hampshire, UK

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30 12:55   ` Steffen Wolfrum
@ 2025-04-30 22:02     ` Rik Kabel
  2025-05-01 10:32       ` Pablo Rodriguez via ntg-context
  2025-05-01 11:37       ` Steffen Wolfrum
  0 siblings, 2 replies; 14+ messages in thread
From: Rik Kabel @ 2025-04-30 22:02 UTC (permalink / raw)
  To: ntg-context


[-- Attachment #1.1: Type: text/plain, Size: 675 bytes --]

On 2025-04-30 08:55, Steffen Wolfrum wrote:

> Do you mean by adding to my document  \setupbackend[xmpfile=…] ?
>
> And if so, how do I actually write a xmpfile?
>
> Steffen
>
I took the route described in the mailing list at 
https://www.mail-archive.com/ntg-context@ntg.nl/msg64956.html and 
inserted it successfully as follows:

|\setupbackend[format={PDF/UA-1}] \setuptagging[state=start] 
\enabledirectives[backend.usetags=crap] \startluacode 
lpdf.injectxmpinfo("xml://rdf:RDF","<rdf:Description rdf:about='' 
xmlns:tdm='http://www.w3.org/ns/tdmrep/'><tdm:reservation>1</tdm:reservation></rdf:Description>",false) 
\stopluacode |

--
Rik

&#8203;

[-- Attachment #1.2: Type: text/html, Size: 9661 bytes --]

[-- Attachment #2: Type: text/plain, Size: 511 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30 21:42   ` Bruce Horrocks
@ 2025-05-01 10:27     ` Pablo Rodriguez via ntg-context
  0 siblings, 0 replies; 14+ messages in thread
From: Pablo Rodriguez via ntg-context @ 2025-05-01 10:27 UTC (permalink / raw)
  To: ntg-context; +Cc: Pablo Rodriguez

On 4/30/25 23:42, Bruce Horrocks wrote:
> [...] 
> If we're going to spend effort to counter AI crawlers then I think I
> would prefer to have an option to poison files.

Many thanks for your reply, Bruce.

I’m afraid that accessibility (an excellent goal in itself) is also the
perfect excuse to convert everything digital into AI forage (whether we
like it or not).

Sorry if I’m missing your point, but I cannot see how to poison content
for AI while leaving them accessible for people with impairments.

As British citizen, you may be exempt to comply, but the Accessibility
Directive is law in the EU.

> Recently, YouTuber F4mi, who creates some excellent deep dives on
> obscure technology, detailed her efforts "to poison any AI
> summarisers that were trying to steal my content to make slop."
> <https://www.youtube.com/watch?v=NEDFUjqA1s8>.

Many thanks for the reference.

> Perhaps ConTeXt could introduce something similar where invisible
> pages or invisible layers of nonsense are auto-generated but can't
> be seen on screen or when a document is printed?
There is something similar (Hans already mentioned it):

  \enabledirectives[backend.pdf.nounicode=​]

You could add any text you want as value.

Cheers,

Pablo
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30 22:02     ` Rik Kabel
@ 2025-05-01 10:32       ` Pablo Rodriguez via ntg-context
  2025-05-01 11:40         ` Steffen Wolfrum
  2025-05-01 14:27         ` Rik Kabel
  2025-05-01 11:37       ` Steffen Wolfrum
  1 sibling, 2 replies; 14+ messages in thread
From: Pablo Rodriguez via ntg-context @ 2025-05-01 10:32 UTC (permalink / raw)
  To: ntg-context; +Cc: Pablo Rodriguez

On 5/1/25 00:02, Rik Kabel wrote:
> [...]
> I took the route described in the mailing list at https://www.mail-
> archive.com/ntg-context@ntg.nl/msg64956.html and inserted it
> successfully as follows:
> 
> |\setupbackend[format={PDF/UA-1}] \setuptagging[state=start]
> \enabledirectives[backend.usetags=crap] \startluacode
> lpdf.injectxmpinfo("xml://rdf:RDF","<rdf:Description rdf:about=''
> xmlns:tdm='http://www.w3.org/ns/tdmrep/'><tdm:reservation>1</
> tdm:reservation></rdf:Description>",false) \stopluacode |

Many thanks for the code, Rik.

Just in case anyone intends to use any form of TDM reservation, it seems
that it might break PDF/A validation.

I haven’t tested it, but there is a warning:
https://www.w3.org/community/reports/tdmrep/CG-FINAL-tdmrep-20240510/#issue-container-generatedID-6.

Just in case it helps,

Pablo
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30 16:32 ` Pablo Rodriguez via ntg-context
  2025-04-30 21:42   ` Bruce Horrocks
@ 2025-05-01 11:34   ` Steffen Wolfrum
  2025-05-01 20:00     ` Pablo Rodriguez via ntg-context
  1 sibling, 1 reply; 14+ messages in thread
From: Steffen Wolfrum @ 2025-05-01 11:34 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Hi Pablo,

… well, it was (again) a demand from publishers, it wasn't my idea. I am absolutely dispassionate regarding this question (as I am more interested in the typographical aspects of typesetting).

But I will tell them your concerns, as they are for sure interesting.

Yours, Steffen


> Am 30.04.2025 um 18:32 schrieb Pablo Rodriguez via ntg-context <ntg-context@ntg.nl>:
> 
> On 4/30/25 07:49, Steffen Wolfrum wrote:
>> Hi,
>> 
>> we just saw the Accessibility Directive, now there is another
>> demand: the TDM protocol (concerning the reuse of protected works
>> for AI training purposes).
> Hi Steffen,
> 
> I have no doubt your suggestion or request is well intended.
> 
> But do you really think this may be worth the hassle?
> 
> I mean, from Llama is suspected that it has been trained with more than
> 80TB of unauthorised copies of books (from shadow libraries
> https://en.wikipedia.org/wiki/Anna%27s_Archive#Meta_lawsuit,
> https://en.wikipedia.org/wiki/Library_Genesis#Alleged_Meta_AI_training).
> 
> Do you really think that reservations will be honoured?
> 
> Even when they honour reservations, research purposes seem to be granted
> without possible reservations (probably also to commercial entities).
> 
> As for the record, Google was taken to court in Spain by an individual
> because of not complying with their robots.txt (not allowing Google to
> crawl their content).
> 
> This went to the Spanish Supreme Court, who ruled that this was a case
> of «ius usus innocui» (right to harmless use; not exactly fair right).
> 
> This was a commercial entity (well, an emporium), making a full copy of
> a copyrighted work and the author has (or had before this ruling) the
> right to control distribution of their work.
> 
> Sorry for being straight, but this is similar to data protection. As
> long as we give our data (just by using your phones or using internet),
> our data are collected, analyzed and traded in order to influence our
> behaviour.
> 
> I’m afraid that this is part of sharing documents on the internet.
> 
> Just my two cents,
> 
> Pablo
> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
> 
> maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
> webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
> archive  : https://github.com/contextgarden/context
> wiki     : https://wiki.contextgarden.net
> ___________________________________________________________________________________

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-04-30 22:02     ` Rik Kabel
  2025-05-01 10:32       ` Pablo Rodriguez via ntg-context
@ 2025-05-01 11:37       ` Steffen Wolfrum
  1 sibling, 0 replies; 14+ messages in thread
From: Steffen Wolfrum @ 2025-05-01 11:37 UTC (permalink / raw)
  To: mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 1577 bytes --]

Thank you Rik. 

My personal context-archive seems to have some white spots in year 2012… I didn't find this thread, thanks for digging it out ;o)

Steffen


> Am 01.05.2025 um 00:02 schrieb Rik Kabel <ConTeXt@rik.users.panix.com>:
> 
> On 2025-04-30 08:55, Steffen Wolfrum wrote:
> 
>> Do you mean by adding to my document  \setupbackend[xmpfile=…] ?
>> 
>> And if so, how do I actually write a xmpfile?
>> 
>> Steffen
>> 
> I took the route described in the mailing list at https://www.mail-archive.com/ntg-context@ntg.nl/msg64956.html <https://www.mail-archive.com/ntg-context@ntg.nl/msg64956.html> and inserted it successfully as follows:
> 
> 
> \setupbackend[format={PDF/UA-1}]
> \setuptagging[state=start]
> \enabledirectives[backend.usetags=crap]
> \startluacode
>     lpdf.injectxmpinfo("xml://rdf:RDF","<rdf:Description rdf:about='' xmlns:tdm='http://www.w3.org/ns/tdmrep/ <http://www.w3.org/ns/tdmrep/>'><tdm:reservation>1</tdm:reservation></rdf:Description>",false)
> \stopluacode
> --
> Rik
> 
> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
> 
> maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
> webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
> archive  : https://github.com/contextgarden/context
> wiki     : https://wiki.contextgarden.net
> ___________________________________________________________________________________


[-- Attachment #1.2: Type: text/html, Size: 11448 bytes --]

[-- Attachment #2: Type: text/plain, Size: 511 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-05-01 10:32       ` Pablo Rodriguez via ntg-context
@ 2025-05-01 11:40         ` Steffen Wolfrum
  2025-05-01 14:27         ` Rik Kabel
  1 sibling, 0 replies; 14+ messages in thread
From: Steffen Wolfrum @ 2025-05-01 11:40 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Hi Pablo,

thank you very much for your attentiveness! I will forward this warning.

Yours, Steffen


> Am 01.05.2025 um 12:32 schrieb Pablo Rodriguez via ntg-context <ntg-context@ntg.nl>:
> 
> On 5/1/25 00:02, Rik Kabel wrote:
>> [...]
>> I took the route described in the mailing list at https://www.mail-
>> archive.com/ntg-context@ntg.nl/msg64956.html and inserted it
>> successfully as follows:
>> 
>> |\setupbackend[format={PDF/UA-1}] \setuptagging[state=start]
>> \enabledirectives[backend.usetags=crap] \startluacode
>> lpdf.injectxmpinfo("xml://rdf:RDF","<rdf:Description rdf:about=''
>> xmlns:tdm='http://www.w3.org/ns/tdmrep/'><tdm:reservation>1</
>> tdm:reservation></rdf:Description>",false) \stopluacode |
> 
> Many thanks for the code, Rik.
> 
> Just in case anyone intends to use any form of TDM reservation, it seems
> that it might break PDF/A validation.
> 
> I haven’t tested it, but there is a warning:
> https://www.w3.org/community/reports/tdmrep/CG-FINAL-tdmrep-20240510/#issue-container-generatedID-6.
> 
> Just in case it helps,
> 
> Pablo
> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
> 
> maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
> webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
> archive  : https://github.com/contextgarden/context
> wiki     : https://wiki.contextgarden.net
> ___________________________________________________________________________________

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-05-01 10:32       ` Pablo Rodriguez via ntg-context
  2025-05-01 11:40         ` Steffen Wolfrum
@ 2025-05-01 14:27         ` Rik Kabel
  2025-05-01 15:27           ` Rik Kabel
  1 sibling, 1 reply; 14+ messages in thread
From: Rik Kabel @ 2025-05-01 14:27 UTC (permalink / raw)
  To: ntg-context

On 2025-05-01 06:32, Pablo Rodriguez via ntg-context wrote:
> On 5/1/25 00:02, Rik Kabel wrote:
>> [...]
>> I took the route described in the mailing list at https://www.mail-
>> archive.com/ntg-context@ntg.nl/msg64956.html and inserted it
>> successfully as follows:
>>
>> |\setupbackend[format={PDF/UA-1}] \setuptagging[state=start]
>> \enabledirectives[backend.usetags=crap] \startluacode
>> lpdf.injectxmpinfo("xml://rdf:RDF","<rdf:Description rdf:about=''
>> xmlns:tdm='http://www.w3.org/ns/tdmrep/'><tdm:reservation>1</
>> tdm:reservation></rdf:Description>",false) \stopluacode |
> 
> Many thanks for the code, Rik.
> 
> Just in case anyone intends to use any form of TDM reservation, it seems
> that it might break PDF/A validation.
> 
> I haven’t tested it, but there is a warning:
> https://www.w3.org/community/reports/tdmrep/CG-FINAL-tdmrep-20240510/#issue-container-generatedID-6.
> 
> Just in case it helps,
> 
> Pablo
> 

Indeed, it does fail VeraPDF tests for PDF/A-3A if no effort is made to 
accept the XMP, but it passes VeraPDF testing as PDF/UA-1.

At https://pdfa.org/resource/iso-14289-pdfua/, one can read:

 > PDF/UA-1 documents may also conform to PDF/A-2 (ISO 19005-2) or 
PDF/A-3 (ISO 19005-3). In such cases, these PDF/A standards require the 
inclusion of XMP Extension Schema for custom XMP metadata not defined in 
the relevant ISO 19005 part. The PDF Association provides free XMP 
Extension Schema templates for PDF/UA-1, other ISO PDF subset standards, 
and PDF Declarations.

The phrase "free XMP Extension Schema templates" links to 
https://pdfa.org/resource/xmp-extension-schema-templates/, where we 
read, inter alia:

 > This W3C report _incorrectly_ notes that PDF/A validators will raise 
errors if TDMRep properties are present in the XMP metadata section of 
PDF documents. As clearly stated in ISO 19005-1 (PDF/A-1), ISO 19005-2 
(PDF/A-2), and ISO 19005 (PDF/A-3) appropriate XMP Extension Schema must 
always be included. This download link (XML) is a free XMP Extension 
Schema template in “PDF/A extension schema container schema” syntax 
appropriate for inclusion in PDF/A-1, PDF/A-2, and PDF/A-3 files that 
supports TDMRep.

I have not succeeded in integrating the provided extension schema.

-\-
Rik

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-05-01 14:27         ` Rik Kabel
@ 2025-05-01 15:27           ` Rik Kabel
  0 siblings, 0 replies; 14+ messages in thread
From: Rik Kabel @ 2025-05-01 15:27 UTC (permalink / raw)
  To: ntg-context


[-- Attachment #1.1: Type: text/plain, Size: 2682 bytes --]

On 2025-05-01 10:27, Rik Kabel wrote:

> On 2025-05-01 06:32, Pablo Rodriguez via ntg-context wrote:
>> On 5/1/25 00:02, Rik Kabel wrote:
>>> [...]
>>> I took the route described in the mailing list at https://www.mail-
>>> archive.com/ntg-context@ntg.nl/msg64956.html and inserted it
>>> successfully as follows:
>>>
>>> |\setupbackend[format={PDF/UA-1}] \setuptagging[state=start]
>>> \enabledirectives[backend.usetags=crap] \startluacode
>>> lpdf.injectxmpinfo("xml://rdf:RDF","<rdf:Description rdf:about=''
>>> xmlns:tdm='http://www.w3.org/ns/tdmrep/'><tdm:reservation>1</
>>> tdm:reservation></rdf:Description>",false) \stopluacode |
>>
>> Many thanks for the code, Rik.
>>
>> Just in case anyone intends to use any form of TDM reservation, it seems
>> that it might break PDF/A validation.
>>
>> I haven’t tested it, but there is a warning:
>> https://www.w3.org/community/reports/tdmrep/CG-FINAL-tdmrep-20240510/#issue-container-generatedID-6. 
>>
>>
>> Just in case it helps,
>>
>> Pablo
>>
>
> Indeed, it does fail VeraPDF tests for PDF/A-3A if no effort is made 
> to accept the XMP, but it passes VeraPDF testing as PDF/UA-1.
>
> At https://pdfa.org/resource/iso-14289-pdfua/, one can read:
>
> > PDF/UA-1 documents may also conform to PDF/A-2 (ISO 19005-2) or 
> PDF/A-3 (ISO 19005-3). In such cases, these PDF/A standards require 
> the inclusion of XMP Extension Schema for custom XMP metadata not 
> defined in the relevant ISO 19005 part. The PDF Association provides 
> free XMP Extension Schema templates for PDF/UA-1, other ISO PDF subset 
> standards, and PDF Declarations.
>
> The phrase "free XMP Extension Schema templates" links to 
> https://pdfa.org/resource/xmp-extension-schema-templates/, where we 
> read, inter alia:
>
> > This W3C report _incorrectly_ notes that PDF/A validators will raise 
> errors if TDMRep properties are present in the XMP metadata section of 
> PDF documents. As clearly stated in ISO 19005-1 (PDF/A-1), ISO 19005-2 
> (PDF/A-2), and ISO 19005 (PDF/A-3) appropriate XMP Extension Schema 
> must always be included. This download link (XML) is a free XMP 
> Extension Schema template in “PDF/A extension schema container schema” 
> syntax appropriate for inclusion in PDF/A-1, PDF/A-2, and PDF/A-3 
> files that supports TDMRep.
>
> I have not succeeded in integrating the provided extension schema.
>
Just modifying a local copy of mkiv/lpdf-pua.xml seems to do it. (There 
is no separate equivalent in the mkxl tree, so this is used for both.)

Both sets of changes (the extension schema and the actual setting of 
tdm) can be done there.

--
Rik

&#8203;

[-- Attachment #1.2: Type: text/html, Size: 10918 bytes --]

[-- Attachment #2: Type: text/plain, Size: 511 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [NTG-context] Re: TDM Reservation Protocol (TDMRep)
  2025-05-01 11:34   ` Steffen Wolfrum
@ 2025-05-01 20:00     ` Pablo Rodriguez via ntg-context
  0 siblings, 0 replies; 14+ messages in thread
From: Pablo Rodriguez via ntg-context @ 2025-05-01 20:00 UTC (permalink / raw)
  To: ntg-context; +Cc: Pablo Rodriguez

On 5/1/25 13:34, Steffen Wolfrum wrote:
> Hi Pablo,
> 
> … well, it was (again) a demand from publishers, it wasn't my idea.

Hi Steffen,

I see… then allow me some comments.

I’m afraid I cannot avoid thinking that the whole thing is not flawed by
design.

Article 4.3 of the Directive EU/2019/790
(https://eur-lex.europa.eu/legal-content/EN/TXT/HTML/?uri=CELEX:32019L0790#004.003)
requires an explicit reservation to legally object to data extraction
for commercial purposes (but not in scientific research).

Well, we are all in by default, in some cases we might opt-out.

Even if AI training complies with these data-extraction reservations,
the vast majority of works from independent creators or small publishers
may even not have means to add it (in machine-readable format).

Requesting the addition of explicit permission (opt-in) would have been
by far a more sensible policy.

The problem still remains with single works and/or creators. How can we
know that a single work has not been used to train AI? Even if one is
caught infringing, it might be so common that nothing happens.

We have already seen that with big media using and not complying with
the terms of works released under free licenses.

Of course, in that case small creators have the right to sue the
infringers. But they lack the money, time and legal teams to even think
of it.

Just in case it might help in some remote case,

Pablo
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2025-05-01 20:09 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2025-04-30  5:49 [NTG-context] TDM Reservation Protocol (TDMRep) Steffen Wolfrum
2025-04-30  8:56 ` [NTG-context] " Gottschalt, Johannes
2025-04-30 12:55   ` Steffen Wolfrum
2025-04-30 22:02     ` Rik Kabel
2025-05-01 10:32       ` Pablo Rodriguez via ntg-context
2025-05-01 11:40         ` Steffen Wolfrum
2025-05-01 14:27         ` Rik Kabel
2025-05-01 15:27           ` Rik Kabel
2025-05-01 11:37       ` Steffen Wolfrum
2025-04-30 16:32 ` Pablo Rodriguez via ntg-context
2025-04-30 21:42   ` Bruce Horrocks
2025-05-01 10:27     ` Pablo Rodriguez via ntg-context
2025-05-01 11:34   ` Steffen Wolfrum
2025-05-01 20:00     ` Pablo Rodriguez via ntg-context

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).