w3c / tdm-reservation-protocol

Repository of the Text and Data Mining Reservation Protocol Community Group
https://www.w3.org/community/tdmrep/
Other
9 stars 8 forks source link

What about extending tdm reservation protocol to structured data (json-ld, microdata, rdf)? #25

Open claudiotubertini opened 2 years ago

claudiotubertini commented 2 years ago

May be it is more for tdm-protocol 2.0 than for the proposal as it is now, but using something like schema.org vocabulary (https://schema.org/docs/gs.html#advanced, https://schema.org/docs/extension.html) with Microdata, RDFa, or JSON-LD formats, would allow to add copyright information to content inside the html page, that now remains a black box. I think it would not be too difficult to add tdm-reservation field to schema.org. As an example I'm thinking about a single copyrighted image inside a public domain page, or a long text (that could be protected) inside an open access page. Single informations are generally open to scraping but sometimes the web may be quite complex and structured data can help text and data mining.

dascritch commented 1 year ago

May be it's time to resurrect and extend <meta name="copyright" > tag ?

typopaul commented 1 year ago

What about files which could be used offline like EPUB and PDF? In my opinion such files should also contain a machine readable information about permission to be used for TDM or not.

Edit: Ah, I see that this question is also raised in this issue: https://github.com/w3c/tdm-reservation-protocol/issues/31