vliz-be-opsci / py-trav-harv

python module that will allow an enduser to perform link traversal on a triple store.
0 stars 0 forks source link

allow instructing the (multiple) content-type(s) to be retrieved #31

Closed marc-portier closed 4 months ago

marc-portier commented 6 months ago

we recently discovered that e.g. orcid produces quite different representations (amount of included triples) based on what conneg instruction one provides

compare:

$ curl -L --url "https://orcid.org/0000-0003-4305-0617" -H "Accept: text/turtle" 
$ curl -L --url "https://orcid.org/0000-0003-4305-0617" -H "Accept: application/ld+json" 

given this fact, it would be nice to be able to include in the task/executor, which actual content-formats it should be harvesting

e.g. for orcid access, one might want to request both json-ld and triple to maximalise the retrieved content

cedricdcc commented 4 months ago

Closed by merge of #52