springernature / scigraph

A Linked Open Data platform for the scholarly domain.
http://www.springernature.com/scigraph
73 stars 17 forks source link

will there be any linking to Wikidata? #2

Open egonw opened 7 years ago

egonw commented 7 years ago

For example, for the journals this would be nice, as many of them are in Wikidata already (thanks to projects like Wikicite)? To get an idea what is in there, check out @fnielsen's https://github.com/fnielsen/scholia ...

image

Source: https://tools.wmflabs.org/scholia/venue/Q180445

lambdamusic commented 7 years ago

Hi Egon, most definitely! We're just about to release some initial mappings, we'll keep you posted about that.

egonw commented 6 years ago

hi @lambdamusic, I was just talking to @SamWinthrop about this... we have some ideas in mind. The first step would be to get a Wikidata Property so that Wikidata can link to your graphs... a formatter URL (for webpage and for RDF) would be useful...

lambdamusic commented 6 years ago

hi @egonw just looking at this again. Don't know exactly what you mean by 'formatter URL' but maybe the redirect API could do? http://scigraph.springernature.com/explorer/api/#redirect Essentially you can retrieve the SciGraph URI by passing the journal ISSN eg http://scigraph.springernature.com/api/redirect?issn=2365-631X

pigsonthewing commented 6 years ago

A formatter URL is used in Wikidata to specify the pattern for linking identifiers. for example, instead of storing the full URL of an individuals ORCID iD:

https://orcid.org/0000-0001-5882-6823

we store just the ID value (i.e. the URL slug):

0000-0001-5882-6823

and the formatter URL:

https://orcid.org/$1

By replacing the "$1" in the formatter URL with the stored ID value, we can build the full URL again.

This way, if a site changes its URL structure, we only have to make one change.; and for some sites we can store multiple formatter URLs (or indeed, formatter URLs on third-party sites).

lambdamusic commented 6 years ago

Got ya. Thanks for the explanation.

I suppose then that if you have a DOI (or ISSN, ISBN) in wikidata you can consider it the ID value, which then can be used with this formatter URL:

http://scigraph.springernature.com/api/redirect?doi=$1

Of course for an ISSN you want issn=$1 etc.. as per the docs .

This approach would be resilient also in case SciGraph URIs happen to change in the future.

egonw commented 6 years ago

@lambdamusic, I think what we need is formatters based on the IRIs used for journals and articles... along with the mappings of SciGraph<>ISBN, SciGraph<>ISSN, SciGraph<>DOI, of course... something like:

http://scigraph.springernature.com/things/journals/$1
http://scigraph.springernature.com/things/articles/$1
lambdamusic commented 6 years ago

@egonw I don't have enough context for judging whether one approach is more suited than the other. Surely though, what you are suggesting is more laborious than using redirect URIs, and probably more fragile too (for the reason I mentioned above).

Anyways, here's some more info:

Hope this helps!

egonw commented 6 years ago

Well, that is why I was wondering if the SciGraph IDs were stable and public... if they are not, then ISSN/DOI is what is left, but then we don't need a Wikidata predicate...

For the semantic web layer, do you have owl:sameAs relations between the Scigraph ID-based IRIs and an IRI based on the DOI or ISSN? Otherwise, there would not be a semantic link possible from the Wikidata side... and we'd rely on SciGraph to make links to Wikidata (which is perfectly fine, of course...)

lambdamusic commented 6 years ago

Publications have a property sg:doiLink pointing to crossref DOIs. Eg see http://scigraph.springernature.com/things/articles/e4fe974547d30d62aeabe83de034738d#table

So, not owl:sameAs, although it's trivial to generate it via a rule if that's what you need.

SciGraph URIs are meant to be stable and public. But other than a firm intent to accomplish that, I don't know what else to say!