DaniFdezAlvarez / shexer

Apache License 2.0
26 stars 2 forks source link

how to use this on a subset of Wikidata? #149

Closed dpriskorn closed 11 months ago

dpriskorn commented 11 months ago

would it be possible to make notebook or example for this usecase? I found https://github.com/DaniFdezAlvarez/shexer/tree/develop/experiments/wikidata/countries_and_cities but I don't see the code that produced the result there. A notebook would be preferred so I can easily play with it and adapt it to my needs.

DaniFdezAlvarez commented 11 months ago

Hi @dpriskorn . You can find example code showing how to use sheXer against the Wikidata endpoint here: https://github.com/DaniFdezAlvarez/shexer/blob/master/doc/shexer_wikidata_tutorial.ipynb.

If, instead of the endpoint, you want to work against a local subset of a Wikidata dump, you can also do it. But the way of doing it depends on your use case. In case the subset is in turtle/ntriples, you can do it trivially. But if your subset is in JSON, sheXer don't offer currently a parser for that format (if you need it, add an issue and I'll try to implement it). So in that last case you should translate it first to RDF or load the graph in an rdflib graph.

If you provide more details about your use case, I can send you ad-hoc example code for it.

DaniFdezAlvarez commented 11 months ago

@dpriskorn , were you able to run it for your case? do you need some help or may we close this issue?

dpriskorn commented 11 months ago

Thanks, I'll reopen if needed