Closed barrynl closed 4 years ago
Hi @barrynl ,
in an attempt to resolve this issue, I've tried to run your query without grlc in Yasgui, a sparql editor. It appears that the endpoint is not available?
http://yasgui.org/short/H1V5NSz7-
Best,
Richard
Hi @barrynl
Good catch. Those URI's though are not generated by grlc, but by (Git2PROV)[https://github.com/IDLabResearch/Git2PROV] which we're using to generate PROV out of the repo's commit history. At a closer look, it seems that the bug might be in rdflib's parser instead, because it seems legal to have parenthesis in URIs. But this contradicts NodeJS Turtle validator, which also complains about those pars.
For the time being we're just skipping the parsing of Git2PROV's output if it fails.
Thanks! Albert
Hi, thanks for your replies.
@rlzijdeman Sorry, yes, you are correct. I run GRLC in a docker-compose setup, so the endpoints make no sense outside that context. This means my "Try: http://grlc.io/api/barrynl/uncertainty-sparql" text above will also not work once the git2prov issue @albertmeronyo describes is resolved.
@albertmeronyo Thanks for the explanation. According to this post, URIs may contain parentheses: https://stackoverflow.com/a/1547940. Did not double check myself, though. But maybe IRIs have a different valid characters set.
Regards, Barry
Do you want me to close this issue? Because you talk about 'for the time being' as if you are planning to find a better solution in the future :)
Hi @barrynl , thanks for your comment.
I opened an issue at the RDFLib tracker when we investigated this: https://github.com/RDFLib/rdflib/issues/752 But so far it hasn't caught attention. I just updated that issue with a related link to the Turtle spec that deals with escaping special characters --might be related.
Shall we wait a bit more for a response before closing? In the worst case scenario, is it an option to edit the history of the git repo? And: how bad is the current behavior to you (i.e. skipping the PROV generation for the commit that contains the filenames with pars)?
Best, Albert
It looks like the issue in rdflib never got fixed, and neither did this issue.
I will close this issue, please reopen it if it is still relevant.
Hi,
I have created a GitHub repository that breaks GRLC because it's commit history contains files with invalid characters. Try: http://grlc.io/api/barrynl/uncertainty-sparql
The error in the log can be found at the end of this issue.
I have reduced the repository to a minimal example to debug the error and it is caused by this commit:
barrynl/uncertainty-sparql@5419d3e
The commit contains a filename with parenthesis (give-me-all-uncertainty-values-(and-causes)-per-sentences.nq) and these parenthesis are directly included in the temp.prov.ttl file which breaks the turtle parser because URIs cannot contain parenthesis.
So, does anyone know a work around (maybe remove this particular commit)? Currently, I've created a new repository with the same files but without the commit history and this indeed works correct.
I think the code of GRLC can be improved by checking the commit files for filenames that contain characters that cannot be used in URIs.
Regards,
Barry