kasei / attean

A Perl Semantic Web Framework
19 stars 10 forks source link

Verify correct escaping in use of Turtle and SPARQL tokens #55

Open kasei opened 8 years ago

kasei commented 8 years ago

I suspect that the $token->value values are sometimes (or perhaps always) being stored in an escaped form (e.g. when parsing Turtle unicode escapes like \u0063). This needs to be fixed, so that the stored value is always the actual unicode string intended. If this requires changes, then the Turtle and SPARQL serializers need to be verified as re-encoding where necessary.

kasei commented 8 years ago

Also consider changing the LANG token at the same time to drop the "@". Similar tokens like VAR do not keep their sigil in the stored value.

kasei commented 8 years ago

Simple tests for Turtle added in c04b64b.