sul-dlss-deprecated / rialto-etl

ETL tools for RIALTO, Stanford Libraries' research intelligence project
https://library.stanford.edu/projects/rialto
Apache License 2.0
3 stars 0 forks source link

Encoding issues in website display #335

Closed peetucket closed 5 years ago

peetucket commented 5 years ago

There still appears to be encoding issues in the website -- not sure where it is coming from.

screen shot 2018-12-12 at 11 01 34 am
peetucket commented 5 years ago

Seen in -dev in collaboration report, country view.

jcoyne commented 5 years ago

When I run this query against Neptune, I can see that the data is messed up there:

select ?country ?label
WHERE { <http://sul.stanford.edu/rialto/agents/people/71a2b7d79f2038a9c8850f9b1ec0766f> <http://purl.org/dc/terms/spatial> ?country .
                 ?country rdfs:label ?label .
}
curl --data-urlencode query@one-document.sparql http://localhost:9999/sparql
{
  "head" : {
    "vars" : [ "country", "label" ]
  },
  "results" : {
    "bindings" : [ {
      "country" : {
        "type" : "uri",
        "value" : "http://sws.geonames.org/1814991/"
      },
      "label" : {
        "type" : "literal",
        "value" : "People���s Republic of China"
      }
    } ]
  }
}
jcoyne commented 5 years ago

We are able to push smart quotes through the sparql-loader and it seems to look good in neptune.

jcoyne commented 5 years ago

We think this is fixed by https://github.com/sul-dlss/sparql-loader/commit/92e21c3c63248da4b6aa8aeb69fdfaaa71d0af87, and all of the bad data loaded before that fix was in place.