stain / jena-docker

Docker image for Apache Jena riot
Apache License 2.0

Character set issue #38

Closed ebal5 closed 1 year ago

ebal5 commented 4 years ago

I loaded data from YAGO4 in N-Triples format. When I ran the query below, I got a list of results like the one in the screenshot, and more.

SELECT ?subject ?predicate ?object
WHERE {
  ?subject ?predicate ?object
}
LIMIT 25

My issue is that the character "�" does not appear in the original data but shows up in the returned data. I think it happens in strings that contain accented characters.

What has caused it?

kinow commented 3 years ago

I think it was caused by the character encoding used when loading the data into Jena. I can't think of anything in these Docker images that would cause it, so it may be better to check Stack Overflow or the Jena docs.
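To illustrate the kind of encoding mismatch suggested here, below is a minimal Python sketch (not Jena code). The helper name `decode_lossy` and the sample string are hypothetical. It shows how accented characters can turn into "�" (U+FFFD) when bytes are decoded with a charset other than the one they were written in. Whether this is what happened in the report above is not confirmed.

```python
REPLACEMENT = "\ufffd"  # U+FFFD, rendered as "�"

def decode_lossy(raw: bytes, charset: str) -> str:
    """Decode bytes, substituting U+FFFD for anything invalid in `charset`."""
    return raw.decode(charset, errors="replace")

original = "café"  # hypothetical sample literal with an accented character

# Correct: UTF-8 bytes decoded as UTF-8 round-trip unchanged.
ok = decode_lossy(original.encode("utf-8"), "utf-8")

# Mismatch 1: UTF-8 bytes decoded as ASCII; each non-ASCII byte becomes U+FFFD.
as_ascii = decode_lossy(original.encode("utf-8"), "ascii")

# Mismatch 2: Latin-1 bytes decoded as UTF-8; the lone 0xE9 byte is invalid.
as_utf8 = decode_lossy(original.encode("latin-1"), "utf-8")

print(ok, as_ascii, as_utf8)
```

If the returned strings contain "�" only where the source had accented letters, comparing the raw bytes of the input file with the declared or default charset at each step (load, query, display) is a reasonable first check.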

ebal5 commented 3 years ago

I think it may be caused by Java, because it uses UTF-16 as its internal character encoding. But it is no longer a problem for me, as I have left the project. Thanks for replying.

This problem may not be solved, so that is why I won't close this issue.
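On the UTF-16 point: UTF-16 can represent every Unicode character, so an internal UTF-16 representation by itself should not turn accented letters into "�". The small Python sketch below (a hypothetical helper and hypothetical sample strings, not Java or Jena code) shows that a round trip through UTF-16 is lossless, which suggests looking at the decoding or encoding steps at the boundaries instead.

```python
def roundtrip_utf16(text: str) -> str:
    """Encode text as UTF-16 (little-endian) and decode it back."""
    return text.encode("utf-16-le").decode("utf-16-le")

# Hypothetical samples; the last one lies outside the Basic Multilingual
# Plane and therefore needs a surrogate pair in UTF-16.
samples = ["café", "Zürich", "São Paulo", "\U0001D11E"]

# True when every sample survives the round trip unchanged.
all_preserved = all(roundtrip_utf16(s) == s for s in samples)
print(all_preserved)
```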

kuzeko commented 1 year ago

I believe this is a problem with Jena, not with this Docker container setup, so I will close it.