Open taojing2002 opened 3 years ago
It seems the document contains some like (32\u00b010\u0004N, 64\u00b030\u0004W)
in the description. So the parser can't handle the unicode.
can you show those characters in context of the schema.org document please?
The original string is:
(32\u00b010\u0004N, 64\u00b030\u0004W)
After expansion (adding context):
(32°10\u0004N, 64°30\u0004W)
In the solr doc before sending to the solr serever:
(32°10N, 64°30W)
In another description, it has the value:
32\u00b0 10'N, 64\u00b0 30'W
after expansion and compaction:
32° 10'N, 64° 30'W
The solr doc is:
32° 10'N, 64° 30'W
It works well.
It seems the author uses \u0004
, which is \u0027
. After I replace \u0004
by \u0027
. Everything works. But I am not sure why solr can't handle EOT (
).
We need to escape the special character in dataone-indexer
When we index the objects from BCODMO, we saw some errors like:
It sounds like the document has some special characters we need to escape.