PASTAplus / cite

A data citation generator for PASTA data packages
Apache License 2.0
2 stars 0 forks source link

Raw citation json not properly encoding unicode characters #13

Closed servilla closed 3 years ago

servilla commented 3 years ago

See this ticket: https://github.com/PASTAplus/DataPortal/issues/66

We have a creator with a surname containing an "o" with an acute accent. This all seems to work fine with the list of creators and the personnel details in the full metadata. However, the character does not appear to be handled properly in translation to rendering the Citation on the data package summary page. See package edi.498 on the staging server for an example. Tagging @clnsmth just as an FYI.

I have confirmed that the Level-1-EML correctly encodes the accented lowercase "o" (ó), but is not represented correctly in the citation JSON cache file (see staging environment edi.498.1.json).

servilla commented 3 years ago

Requests was not able to automatically set the encoding after reading the metadata. Encoding is now coerced to "UTF-8": bae6a5f