Closed bpinsard closed 11 years ago
I currently have an "ignore_rule" to skip elements with a VR of 'OW', 'OB', or 'UN' if the value contains non-ASCII characters. However the ignore rules are not applied to the results from translators.
Can you confirm that the bad data is coming from the CsaImage or CsaSeries translators?
I see, the bad data is not in csa, but the VR is LO so it is not handled in the ignore rule.
Intersesting. I don't think it is valid to have non utf8 characters in a value with a VR of LO. Any idea what the value is? Was the file possibly corrupted?
I am not a Dicom expert and it is maybe not valid but it seems that the scanner/export/other allowed non-ascii characters(accent in French) to be entered, the field is RequestedProcedureDescription field.
I believe non-ASCII in an LO value is fine, but not non-utf8. The french accent should be valid utf8.
Is there some way you could send me the data?
This should address the issue.
tag values not utf8 encoded (seems to be possible in data from Siemens scanner) causes crash in json serialization:
is there a way to force conversion ? or remove the tags that cannot convert to utf-8