When I did attempted to process a XML file that contained a single UTF-8 character in the filename, the s/w issues a very non-descript ERROR message.
The ERROR message led me to believe the UTF-8 character was in the body of the XML.
Only after hours of trying to "find" the UTF-8 characters, I finally renamed the file.
And the s/w processed the renamed file.
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/data/home/pds4/pds-doi-service/bin/pds-doi-cmd", line 8, in
sys.exit(main())
File "/data/home/pds4/pds-doi-service/lib/python3.9/site-packages/pds_doi_service/core/cmd/pds_doi_cmd.py", line 42, in main
output = action.run(**kwargs)
File "/data/home/pds4/pds-doi-service/lib/python3.9/site-packages/pds_doi_service/core/actions/release.py", line 322, in run
raise CriticalDOIException(str(err))
pds_doi_service.core.entities.exceptions.CriticalDOIException: 'utf-8' codec can't decode byte 0xd7 in position 1455: invalid continuation byte
π΅οΈ Expected behavior
I expected the s/w to either process a filename having UTF-8 chars OR at least yield a more human readable Error message.
I had the rename both files to ".txt" as files with "xml" cannot be uploaded here.
I did not reproduce that on my local deployment with file bundle_test_20231005.xml . Could that be related to local system library versions or environment settings. That need to be investigated.
Checked for duplicates
No - I haven't checked
π Describe the bug
When I did attempted to process a XML file that contained a single UTF-8 character in the filename, the s/w issues a very non-descript ERROR message.
The ERROR message led me to believe the UTF-8 character was in the body of the XML. Only after hours of trying to "find" the UTF-8 characters, I finally renamed the file. And the s/w processed the renamed file.
During handling of the above exception, another exception occurred:
Traceback (most recent call last): File "/data/home/pds4/pds-doi-service/bin/pds-doi-cmd", line 8, in
sys.exit(main())
File "/data/home/pds4/pds-doi-service/lib/python3.9/site-packages/pds_doi_service/core/cmd/pds_doi_cmd.py", line 42, in main
output = action.run(**kwargs)
File "/data/home/pds4/pds-doi-service/lib/python3.9/site-packages/pds_doi_service/core/actions/release.py", line 322, in run
raise CriticalDOIException(str(err))
pds_doi_service.core.entities.exceptions.CriticalDOIException: 'utf-8' codec can't decode byte 0xd7 in position 1455: invalid continuation byte
π΅οΈ Expected behavior
I expected the s/w to either process a filename having UTF-8 chars OR at least yield a more human readable Error message. I had the rename both files to ".txt" as files with "xml" cannot be uploaded here.
bundle_test_20231005.txt bundle_moon_lro_mini-rf_mosaics_apl_2022.txt
π To Reproduce
π₯ Environment Info
π Version of Software Used
No response
π©Ί Test Data / Additional context
No response
π¦ Related requirements
π¦ #xyz
βοΈ Engineering Details
No response