Hyrax is a Ruby on Rails Engine built by the Samvera community. Hyrax provides a foundation for creating many different digital repository applications.
Descriptive summary
This affects Hyrax versions 2.9.5 through 3.4.1 (and the main branch). It is the bug that led to the discovery of #5670.
I'm unsure whether a fix for this is worth merging if that issue gets fixed, but I have a PR ready for this niggle, which I will be merging into heliotrope anyway, and it's nice to have the issue findable here in case someone else runs into it before #5670 is fixed.
So: files with non-ASCII characters in their names will break CharacterizeJob if the job tries to set the FileSet title to its original_file's original_name.
Aside: I assume label gets set correctly at first, i.e. in IngestJob, because it is pulled from the UploadedFile in the standard UI workflow.
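For anyone hitting this, here is a rough plain-Ruby sketch of one way a non-ASCII name can blow up, and a defensive workaround. It is an illustration under assumptions, not Hyrax code: this issue does not pin down the root cause, so the idea that original_name comes back with the right bytes but tagged as binary is a guess, and normalize_filename is a hypothetical helper that does not exist in Hyrax.

```ruby
# Illustration only: plain Ruby, no Hyrax or Fedora calls.
# Assumption: the stored name has the right bytes but is tagged as binary.

utf8_name   = "ファイル.txt"
binary_name = utf8_name.b # same bytes, encoding ASCII-8BIT

# Mixing the binary-tagged name with other non-ASCII UTF-8 text raises.
begin
  "タイトル: " + binary_name
rescue Encoding::CompatibilityError => e
  puts "#{e.class}: #{e.message}"
end

# Hypothetical helper (not part of Hyrax): re-tag the bytes as UTF-8 and
# replace any byte sequences that are not valid UTF-8 with "?".
def normalize_filename(raw)
  name = raw.dup.force_encoding(Encoding::UTF_8)
  name.valid_encoding? ? name : name.scrub("?")
end

# A title built from the normalized name is an ordinary UTF-8 string.
title = [normalize_filename(binary_name)]
puts title.first.encoding
```

If the real cause turns out to be something else (for example, a name that is escaped rather than mis-tagged), re-tagging the encoding like this would not help, so treat it as a starting point for debugging rather than a fix.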
Rationale
It's a new bug and should be fixed.
Expected behavior
You should be able to import a file with non-ASCII characters in the name (potentially assigning the original_name to a Fedora field).
Actual behavior
CharacterizeJob fails with such a file. Ingest does still succeed, however.
Steps to reproduce the behavior
Simply try to add this File to a Work/Monograph and see CharacterizeJob fail: ファイル.txt
Related work
#5670