Open henning-gerhardt opened 4 years ago
METS file cannot be read. This is another job for org.kitodo.dataformat.access.FLocatXmlElementAccess.getAndRepairUri(FileType file)
I don't know how the `[` character was added at this position, but the process title `alldeba_266928358_0001` did not contain this character. So should it be removed manually or during the metadata transformation?
With your change in #3698 I can see even more illegal characters, like normal white space.
I assume the mistake was there before, only now you can see it for the first time.
Sure. I don't know the reason why, or the time when, these illegal characters were "added". Maybe they come from a former migration (1.5.x to 1.6.x or so). Maybe I can fix this for our data, but maybe the application should handle this as well.
@henning-gerhardt, could you make me a list of the illegal characters you found in paths and show how the correct paths should look?
There is no list, and which characters are illegal depends on many things, such as the operating system and file system in use and how you interact with these kinds of characters. I removed all illegal characters (`[`, ` `, ...) that I found for our instance, but I won't know whether this change is correct until we have successfully migrated and checked the data.
Should this be fixed outside of the application or should the application handle this?
Since we don't have a clear error pattern, I would answer your initial question that such errors have to be corrected locally outside the application. Should we still be able to obtain a clear error pattern in the future, which affects several installations, then we can of course also incorporate a correction function here.
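For a local check outside the application, something along the following lines could help to locate affected paths before correcting them. This is only a minimal sketch and not part of Kitodo: the class `PathCharacterCheck`, its methods, the set of characters treated as suspicious (`[`, `]` and white space) and the example path are all assumptions made for illustration. Which characters are really illegal depends on the installation, as noted above.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for a local check; not part of the Kitodo code base. */
public class PathCharacterCheck {

    // Assumption: characters treated as suspicious in this sketch.
    // Adjust this set to the operating system and file system in use.
    private static final String SUSPICIOUS = "[] \t\r\n";

    /** Returns the indexes of all suspicious characters in the given path. */
    static List<Integer> findSuspiciousPositions(String path) {
        List<Integer> positions = new ArrayList<>();
        for (int i = 0; i < path.length(); i++) {
            if (SUSPICIOUS.indexOf(path.charAt(i)) >= 0) {
                positions.add(i);
            }
        }
        return positions;
    }

    /** Returns a copy of the path with all suspicious characters removed. */
    static String stripSuspicious(String path) {
        StringBuilder cleaned = new StringBuilder(path.length());
        for (int i = 0; i < path.length(); i++) {
            char c = path.charAt(i);
            if (SUSPICIOUS.indexOf(c) < 0) {
                cleaned.append(c);
            }
        }
        return cleaned.toString();
    }

    /** Reports whether java.net.URI accepts the string without an exception. */
    static boolean parsesAsUri(String path) {
        try {
            new URI(path);
            return true;
        } catch (URISyntaxException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Hypothetical path; only the process title is taken from this issue.
        String broken = "images/[alldeba_266928358_0001_tif/00000001.tif";
        System.out.println("suspicious positions: " + findSuspiciousPositions(broken));
        System.out.println("parses before cleanup: " + parsesAsUri(broken));
        String cleaned = stripSuspicious(broken);
        System.out.println("cleaned path: " + cleaned);
        System.out.println("parses after cleanup: " + parsesAsUri(cleaned));
    }
}
```

Simply removing the characters matches what was done by hand in this thread, but it is only safe if the files on disk are named without them, so the cleaned paths should be compared against the file system before the metadata files are rewritten.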
After migrating existing metadata files to the new format with the provided transformation file and starting to index all the data, this error appears in the `catalina.out` file:

An excerpt from the metadata file of this process:

I don't know how this error influences the index operation. Should this be fixed outside of the application or should the application handle this?