digital-preservation / csv-validator

CSV Validation Tool and API (CSV Schema RI)
http://digital-preservation.github.io/csv-validator
Mozilla Public License 2.0
202 stars 54 forks source link

Integritycheck fails on paths including minus signs #129

Closed ghost closed 7 years ago

ghost commented 8 years ago

Integritycheck breaks paths at minus signs, and then fails to find the resource at that path.

For example, given a folder containing a file at:

/mnt/test/RW_32/content/IM_TNA_102013_1015.fluencycontent-schoolwebsite.netdna-ssl.com/IM_TNA_102013_1015.fluencycontent-schoolwebsite.netdna-ssl.com-20151118170135-00000.arc.gz

Integritycheck fails on both the directory and the file:

Error: integrityCheck fails for line: 14006, column: identifier, Cannot find the folder /mnt/test/RW_32/content/IM_TNA_102013_1015.fluencycontent with substitution paths (file:///T:WORK/RW_32,/mnt/test/RW_32) Error: integrityCheck fails for line: 14007, column: identifier, Cannot find the folder /mnt/test/RW_32/content/IM_TNA_102013_1015.fluencycontent-schoolwebsite.netdna-ssl.com/IM_TNA_102013_1015.fluencycontent with substitution paths (file:///T:WORK/RW_32,/mnt/test/RW_32)

In addition the second error message references the folder as the problem, where the folder name is actually correct, and it is the filename which has been truncated.

rhubner commented 7 years ago

Limited fix. We pick up last content folder in whole path.