digital-preservation / droid

DROID (Digital Record and Object Identification)
BSD 3-Clause "New" or "Revised" License

Exporting CSV Error #1130

Open eliu-ual opened 2 days ago

eliu-ual commented 2 days ago

I have a couple of .droid files I’m trying to export to CSV, but I’m receiving an error about an illegal character preventing DROID from generating a CSV file. I’ve attached the beginning of the error message to this email (the full error is very long, but the remainder is just the rest of the filepath).

For background, I’m running DROID on a set of files to support identification of duplicate records. The source was a donated Mac-formatted hard drive, and a copy was then made onto a Windows network drive. I’m running DROID on the Windows copy, and I’m aware there can be loads of issues going from Mac to Windows, as they differ in their tolerance of ‘non-standard’ characters in filenames, such as backslashes.

Is this a common error? Might it be resolved by running a Mac-compatible version of DROID on a Mac-formatted copy?

Thanks in advance for any advice you might have!

Screenshot 2024-08-15 174823

steve-daly commented 2 days ago

DROID is cross-platform, so it's the same code running on every operating system, but the underlying file system behaviour can vary. Can you see what the folder/file names look like in Windows? DROID should be fine with Unicode characters in filenames, but obviously something unusual is happening here. If you can see an unusual filename, could you copy that filename onto a harmless file (e.g. a blank text document) and upload the document here, so we can see whether we can reproduce this with DROID?
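If the names are hard to judge by eye, a small script can list them without relying on how Explorer renders them. The following is only a sketch in Python, not part of DROID: the function names `suspicious_chars` and `scan_tree` are made up for illustration, and the choice of Unicode categories to flag (control, private-use, surrogate, unassigned) is an assumption about what might upset an export, not a confirmed cause of this error. As one known example of an invisible character, macOS stores custom folder icons in a file whose name is `Icon` followed by a carriage return.

```python
import os
import unicodedata

# Unicode general categories flagged here (an assumption, not DROID's own rule):
# Cc = control, Co = private use, Cs = surrogate, Cn = unassigned
FLAGGED_CATEGORIES = ("Cc", "Co", "Cs", "Cn")


def suspicious_chars(name):
    """Return (index, code point, category) for each flagged character in name."""
    found = []
    for i, ch in enumerate(name):
        cat = unicodedata.category(ch)
        if cat in FLAGGED_CATEGORIES:
            found.append((i, f"U+{ord(ch):04X}", cat))
    return found


def scan_tree(root):
    """Yield (full path, findings) for every file or folder with flagged characters."""
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            hits = suspicious_chars(name)
            if hits:
                yield os.path.join(dirpath, name), hits
```

To use it, loop over `scan_tree(root)` for the top-level folder and print `ascii(path)` alongside the findings; `ascii()` escapes any character the console cannot display, so the output stays readable.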

eliu-ual commented 2 days ago

Hi Steve! Thanks for your prompt response. Strangely the filename cited in the error message is pretty simple, just 'Icon'. But I can see some of the other filenames in the same folder could be problematic; these are 'C=P_Ars_electronica_edit3.doc' and 'Ars-final.doc'.

I've attached .txt files with each of these names to support testing.

Icon.txt Ars-final.doc.txt C=P_Ars_electronica_edit3.doc.txt

steve-daly commented 2 days ago

Thanks. In your original screenshot there are some scissors emojis in the filepath. Do you see those in your folder browser/explorer too?

eliu-ual commented 2 days ago

In File Explorer, they appear as boxes - definitely a legacy of material being moved from Mac to Windows:

S:\DAC Projects\Archives and Special Collections Centre\Her Noise Archive\2_HNPLANNING\2_HN_PLANNING\HN12\1_ORGANISERS\HN123\3_Lina\HN12310\PREVIOUS WRITING\Ars Electronica 02 (The Wire)
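Since the same characters show up as scissors in one place and boxes in another, it may help to identify them by code point instead of by appearance. This is a minimal Python sketch under that assumption; `describe_name` is a hypothetical helper, not a DROID feature, and it simply reports every character in a name that is not printable ASCII.

```python
import unicodedata


def describe_name(name):
    """Return (code point, category, Unicode name) for each non-printable-ASCII character."""
    rows = []
    for ch in name:
        if ch.isascii() and ch.isprintable():
            continue  # ordinary letters, digits, and punctuation are skipped
        rows.append(
            (
                f"U+{ord(ch):04X}",
                unicodedata.category(ch),
                # Control and private-use characters have no Unicode name
                unicodedata.name(ch, "<unnamed>"),
            )
        )
    return rows
```

Passing in a folder or file name copied from the affected path would show whether the boxes are control characters, private-use characters, or ordinary symbols that the display font simply lacks.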