Closed nickynicolson closed 8 years ago
Thanks for your report. I'm not sure if it is a valid archive or not (the standard is not always as clear as I'd like it to be), but it seems there are such archives in the wild and it shouldn't be too difficult to support, so I'll give it a try!
I'm a little ambivalent about this issue.
To me, it looks like an error at GBIF to provide such a sample file for their DwCA validator. I'd be tempted to not fix it here (or just the single-directory simple case) and report it as an issue to GBIF. What do you think, @nickynicolson ?
Thanks @niconoe - I agree. Re your first point: I've seen a lot of these single sub-directory archives in use - perhaps from IPT instances, but also from the Scratchpads project and emonocot. If we can jump into the subdir when only one subdir exists, that seems like a good solution.
I also agree that the sample DWCA referenced from the GBIF validator should be cleaner.
That's good to know. I'll implement this "single subdir fix" so at least we support those common archives!
Example archive: http://rs.gbif.org/datasets/german_sl.zip
This is the default archive used in the GBIF DWCA validator
It contains the following files:
The dwca-reader unzips, but fails to find a meta.xml - as it is inside a subdirectory. The following error is produced:
Presumeably this is a valid archive - if so should the reader locate the
meta.xml
and continue relative to that location?