Open dshorthouse opened 9 years ago
Could you validate that the harvest is trying to harvest http://web.macs.ualberta.ca:8088/ipt/archive.do?r=entomology (the archive) and not http://web.macs.ualberta.ca:8088/ipt/resource.do?r=entomology (the web page) ?
Sorry, URL in the ticket updated for clarity. Yes, the problem is in reading the archives from UofA via their IPT v2.2, not a mistake on my part in trying to use a landing page URL in place of the actual DwC-A. More details added as well.
Changes in GBIF Metadata Profile version 1.1 used by the IPT introduced in v2.2: http://dev.gbif.org/issues/browse/POR-2560
This impacts the implementation of the ResourceMetadataProcessor but we need to make sure we can still read the previous version.
FATAL DwcaEmlReader:62 - Can't open DwcaEmlReader org.gbif.dwca.io.UnkownDelimitersException: Unable to detect field delimiter
The issue appears to only apply to the CLI whereas there is no problem when using the GUI.
Example DwC-A: http://web.macs.ualberta.ca:8088/ipt/archive.do?r=entomology