Open arnaumevi opened 1 year ago
Thank you. Just to confirm, you WERE able to harvest from this OAI archive successfully, before upgrading to 5.11.1, correct?
A quick followup: This isn't mentioned in this issue here, but the original report in the Google group suggests that these failures started happening after the upgrade to 5.11.1. Having looked at this OAI server and the failures, I don't think these OAI_DC records would have been imported successfully by any version of Dataverse. So if you were able to harvest from this archive previously, they must have changed their record format on the server side since then.
The short answer is that Dataverse can't import these OAI_DC records because they don't have persistent identifiers in any of the <dc:identifier>
fields, for example:
<dc:identifier>https://ddd.uab.cat/record/166606</dc:identifier>
<dc:identifier>urn:oai:ddd.uab.cat:166606</dc:identifier>
<dc:identifier>urn:10.5565/ddd.uab.cat/166606</dc:identifier>
<dc:identifier>urn:articleid:14712202</dc:identifier>
i.e. Dataverse wants one of these fields to contain either a DOI or a Handle identifier.
This is our fault, in more than one way:
We have an open issue for improving the client-side harvesting functionality that should address 1. above - we'll make more and better diagnostics visible to the admin; I'm hoping that it will be prioritized and addressed soon. As for 2., I have brought this up with the dev. team and we at least started talking about this.
But, unfortunately, this is not something we can fix for you, and/or something you can fix with a configuration change, right away.
Is related to the previous issue :
Hi, I'm having trouble harvesting Clients with the Dataverse 5.11.1 version. I get the message javax.ejb.EJBTransactionRolledbackException, Exception thrown from bean: java.lang.NullPointerException on the server log
Client configurations:
Here is the log for the attempt : harvest_UAB_2023-01-24T13-21-32.log
Thank you for your time in advance, Best Regards, Arnau