k-int / XCRI-Aggregator

XCRI Course Related Information - Feed Validator and Aggregator

Duplicate identifiers in OAI-PMH data? #47

Open mhawksey opened 12 years ago

mhawksey commented 12 years ago

Pulling the XCRI OAI-PMH data and paging with resumptionTokens appears to return duplicate records. Here's a log of the IDs pulled, plus a sheet showing which are unique and their counts: https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdG1QaTRkUEI0Y0dZRUo5OGIwa0VRdXc#gid=7

MIJohnson commented 12 years ago

Hi! Please could you send us the URLs you are using to pull the data, along with a sample of the resumption tokens involved, so I can investigate this further? Cheers!
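
For anyone wanting to reproduce the report, here is a rough sketch of one way to page through an OAI-PMH `ListRecords` response and record which resumption token each identifier arrived under. It is not the script used to build the spreadsheet above. The function names, the base URL and the `metadataPrefix` value are placeholders I have assumed for illustration; only the `verb`, `metadataPrefix` and `resumptionToken` request arguments and the XML namespace come from the OAI-PMH protocol itself.

```python
from collections import Counter
from urllib.parse import urlencode
from urllib.request import urlopen
import xml.etree.ElementTree as ET

# Namespace used by all OAI-PMH 2.0 response elements.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def fetch_page(base_url, params):
    """Fetch one response page over HTTP and return the raw bytes."""
    with urlopen(base_url + "?" + urlencode(params)) as resp:
        return resp.read()


def harvest_identifiers(base_url, metadata_prefix, fetch=fetch_page, max_pages=1000):
    """Return a list of (token_used_for_request, [identifiers]) per page.

    `fetch` is injectable so the paging logic can be exercised offline.
    `max_pages` is a safety cap in case the server keeps returning tokens.
    """
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    log = []
    for _ in range(max_pages):
        root = ET.fromstring(fetch(base_url, params))
        # Read the identifier from each record header only, so identifiers
        # inside the metadata payload are not counted.
        ids = []
        for header in root.iter(OAI_NS + "header"):
            ident = header.findtext(OAI_NS + "identifier")
            if ident:
                ids.append(ident.strip())
        log.append((params.get("resumptionToken", ""), ids))
        token_el = root.find(f".//{OAI_NS}resumptionToken")
        token = (token_el.text or "").strip() if token_el is not None else ""
        if not token:
            break  # a missing or empty token marks the last page
        # resumptionToken is an exclusive argument: send it with the verb only.
        params = {"verb": "ListRecords", "resumptionToken": token}
    return log


def find_duplicates(log):
    """Map each identifier seen more than once to its total count."""
    counts = Counter(i for _, ids in log for i in ids)
    return {i: n for i, n in counts.items() if n > 1}

# Example call (placeholder URL and prefix, both assumptions):
# log = harvest_identifiers("https://example.org/oai", "xcri")
# print(find_duplicates(log))
```

The returned log pairs every batch of identifiers with the token that requested it, which is the kind of sample asked for above.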