Linking BOLD DNA sequences to specimens published in GBIF
Linking DNA barcode sequence data from BOLD to specimens in GBIF has high priority in the GBIF work plan. In December 2016, the GBIF Science Committee, represented by SC chair Rod Page, published a snapshot of the iBOL dataset (doi:10.15468/inygc6) comprising 2,789,906 occurrences. However, the link to the museum specimens themselves has not been maintained. Example: gbifKey:1415958347 and the corresponding BOLD data record with processid:LON2542-15.
The most reliable specimen identifier in GBIF is the dwc:occurrenceID. There is also the traditional and more human-readable dwc:catalogNumber, which identifies a museum specimen. In BOLD, the Process ID is the most important identifier for the material samples corresponding to the museum specimens. BOLD also provides a "Museum ID" and a "Sample ID"; however, neither exactly matches the occurrenceID or the catalogNumber in GBIF. The table shows the identifiers for one example specimen in both systems.
GBIF | BOLD |
---|---|
occurrenceKey = 1426521030 | Process ID = NOBAS010-14 |
occurrenceID = urn:catalog:O:F:75130 | Museum ID = O-F-75130 |
catalogNumber = 75130 | Sample ID = O-F-75130 |
eventID/fieldNumber = [blank] | Field ID = MY1-0568 |
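
The table suggests that, for this one record, the BOLD Museum ID and Sample ID can be derived from the GBIF occurrenceID by dropping the `urn:catalog:` prefix and replacing the colons with hyphens. A minimal Python sketch of that conversion follows. The function names are hypothetical, and the pattern is an assumption based only on the single example above, so it may not hold for other institutions or collections.

```python
def occurrence_id_to_bold_id(occurrence_id: str) -> str:
    """Guess the BOLD Museum ID / Sample ID for a GBIF occurrenceID.

    Assumption (from one example only): the occurrenceID looks like
    'urn:catalog:<institution>:<collection>:<number>' and BOLD joins
    the same three parts with hyphens.
    """
    prefix = "urn:catalog:"
    if not occurrence_id.startswith(prefix):
        raise ValueError(f"unexpected occurrenceID format: {occurrence_id!r}")
    return occurrence_id[len(prefix):].replace(":", "-")


def catalog_number_from_bold_id(bold_id: str) -> str:
    """Take the part after the last hyphen as the catalogNumber (assumption)."""
    return bold_id.rsplit("-", 1)[-1]


print(occurrence_id_to_bold_id("urn:catalog:O:F:75130"))  # O-F-75130
print(catalog_number_from_bold_id("O-F-75130"))           # 75130
```

A match produced this way is a candidate link only. It should be confirmed against the BOLD Process ID before being recorded.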
BOLD URL: http://bins.boldsystems.org/index.php/Public_RecordView?processid=NOBAS010-14
BOLD API: http://www.boldsystems.org/index.php/API_Public/sequence?ids=NOBAS010-14
GBIF API: http://api.gbif.org/v1/occurrence/1426521030/verbatim
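
For checking more than one record, the two API URLs above can be built from a Process ID and an occurrenceKey. The sketch below uses only the Python standard library. The endpoint paths are copied from the URLs on this page and may have changed since. The helper names are hypothetical, and nothing is assumed about the format of the responses.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoints as written on this page (assumed to be still valid).
BOLD_SEQUENCE_ENDPOINT = "http://www.boldsystems.org/index.php/API_Public/sequence"
GBIF_OCCURRENCE_ENDPOINT = "http://api.gbif.org/v1/occurrence"


def bold_sequence_url(process_id: str) -> str:
    """URL of the BOLD sequence record for one Process ID."""
    return f"{BOLD_SEQUENCE_ENDPOINT}?{urlencode({'ids': process_id})}"


def gbif_verbatim_url(occurrence_key: int) -> str:
    """URL of the verbatim GBIF occurrence record for one occurrenceKey."""
    return f"{GBIF_OCCURRENCE_ENDPOINT}/{occurrence_key}/verbatim"


def fetch_text(url: str, timeout: float = 30.0) -> str:
    """Download a URL and return the body as text (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # Only prints the URLs; call fetch_text(...) to retrieve the records.
    print(bold_sequence_url("NOBAS010-14"))
    print(gbif_verbatim_url(1426521030))
```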
Feedback on the proposed mapping is most welcome via the issue tracker! What would be the appropriate measurementType and measurementMethod?