globalbioticinteractions / usnm

Configuration for GloBI to index collections of the National Museum of Natural History, Smithsonian Institution
1 stars 0 forks source link

indexed records for http://n2t.net/ark:/65665/343f209a5-fc4c-44a5-84b6-cd417cf7f6af #2

Open birdje opened 2 years ago

birdje commented 2 years ago

Hi!

Thanks for helping to make existing biotic interaction data easier to find and access!

I was looking for a record of Marlattiella secunda Compere that is a host on olea chrysophylla. GUID: http://n2t.net/ark:/65665/343f209a5-fc4c-44a5-84b6-cd417cf7f6af

on the globi search i found no results: https://www.globalbioticinteractions.org/?accordingTo=globi%3Aglobalbioticinteractions%2Fusnm&interactionType=interactsWith&sourceTaxon=Marlattiella%20secunda image

I was expecting to see and inteaction with olea chrysophylla supported by emu entomology collection record image

suggest to index notes text [Host: ex # 4430 on olea chrysophylla] as a host of olea chrysophylla

jhpoelen commented 2 years ago

found an interpreted version of http://n2t.net/ark:/65665/343f209a5-fc4c-44a5-84b6-cd417cf7f6af at https://www.gbif.org/occurrence/1318738687 .

Screenshot from 2022-01-13 14-22-09

jhpoelen commented 2 years ago

Also, I was able to find the exact coordinates of the record in a recent USNM extant dwca dataset -

preston cat hash://sha256/f6d133620a665569a13a3fb7ca31b163bf849864812d447238994226d35e3253 | grep si.edu | grep archive | grep hasVersion | grep extant | head -n1 | preston grep 343f209a5-fc4c-44a5-84b6-cd417cf7f6af
<urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/prov#Activity> <urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> .
<urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> <http://www.w3.org/ns/prov#wasInformedBy> <urn:uuid:5a831b3a-ef28-4038-b8dc-5e883d355f12> <urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> .
<urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> <http://www.w3.org/ns/prov#used> <hash://sha256/ccd08a81f18dd40555a4379968575b12f0dab5d8f8aa2e776713e9ad8e43bacd> <urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> .
<urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> <http://purl.org/dc/terms/description> "An activity that finds the locations of text matching the regular expression '343f209a5-fc4c-44a5-84b6-cd417cf7f6af' inside any encountered content (e.g., hash://sha256/... identifiers)."@en <urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> .
<line:zip:hash://sha256/ccd08a81f18dd40555a4379968575b12f0dab5d8f8aa2e776713e9ad8e43bacd!/occurrence.txt!/L2338041> <http://www.w3.org/ns/prov#value> "http://n2t.net/ark:/65665/343f209a5-fc4c-44a5-84b6-cd417cf7f6af  PhysicalObject      urn:lsid:biocol.org:col:34871   urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad   USNM    urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad   NMNH Extant Biology PreservedSpecimen   http://n2t.net/ark:/65665/343f209a5-fc4c-44a5-84b6-cd417cf7f6af             Compere 1   ; Male; Female      PRESENT94   94  1930    4   4                       Ethiopia, Eritrea                   Ethiopia    EritreaNefasit, Eritrea                                 Syntype Marlattiella secunda    Animalia, Arthropoda, Insecta, Hymenoptera, Encyrtidae  Animalia    Arthropoda  Insecta Hymenoptera Encyrtidae  Marlattiella        secunda         Compere" <urn:uuid:812278ec-f4be-4ebd-9f49-239d6a23ab21> .

with content

http://n2t.net/ark:/65665/343f209a5-fc4c-44a5-84b6-cd417cf7f6af PhysicalObject urn:lsid:biocol.org:col:34871 urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad USNM urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad NMNH Extant Biology PreservedSpecimen http://n2t.net/ark:/65665/343f209a5-fc4c-44a5-84b6-cd417cf7f6af Compere 1 ; Male; Female PRESENT94 94 1930 4 4 Ethiopia, Eritrea Ethiopia EritreaNefasit, Eritrea Syntype Marlattiella secunda Animalia, Arthropoda, Insecta, Hymenoptera, Encyrtidae Animalia Arthropoda Insecta Hymenoptera Encyrtidae Marlattiella secunda Compere

As far as I can tell, the occurrence record does not include the remark Host: ex # 4430 on olea chrysophylla .

jhpoelen commented 2 years ago

@birdje any way for you to check whether the "migrated data remark" fields are being exported to DwC-A? If not, would it be able to include them in occurrence remarks or similar field.

jhpoelen commented 2 years ago

Note that the phrase containing 4430 on olea chrysophylla was nowhere to be found in the USNM Extant data publication, suggesting data the remarks are not included in the DwC-A for some reason.

$ preston cat hash://sha256/f6d133620a665569a13a3fb7ca31b163bf849864812d447238994226d35e3253 | grep si.edu | grep archive | grep hasVersion | grep extant | head -n1 | preston grep "4430 on olea chrysophylla"

. . . "An activity that finds the locations of text matching the regular expression '4430 on olea chrysophylla' inside any encountered content (e.g., hash://sha256/... identifiers)."@en .