Closed seltmann closed 1 month ago
run by:
sh find-datasets-with-bees.sh 1>datasets.txt 2>datasets.log
runs in background and lists datasets that are skipped.
cat datasets.log | grep measurementOrFact.csv | grep -Eo "hash://sha256/[a-f0-9]{64}"
After disabling the content cache, I was able to see:
curl 'https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/meta.xml'\
| head
and
curl 'https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/eml.xml'\
| head
For this particular dataset, the eml is:
curl 'https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/eml.xml'\
| xmllint --format -
<?xml version="1.0" encoding="UTF-8"?>
<eml:eml xmlns:eml="eml://ecoinformatics.org/eml-2.1.1" xmlns:dc="http://purl.org/dc/terms/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="eml://ecoinformatics.org/eml-2.1.1 http://rs.gbif.org/schema/eml-gbif-profile/1.0.1/eml.xsd" packageId="09a24d41-4fd1-4b1a-9152-c3ae8166cfda" system="http://symbiota.org" scope="system" xml:lang="eng">
<dataset>
<alternateIdentifier>https://scan-bugs.org:443/portal/collections/misc/collprofiles.php?collid=78</alternateIdentifier>
<title xml:lang="eng">R. M. Bohart Museum of Entomology</title>
<creator id="2f57806a-f37f-40b2-915b-6fcdf6ae100f">
<organizationName>SCAN</organizationName>
<electronicMailAddress>scan_noreply@asu.edu</electronicMailAddress>
<onlineUrl>https://scan-bugs.org:443/portal/index.php</onlineUrl>
</creator>
<metadataProvider>
<organizationName>SCAN</organizationName>
<electronicMailAddress>scan_noreply@asu.edu</electronicMailAddress>
<onlineUrl>https://scan-bugs.org:443/portal/index.php</onlineUrl>
</metadataProvider>
<pubDate>2023-05-17</pubDate>
<language>eng</language>
<abstract>
<para>The Bohart Museum’s mission is to serve the research community, support campus teaching, provide science educational outreach for the public, and to provide diagnostic and information services. The museum houses the seventh largest insect collection in North America. The museum specializes in the unique insect fauna of California, both native and exotic species. Current holdings of nearly 8 million specimens include a wide range of terrestrial invertebrates, specializing in arthropods (insects, millipedes, centipedes, scorpions, mites, ticks, and spiders), crustaceans (pill and sow bugs, fairy and tadpole shrimp), and tardigrade water bears. The largest collections are of the Lepidoptera, Hymenoptera, Coleoptera, Acarina and Tardigrada. The collections are worldwide in scope, with the Americas, south Asia and Australia particularly well-represented. The museum is also the home of the California Insect Survey and sponsors research on the California fauna.</para>
</abstract>
<contact>
<individualName>Lynn S. Kimsey</individualName>
<organizationName>R. M. Bohart Museum of Entomology</organizationName>
<phone/>
<electronicMailAddress>lskimsey@ucdavis.edu</electronicMailAddress>
<onlineUrl>http://Bohart.ucdavis.edu</onlineUrl>
</contact>
<associatedParty>
<individualName>Neil</individualName>
<role>CONTENT_PROVIDER</role>
</associatedParty>
<associatedParty>
<organizationName>R. M. Bohart Museum of Entomology</organizationName>
<individualName>Lynn S. Kimsey</individualName>
<positionName>Collection Manager</positionName>
<role>CONTENT_PROVIDER</role>
<electronicMailAddress>lskimsey@ucdavis.edu</electronicMailAddress>
<phone/>
</associatedParty>
<intellectualRights>
<para>To the extent possible under law, the publisher has waived all rights to these data and has dedicated them to the <ulink url="http://creativecommons.org/licenses/by-nc/4.0/legalcode"><citetitle>CC BY-NC (Attribution-Non-Commercial)</citetitle></ulink>Users can copy, redistribute the material in any medium or format, remix, transform, and build upon the material. The licensor cannot revoke these freedoms as long as you follow the license terms.</para>
</intellectualRights>
</dataset>
<additionalMetadata>
<metadata>
<symbiota>
<dateStamp>2023-05-17T04:58:01-07:00</dateStamp>
<citation identifier="08369cae-9603-4294-9f5e-d879690564cc">SCAN - 08369cae-9603-4294-9f5e-d879690564cc</citation>
<physical>
<characterEncoding>UTF-8</characterEncoding>
<dataFormat>
<externallyDefinedFormat>
<formatName>Darwin Core Archive</formatName>
</externallyDefinedFormat>
</dataFormat>
</physical>
<collection identifier="79aa23b2-db5f-4b44-9484-e8af67d3db86" id="78">
<alternateIdentifier>https://scan-bugs.org:443/portal/collections/misc/collprofiles.php?collid=78</alternateIdentifier>
<parentCollectionIdentifier>UCD</parentCollectionIdentifier>
<collectionIdentifier>BMEC</collectionIdentifier>
<collectionName>R. M. Bohart Museum of Entomology</collectionName>
<resourceLogoUrl>http://scan-bugs.org/portal/images/collicons/bohart.png</resourceLogoUrl>
<onlineUrl>http://Bohart.ucdavis.edu</onlineUrl>
<intellectualRights>http://creativecommons.org/licenses/by-nc/4.0/</intellectualRights>
<additionalInfo>University of California</additionalInfo>
<abstract>
<para>The Bohart Museum’s mission is to serve the research community, support campus teaching, provide science educational outreach for the public, and to provide diagnostic and information services. The museum houses the seventh largest insect collection in North America. The museum specializes in the unique insect fauna of California, both native and exotic species. Current holdings of nearly 8 million specimens include a wide range of terrestrial invertebrates, specializing in arthropods (insects, millipedes, centipedes, scorpions, mites, ticks, and spiders), crustaceans (pill and sow bugs, fairy and tadpole shrimp), and tardigrade water bears. The largest collections are of the Lepidoptera, Hymenoptera, Coleoptera, Acarina and Tardigrada. The collections are worldwide in scope, with the Americas, south Asia and Australia particularly well-represented. The museum is also the home of the California Insect Survey and sponsors research on the California fauna.</para>
</abstract>
</collection>
</symbiota>
</metadata>
</additionalMetadata>
</eml:eml>
https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/meta.xml https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/eml.xml