Big-Bee-Network / bif

Biodiversity Interaction Finder (pronounced Bifi) - using Preston and Elton to detect bee interactions in existing biodiversity data infrastructures like GBIF/iDigBio/ALA etc.
Creative Commons Zero v1.0 Universal
1 stars 0 forks source link

once found dataset, how do you retrieve the eml? #9

Closed seltmann closed 1 month ago

seltmann commented 1 month ago

https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/meta.xml https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/eml.xml

seltmann commented 1 month ago

run by:

sh find-datasets-with-bees.sh 1>datasets.txt 2>datasets.log

runs in background and lists datasets that are skipped.

cat datasets.log | grep measurementOrFact.csv | grep -Eo "hash://sha256/[a-f0-9]{64}"

jhpoelen commented 1 month ago

After disabling the content cache, I was able to see:

curl 'https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/meta.xml'\
 | head

and

curl 'https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/eml.xml'\
 | head
jhpoelen commented 1 month ago

For this particular dataset, the eml is:

curl 'https://linker.bio/zip:hash://sha256/b90a523c156babba2b495fa7ed67a9ad280d6cef585e0d4fbdc1f8c566cb57a8!/eml.xml'\
 | xmllint --format -
<?xml version="1.0" encoding="UTF-8"?>
<eml:eml xmlns:eml="eml://ecoinformatics.org/eml-2.1.1" xmlns:dc="http://purl.org/dc/terms/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="eml://ecoinformatics.org/eml-2.1.1 http://rs.gbif.org/schema/eml-gbif-profile/1.0.1/eml.xsd" packageId="09a24d41-4fd1-4b1a-9152-c3ae8166cfda" system="http://symbiota.org" scope="system" xml:lang="eng">
  <dataset>
    <alternateIdentifier>https://scan-bugs.org:443/portal/collections/misc/collprofiles.php?collid=78</alternateIdentifier>
    <title xml:lang="eng">R. M. Bohart Museum of Entomology</title>
    <creator id="2f57806a-f37f-40b2-915b-6fcdf6ae100f">
      <organizationName>SCAN</organizationName>
      <electronicMailAddress>scan_noreply@asu.edu</electronicMailAddress>
      <onlineUrl>https://scan-bugs.org:443/portal/index.php</onlineUrl>
    </creator>
    <metadataProvider>
      <organizationName>SCAN</organizationName>
      <electronicMailAddress>scan_noreply@asu.edu</electronicMailAddress>
      <onlineUrl>https://scan-bugs.org:443/portal/index.php</onlineUrl>
    </metadataProvider>
    <pubDate>2023-05-17</pubDate>
    <language>eng</language>
    <abstract>
      <para>The Bohart Museum’s mission is to serve the research community, support campus teaching, provide science educational outreach for the public, and to provide diagnostic and information services. The museum houses the seventh largest insect collection in North America. The museum specializes in the unique insect fauna of California, both native and exotic species. Current holdings of nearly 8 million specimens include a wide range of terrestrial invertebrates, specializing in arthropods (insects, millipedes, centipedes, scorpions, mites, ticks, and spiders), crustaceans (pill and sow bugs, fairy and tadpole shrimp), and tardigrade water bears. The largest collections are of the Lepidoptera, Hymenoptera, Coleoptera, Acarina and Tardigrada. The collections are worldwide in scope, with the Americas, south Asia and Australia particularly well-represented. The museum is also the home of the California Insect Survey and sponsors research on the California fauna.</para>
    </abstract>
    <contact>
      <individualName>Lynn S. Kimsey</individualName>
      <organizationName>R. M. Bohart Museum of Entomology</organizationName>
      <phone/>
      <electronicMailAddress>lskimsey@ucdavis.edu</electronicMailAddress>
      <onlineUrl>http://Bohart.ucdavis.edu</onlineUrl>
    </contact>
    <associatedParty>
      <individualName>Neil</individualName>
      <role>CONTENT_PROVIDER</role>
    </associatedParty>
    <associatedParty>
      <organizationName>R. M. Bohart Museum of Entomology</organizationName>
      <individualName>Lynn S. Kimsey</individualName>
      <positionName>Collection Manager</positionName>
      <role>CONTENT_PROVIDER</role>
      <electronicMailAddress>lskimsey@ucdavis.edu</electronicMailAddress>
      <phone/>
    </associatedParty>
    <intellectualRights>
      <para>To the extent possible under law, the publisher has waived all rights to these data and has dedicated them to the <ulink url="http://creativecommons.org/licenses/by-nc/4.0/legalcode"><citetitle>CC BY-NC (Attribution-Non-Commercial)</citetitle></ulink>Users can copy, redistribute the material in any medium or format, remix, transform, and build upon the material. The licensor cannot revoke these freedoms as long as you follow the license terms.</para>
    </intellectualRights>
  </dataset>
  <additionalMetadata>
    <metadata>
      <symbiota>
        <dateStamp>2023-05-17T04:58:01-07:00</dateStamp>
        <citation identifier="08369cae-9603-4294-9f5e-d879690564cc">SCAN - 08369cae-9603-4294-9f5e-d879690564cc</citation>
        <physical>
          <characterEncoding>UTF-8</characterEncoding>
          <dataFormat>
            <externallyDefinedFormat>
              <formatName>Darwin Core Archive</formatName>
            </externallyDefinedFormat>
          </dataFormat>
        </physical>
        <collection identifier="79aa23b2-db5f-4b44-9484-e8af67d3db86" id="78">
          <alternateIdentifier>https://scan-bugs.org:443/portal/collections/misc/collprofiles.php?collid=78</alternateIdentifier>
          <parentCollectionIdentifier>UCD</parentCollectionIdentifier>
          <collectionIdentifier>BMEC</collectionIdentifier>
          <collectionName>R. M. Bohart Museum of Entomology</collectionName>
          <resourceLogoUrl>http://scan-bugs.org/portal/images/collicons/bohart.png</resourceLogoUrl>
          <onlineUrl>http://Bohart.ucdavis.edu</onlineUrl>
          <intellectualRights>http://creativecommons.org/licenses/by-nc/4.0/</intellectualRights>
          <additionalInfo>University of California</additionalInfo>
          <abstract>
            <para>The Bohart Museum’s mission is to serve the research community, support campus teaching, provide science educational outreach for the public, and to provide diagnostic and information services. The museum houses the seventh largest insect collection in North America. The museum specializes in the unique insect fauna of California, both native and exotic species. Current holdings of nearly 8 million specimens include a wide range of terrestrial invertebrates, specializing in arthropods (insects, millipedes, centipedes, scorpions, mites, ticks, and spiders), crustaceans (pill and sow bugs, fairy and tadpole shrimp), and tardigrade water bears. The largest collections are of the Lepidoptera, Hymenoptera, Coleoptera, Acarina and Tardigrada. The collections are worldwide in scope, with the Americas, south Asia and Australia particularly well-represented. The museum is also the home of the California Insect Survey and sponsors research on the California fauna.</para>
          </abstract>
        </collection>
      </symbiota>
    </metadata>
  </additionalMetadata>
</eml:eml>