bio-guoda / preston

a biodiversity dataset tracker
MIT License
24 stars 1 forks source link

build image corpus from https://zenodo.org/record/5558840#.YmMcO5PMLzc via catalogue numbers #169

Open jhpoelen opened 2 years ago

jhpoelen commented 2 years ago

@seltmann fyi

jhpoelen commented 2 years ago

related to #147

suggest to include examples for merging images (e.g., preston track image.jpg) and merging preston archives describing the provenance of images (e.g., preston merge [content id preston archive]).

resulting artifact includes:

  1. README with instructions, content ids etc.
  2. csv file with description of content (scientific names, catalog number vouchers, related images)
  3. preston data archive
jhpoelen commented 2 years ago

@seltmann could you please give me some example of scientific names and catalogue numbers (and other identifying information like institutionid, collection code, occurrence id) for vouchers.

seltmann commented 2 years ago

scientificName,scientificNameAuthorship,catalogNumber,collectionCode,institutionCode,occurrenceID Andrena angustitarsata,"Viereck, 1904",UCSB-IZC00036109,IZC,UCSB,3480a221-558d-42a6-b29a-ee4e89f58790 Bombus vosnesenskii,"Radoszkowski, 1862",UCSB-IZC00041355,IZC,UCSB,dfd0aa69-7419-4d6f-ae46-0eaf6697cd22 Bombus vosnesenskii,"Radoszkowski, 1862",AMNH_BEE 00099372,AMNB,BEE,urn:uuid:84cb9d64-d8e1-11e2-99a2-0026552be7ea

jhpoelen commented 2 years ago

https://github.com/bio-guoda/preston/issues/169#issuecomment-1106904263 as file available at:

vouchers.csv