gbif / collection-mobilization

This repository is used to coordinate data mobilisation based on GRSciColl (https://www.gbif.org/grscicoll) entries.
3 stars 0 forks source link

Department of Biology and Ecology, Faculty of Sciences, University of Novi Sad #2108

Open ManonGros opened 1 year ago

ManonGros commented 1 year ago

University of Novi Sad

GRSciColl URL: https://www.gbif.org/grscicoll/institution/e208c3eb-21b7-42cd-9ff0-0dafea1d23d1 Synchronized with IH Code: DBE-UNS (removed BUNS) Homepage: http://www.dbe.uns.ac.rs Fuzzy name search in GBIF publisher:

Number of linked occurrences: 2,199 (02.02.2024) 0 (03.10.2023) (https://www.gbif.org/occurrence/search?advanced=1&institution_key=e208c3eb-21b7-42cd-9ff0-0dafea1d23d1)


  1. Find out if the information is complete and up to date:
  1. Check if the data is also in GBIF:
spalp commented 1 year ago
spalp commented 1 year ago

I created the collection Insecta and added _GRSCICOLLURI and UUID of the collection as identifiers: https://registry.gbif.org/collection/59093a0f-9471-4182-9e56-455d9ce85672/identifier. | CollectionCode: Insecta. @ManonGros Could you link the following dataset: Prime Hoverfly (Insecta: Diptera: Syrphidae) Areas (PHA) as a conservation tool in Serbia to the collection?

ManonGros commented 1 year ago

Thanks @spalp, I can actually add default values for specified fields in a given dataset. (By "default", I mean that it will only apply if no other value is provided by the publisher for that field). I have added a default collection and institution code for the dataset Prime Hoverfly (Insecta: Diptera: Syrphidae) Areas (PHA) as a conservation tool in Serbia. You can see them here: https://registry.gbif.org/dataset/9060319d-ceb2-44ac-9423-efaa4a88214f/defaultValue

Because you just created the collection entry, I need to wait a few days before reinterpreting the dataset and linking the occurrences. As the GRSciColl cache is only updated twice a week. I am temporarily co-assigning myself so I don't forget to reinterpret the data.

PS: the Serbian dataset is orphaned. This means that the endpoint was unresponsive for a long time and we made a copy of the dataset that we are hosting on our system. If you ever contact the publisher, it would be a good idea to ask if they would be interested in adopting back the dataset (meaning to host it again). You can learn more about orphaned datasets here: https://github.com/gbif/watchdog/wiki/OrphanedDatasets

ManonGros commented 1 year ago

I reinterpreted the dataset and it got linked actually.

spalp commented 1 year ago

Thanks @spalp, I can actually add default values for specified fields in a given dataset. (By "default", I mean that it will only apply if no other value is provided by the publisher for that field). I have added a default collection and institution code for the dataset Prime Hoverfly (Insecta: Diptera: Syrphidae) Areas (PHA) as a conservation tool in Serbia. You can see them here: https://registry.gbif.org/dataset/9060319d-ceb2-44ac-9423-efaa4a88214f/defaultValue

Oh, great. I did not know one can look at default values of datasets this way. I was always going through individual records, which took so much time. Thanks for sharing.

ManonGros commented 1 year ago

@spalp only admins can add default values If you would like to get an overview of the codes used for a given selection of occurrences, you can do so via the API and/or (for the institution codes) in the web interface. For example, https://www.gbif.org/occurrence/charts?dataset_key=9060319d-ceb2-44ac-9423-efaa4a88214f&d=institutionCode

I can show how to work with the API on our Friday meeting if we have time.

spalp commented 1 year ago

PS: the Serbian dataset is orphaned. This means that the endpoint was unresponsive for a long time and we made a copy of the dataset that we are hosting on our system. If you ever contact the publisher, it would be a good idea to ask if they would be interested in adopting back the dataset (meaning to host it again). You can learn more about orphaned datasets here: https://github.com/gbif/watchdog/wiki/OrphanedDatasets

Yes, I will make it part of my data mobilization plan:

Data mobilization plan:

spalp commented 11 months ago

Linking occurrences to the institution

Step 1: Check which institution codes need manual mapping?

https://api.gbif.org/v1/grscicoll/lookup?institutionCode=DBE-UNS - correct match https://api.gbif.org/v1/grscicoll/lookup?institutionCode=UNSPMF - correct match https://api.gbif.org/v1/grscicoll/lookup?institutionCode=FSUNS - correct match https://api.gbif.org/v1/grscicoll/lookup?institutionCode=BUNS - no match, needs manual mapping

Step 2: Check occurrences that use the above institution codes?

https://www.gbif.org/occurrence/search?basis_of_record=LIVING_SPECIMEN&basis_of_record=FOSSIL_SPECIMEN&basis_of_record=PRESERVED_SPECIMEN&basis_of_record=MATERIAL_CITATION&basis_of_record=MATERIAL_SAMPLE&institution_code=BUNS&institution_code=DBE-UNS&institution_code=UNSPMF&institution_code=FSUNS

No dataset with institution code BUNS.

Step 3: Check whether occurrences from the following datasets should be linked to the institution

spalp commented 11 months ago

Linking occurrences to the institution's collections

Step 1: Check occurrences using institution codes as a collection code.

https://www.gbif.org/occurrence/search?basis_of_record=LIVING_SPECIMEN&basis_of_record=FOSSIL_SPECIMEN&basis_of_record=PRESERVED_SPECIMEN&basis_of_record=MATERIAL_CITATION&basis_of_record=MATERIAL_SAMPLE&collection_code=BUNS&collection_code=DBE-UNS&collection_code=UNSPMF&collection_code=FSUNS

07.12.2023: 28 results. All are insects, thus should be linked to: https://registry.gbif.org/collection/59093a0f-9471-4182-9e56-455d9ce85672. All use FSUNS as collection code and https://api.gbif.org/v1/grscicoll/lookup?collectionCode=FSUNS returns a unique match, so no need for manual mapping.

spalp commented 5 months ago