inbo / movepub

R package to prepare animal tracking data from Movebank for publication in a research repository or GBIF
https://inbo.github.io/movepub/
Other
3 stars 1 forks source link

What reference data to include #24

Open peterdesmet opened 2 years ago

peterdesmet commented 2 years ago

@sarahcd so far I have based the mapping for the start and end HumanObservations on fields I have in my datasets. It would be good however to have a mapping based on all fields potentially available in the reference data. Can you check the field where you agree that they are not useful to map (crossed out) and comment on those you think would be useful to include?

peterdesmet commented 2 years ago

Note that the start eventRemarks are currently based on a number of fields:

https://github.com/inbo/movepub/blob/33bcd1763bf357720c47993390e39b669d26a6ea/inst/sql/movebank_dwc_occurrence.sql#L54-L60

Resulting in e.g.:

UvA-BiTS tag attached by harness to free-ranging animal | Found dead in Sint-Maartensdijk (NL) on 2018-08-24, last active day is 2018-08-23.

This will likely be empty if any of the fields are empty.

sarahcd commented 2 years ago

I can look at this next week, but one thing to think about: The deploy start/end times are sometimes based on something other than a human observation. For the deploy-on time, we can assume with current methods that the tag had to be attached by a person, and therefore assume it at least approximates the time of a human observation. For the deploy-off-time, this is regularly used to define the end of the reliable tracking data, e.g., to exclude locations from the track determined to have been recorded after the tag stopped moving or sending reliable data. Even when tags are physically retrieved, this often happens after an animal has died, or after it has been automatically released from the animal, so doesn't necessarily represent an observation of the live animal.

peterdesmet commented 2 years ago

Hmm, interesting point. I'll reference your comment in #10.

peterdesmet commented 2 years ago

Addressed. @sarahcd can you go over the list and check off the ones that you think make sense to exclude from DwC (I indicated them crossed out). We should then go over the remaining ones to see what to do with those.

sarahcd commented 2 years ago

Sorry for the delay. I am breaking these down based on the entity they describe in Movebank, since this could impact how to code transformation to DwC.

Possible mappings added from above have notes in italics. We can discuss whether to implement any additions to the current version, and I could propose specifics in the sql.

Terms describing the animal

Terms describing the tag

Terms describing the deployment, including tags and animals at the time of the deployment

peterdesmet commented 1 year ago

@sarahcd do you think including study-site for human observations makes more sense as:

Note that for GPS observations, the locationRemarks currently state "elevations are altitude above mean sea level"

peterdesmet commented 1 year ago

Overview of things to tackle:

albenson-usgs commented 1 year ago

For the animal death- I am really really hoping that vitality will be added to Darwin Core so you should be able to use that term for this.

sarahcd commented 1 year ago

@sarahcd do you think including study-site for human observations makes more sense as:

* locality: https://dwc.tdwg.org/terms/#dwc:locality

* locationRemarks: https://dwc.tdwg.org/terms/#dwc:locationRemarks

Note that for GPS observations, the locationRemarks currently state "elevations are altitude above mean sea level"

locality for human observations sounds good to me. Also, FYI, we are in the progress of adding locality as an event field in Movebank (e.g., for GPS records) for cases that segments of data are assigned location names. I think those should work fine together in the translation to DwC.