Closed peterdesmet closed 2 years ago
That sounds like a reasonable approach. Is there a GBIF/OBIS approach to dataGeneralizations? ALA uses it specifically in reference to location precision, in which case informationWithheld might make more sense to put the subsample info into (ie "we withheld 50 records").
The movepub R package now implements dataGeneralizations
as described above (e.g. subsampled by hour: first of 13 record(s)
). For informationWithheld
we opted for the static value see metadata
for all records. The metadata will describe that the Darwin Core dataset is derived from a source dataset that might contain more information.
Closing this issue.
In a lossy transformation to Darwin Core we will loose information that is available in the source dataset. Should this be indicated for every record in
informationWithheld
and/ordataGeneralizations
(inflating unzipped size), at dataset level (in the metadata), or both?For the
movebank-gps
data case, I've currently opted forinformationWithheld
as it would only be something generic as "See source dataset for more information" (e.g. tag info, tag attachment, measurements)GPS records: to include
dataGeneralizations
to indicate the amount of subsampling:Note that I do indicate
datasetName
for every record, as that is available in the source records.Any feedback regarding what to do best here?