Closed peterdesmet closed 4 years ago
@sarahcd, above is a summary of the discrepancies in the records and dates in the source data and how these are presented on Movebank for the H_GRONINGEN study (the other two studies are fine). I think the issue might be related to the deployments and how to resample records based on wider deployments. Could you let me know how to solve this?
@sarahcd, see also #113
It looks like the study has been updated since you posted these numbers, is the issue resolved? These summary numbers are calculated based on the timestamps of events in the study, the deployments and deploy on/off timestamps if set, and number of flagged outliers (summarized in visible=false). So if deployment times or the imported timestamps are wrong, you'll want to correct those. Currently the study contains deploy-off dates from 2021 and 2024, and you also mention future dates in #118. If this is not intended, probably the months and dates were swapped in mapping the timestamps.
Let me know if any of that sets you on the right track!
Currently the study contains deploy-off dates from 2021 and 2024
Yes, or undefined. These are deploy-off dates as stored for the tags in the UvA-BiTS database (and I'm fine with having them set to a date in the future) and uploaded as such as movebank_ref_data.csv
. This is reflected correctly in the Movebank deployments.
The problem is that the Movebank summary numbers for the gps data are incorrect. I've looked again at the files I've uploaded (downloaded the original files) and they are correct. They do not contain dates in the future. Despite that, the Time of First Location
and Time of Last Location
for those files are incorrectly reported by Movebank (and as a result probably, the number of records).
See left (correct numbers) and right (Movebank summary numbers):
781906 vs 774873 2014-07-04 15:17:46 vs 2014-01-08 00:57:42.000 2018-07-11 11:10:40 vs 2020-07-05 23:57:37.000
21337 vs 21337 OK 2016-06-18 07:59:05 vs 2016-01-07 00:03:48.000 2016-09-03 10:35:04 vs 2018-07-08 23:58:37.000
5420 vs 5420 OK 2014-06-04 22:03:20 vs 2014-04-06 22:03:20.000 (looks like a month switch, but since it only occurs here I don't think it's the actual bug) 2014-06-23 18:37:02 vs 2015-11-06 18:37:02.000
178914 vs 178913 2012-05-10 15:39:00 vs 2012-01-06 00:09:40.000 2016-08-11 17:08:34 vs 2018-07-07 23:52:10.000
Should I remove all data and start over or is there something you can do on your site? 🤷♂
Looks like the study has changed since you posted this (the notification was in my spam folder!) so I'm not sure what the current status is. In any case, I can see that the number of deployed locations in the summary statistics are very clearly wrong, so will report this and CC you.
For general reference, since usually unexpected statistics are not caused by a problem with the calculations, here are things to consider:
Hi @sarahcd, just a heads up that I could upload the acceleration data without any problem and that the summary stats for those are correct. The issue with the gps summary data remains.
The remainder of this issue is described in #113, closing here
MH_WATERLAND
MH_ANTWERPEN
H_GRONINGEN