Closed peterdesmet closed 3 years ago
Awesome question. Give me a day or so. After reading and responding, I need to make sure my documentation is in-line with these responses.
_Does not allow for videos to be split into sequences, as you can only assign one sequence ID per file?__ You could tackle this in two ways.
Does not allow for file and sequence based observations to exist in the same package, as you can only assign either a unique or grouping sequenceID per file? It should, so if it doesn't then I need to fix my documentation. This is a key feature of Chapman Core after learning from eMammal (sequence-based) and Wildlife Insights (image and sequence-based). Even though Wildlife Insights can accomodate both, you must delcare the type at the project level. A single project cannot contain both file and sequence based observations.
The challenge is how you determine the number of animals associated with a single identification. In analytics, the number of animals in a single file and the number of animals in a sequence are two very different things. Roland can elaborate better on this specific challenge.
You can't use a single-image animal count and a sequence count in the same dataset for modeling purposes. You must group images by an independent interval (60seconds in eMammal) to process alongside sequences. Otherwise, you're occupancy models are all wrong.
A single project cannot contain both file and sequence based observations.
It makes a lot of sense to me, thats also how we designed Camtrap DP i.e. to support both cases. However, thinking about the future (ML/AI) I would put more emphasis on file-based identification.
In analytics, the number of animals in a single file and the number of animals in a sequence are two very different things.
Definitely. For the expert-based annotations/identifications we solved this issue introducing a field count_new
which next to a total number of animals identified on an image/video (i.e. count
) stores information about new individuals on current image/video taking into account an entire sequence. This, together with a predefined sequence_interval
, gives us a possibility for automatic aggregation of count data. How to solve the same problem for ML/AI is less clear to me - likely some kind of smart objects-tracking algorithm would need to be involved.
@ben-norton I think we have discussed this, and the nullable multimedia_id
as currently in use in Camtrap DP is the most flexible option. Can we close this issue?
Yes
@ben-norton in ChapmanCore, multimedia files have a
sequenceID
. This allows observations to be media-based (one distinctsequenceID
per file) or sequence-based (onesequenceID
for multiple files).Do I understand correctly that without creating duplicate file records this approach:
Just trying to wrap my head around this.