TheCacophonyProject / classifier-pipeline

Exports tracked animals through thermal vision.
GNU General Public License v3.0
21 stars 14 forks source link

Update build #127

Closed gferraro closed 3 years ago

gferraro commented 3 years ago

Build process creates training, validation and testing sets from database of clips. Changed build process to create a dataset which works with frames or segments ( e.g. 45 frames).

Frames (important frames) are calculated by choosing frames with a mass ( the count of pixels that have been deemed object by track extraction) between the LQ and UQ.

Frames are also checked to see if they are noisy frames. This attempts to remove the following tyeps of noisy frames:

Segments are calculated either by: Choosing random permutations of the important frames.

or by choosing segment duration consecutive frames whose mass is above a certain amount

split datasets by camera and location ( too try and remove any bias that may occur from using a camera in multiple sets).

Updated Cumulative Distribution Function

Removed uneeded code for heavy bins

Some labels have low amounts of data so a single camera is split into 2 e.g. Wallabies and Leoparidaes

Added mapped_labels which lets you group multiple labels into a single label. e.g. possum, hedgehog and rabbit into the labels pests