weecology / EvergladesTools

Computational tools for working with Everglades image data
MIT License
0 stars 2 forks source link

Duplication within Vacation 5/26/2022 Vacation. #149

Closed bw4sz closed 1 year ago

bw4sz commented 1 year ago

There are spatially duplicate predictions, but with unique IDs in Vacation 05/26/2022.

Screenshot 2023-03-17 at 10 43 50 AM

I download PredictedBirds.zip from

/blue/ewhite/everglades/EvergladesTools/App/Zooniverse/data unzipped and overlaid the image on the drone data. There is one bird, but two records.

Screenshot 2023-03-17 at 11 01 48 AM

To verify this is a pipeline and not a machine learning problem I downloaded

scp hpg:'/blue/ewhite/everglades/predictions/2022/Vacation/Vacation_05_26_2022_projected.*' .

and there are no duplicates.

image

This is likely a merge issue somewhere when predicted_birds.zip is created?

ethanwhite commented 1 year ago

The workflow is doing what it's supposed to. There's an issue with the data:

(base) [ethanwhite@login1 everglades]$ ls -lh orthomosaics/2022/Vacation/Vacation_05_26_2022*
-rw-r--r-- 1 b.weinstein ewhite 3.9G Jan 26 13:55 orthomosaics/2022/Vacation/Vacation_05_26_2022_A.tif
-rw-r--r-- 1 b.weinstein ewhite 3.9G Jan 26 13:58 orthomosaics/2022/Vacation/Vacation_05_26_2022_B.tif
-rw-rw-r-- 1 b.weinstein ewhite 2.9G Jun  2  2022 orthomosaics/2022/Vacation/Vacation_05_26_2022.tif

There are actually two flights here, but they are 3 files. *2022_A.tif and *2022.tif are both getting processed as primary and so the birds are getting doubled.

ethanwhite commented 1 year ago

Confirmed that Vacation_05_26_2022 and Vacation_05_26_2022_A are the same flight. I've deleted Vacation_05_26_2022.tif from Dropbox. Sadly I can't delete if from the HPG and the rule because I can't login...

ethanwhite commented 1 year ago

Now that the HPG was fixed it's auth system I've deleted the extra file and am currently rerunning the workflow. Thanks for catching that @bw4sz! We should add a check for similar situations in the future. I'll open an issue.

ethanwhite commented 1 year ago

Should be all fixed now