Digital-Naturalism-Laboratories / bucket-o-bugs

Sort through wild bugs (insects) localized from a specific location in the world to help humans id them hierarchically
Creative Commons Zero v1.0 Universal
2 stars 0 forks source link

post-V51 reformatting and filter #10

Closed egrace479 closed 1 month ago

egrace479 commented 2 months ago

Adds the filtering function. Takes the V51 JSON output and reformats to output CSV with taxonomic and other labels.

Next step will be to produce lists for second pybioclip run. Documentation of how these work is still required.

egrace479 commented 2 months ago

I fixed the DataFrame write in the filtering script so that it will fill null values in unlabeled columns. This isn't necessarily an ideal solution, but it was previously coercing the other labels into the non-null abiotic column to fill order and family, which was problematic.

I also updated the test JSON file to have unknown and a different order, family pair. It would probably be good to add more variation and create tests for this (step in the right direction for a useable package).

Next step is to add the extra filtering on taxa.csv to feed it back to pybioclip again.

I set up a few issues for these points.