microsoft / arcticseals

A deep learning project in cooperation with the NOAA Marine Mammal Lab to detect & classify arctic seals in aerial imagery to understand how they’re adapting to a changing world.
MIT License
33 stars 14 forks source link

Process raw.csv to append more records to training.csv #8

Closed pbaer closed 6 years ago

pbaer commented 6 years ago

The new raw.csv file contains all hotspot detections, including ones we don't care about (duplicates) and ones we don't have imagery data for yet. It's also in a different schema than the training.csv file. This task is to process raw.csv to extract only the records that have at least thermal data in our datasets (01-99), convert those records to the training.csv schema, and append the records that are not already in training.csv (raw.csv is a superset of training.csv).

pbaer commented 6 years ago

This ended up only yielding 23 net new records for which we currently have IR/color data. I've checked in the script changes (in parse-labels) to allow us to try this again once we have more data.