KSU-ART / Tasks

Main repository for task and project coordination
2 stars 0 forks source link

Vision Processing Pipeline #10

Open alecGraves opened 6 years ago

alecGraves commented 6 years ago
alecGraves commented 6 years ago

@eneleski Step 2: python-based data lebeling tool,

alecGraves commented 6 years ago

data labeling tool moved to #13

hjevans commented 6 years ago

Github documentation for TensorFlow object detection API models, etc: https://github.com/tensorflow/models/tree/master/research/object_detection

G-RMI powerpoint w/ info http://presentations.cocodataset.org/Places17-GMRI.pdf

Tiny YOLO (general YOLO webpage): https://pjreddie.com/darknet/yolo/

alecGraves commented 6 years ago

just an idea: use batching to more efficiently process data from multiple cameras at the same time And maybe use a recurrent network to help the results stay continuous over time... If we do not do that, the results will be super jumpy, using tinyyolo.

Also, we should try using a custom tinyyolo architecture based on mobilenet to improve accuracy. Or a custom NAS- based model. Basically swap base models, train on mscoco, and fine-tune for our application. We will need to use tensorflow to maintain compatibility with the movidius.

alecGraves commented 6 years ago

TODO: Get retinanet to work, try swapping backend, etc.

alecGraves commented 6 years ago

We are transitioning to the movidius as the main processor for the underwater competition.

@cmcdermitt is working on making an object-detection neural network work, @eneleski and @zcesar1 are setting up the movidius sdk and getting ready to work on movidius ros stuff.