Open w530248323 opened 6 years ago
@w530248323
Actually, I am working on this topic too. Recently I came up with two ideas. In this toy project, Cozmo takes more than 100 ms but less than one second to compute a result. What if we cut that down to 10 ms? First, I would like to rewrite the prediction function with Caffe, which is built in C++. Second, if your hardware is powerful enough (with strong GPU compute, which my MacBook does not have), I recommend predicting from a small number of captured frames, for example one prediction per 10 frames. Fill in the missing frames (the shortfall relative to the number of frames per sample used in your training process, which is 40 in my project) with the first frame or the last frame. Hope this helps with your problem.
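To make the second idea concrete, here is a minimal sketch of the padding step in plain Python. The function name `pad_clip` and its parameters are made up for illustration and are not from the project; frames are treated as opaque objects, so converting the padded list into whatever tensor your model expects is left out.

```python
def pad_clip(frames, target_len=40, mode="last"):
    """Pad a short list of frames to target_len by repeating an edge frame.

    target_len=40 mirrors the clip length mentioned above; use whatever
    length your own model was trained with (hypothetical default).
    """
    if not frames:
        raise ValueError("need at least one captured frame")
    if len(frames) >= target_len:
        # Already long enough: keep only the most recent target_len frames.
        return list(frames[-target_len:])
    missing = target_len - len(frames)
    if mode == "first":
        # Repeat the first frame in front of the captured frames.
        return [frames[0]] * missing + list(frames)
    if mode == "last":
        # Repeat the last frame after the captured frames.
        return list(frames) + [frames[-1]] * missing
    raise ValueError("mode must be 'first' or 'last'")


# Example: 10 captured frames padded up to a 40-frame clip.
captured = [f"frame{i}" for i in range(10)]
clip = pad_clip(captured, target_len=40, mode="last")
print(len(clip))  # 40
```

Whether padding with the first or the last frame works better probably depends on the gesture, so it is worth trying both against your own validation clips.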
Thanks a lot. My WeChat ID is w530248323. We can study this topic together~
I'm working on the same task; I also trained on jester_dataset using C3D. I saw that you use a robot to predict gestures. Do you have any idea how to run prediction in real time on an RGB camera?