Naman-ntc / 3D-HourGlass-Network

3D HourGlass Networks for Human Pose Estimation Through Videos
MIT License

What is the meaning of the function "robust" in HourGlassNet3D.py? #2

Closed Talegqz closed 5 years ago

Talegqz commented 5 years ago

I tried to understand it but failed. Could you please explain why it is used? Thank you very much.

Naman-ntc commented 5 years ago

Hi! Thanks for your interest. I'm afraid I pushed my code to GitHub in haste and don't really plan to update it much, so the code may be quite hard to follow. Feel free to ask questions. By the way, you should check out my other repo, Pose Estimation. The code there is much cleaner and more usable.

As for this function, it was an idea I tried (I tried many such small ideas!), which might be a little hard to explain, but let's see.

The variable temporal is a list (check ref.py) that stores the temporal sizes of the 3D convolution kernels. I wanted different layers to have different temporal receptive fields (more at the start of the network, large at the end). The list holds that information and is passed to all the constructors, which look up the temporal kernel size needed by the group of convolutions they contain. The function robust (which seems a very bad name now, apologies!) slices out some of the temporal features that are not being predicted, i.e. those corresponding to frames far away from the main/central frame.
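To make the slicing idea concrete, here is a minimal sketch in plain Python. It is not the code from HourGlassNet3D.py: the helper names `frames_after_convs` and `center_crop_time`, the example kernel sizes, and the assumption that the temporal convolutions are unpadded with stride 1 are all hypothetical and chosen only for illustration.

```python
def frames_after_convs(num_frames, temporal_kernels):
    """Frames left after a stack of temporal convolutions.

    Assumes (hypothetically) stride 1 and no temporal padding, so a
    kernel of temporal size k shortens the clip by k - 1 frames.
    """
    for k in temporal_kernels:
        num_frames -= k - 1
    return num_frames


def center_crop_time(frames, keep):
    """Keep `keep` frames centred on the middle of the clip,
    dropping the frames farthest from the central frame."""
    if keep < 1 or keep > len(frames):
        raise ValueError("keep must be between 1 and len(frames)")
    start = (len(frames) - keep) // 2
    return frames[start:start + keep]


# Hypothetical per-layer temporal kernel sizes (not taken from ref.py).
temporal = [1, 3, 3]

clip = list(range(9))                           # frame indices 0..8, central frame is 4
keep = frames_after_convs(len(clip), temporal)  # 9 - 0 - 2 - 2 = 5
cropped = center_crop_time(clip, keep)          # frames 2..6, still centred on frame 4
```

In a real network the same crop would be applied along the time axis of a feature tensor, so that features from an earlier layer line up with the shorter temporal extent of a later one.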

I guess that's not very clear. Check this (old commit code), which is neater and doesn't have these fancy additions. (The fancy additions didn't work that well, either.)

Feel free to get back to me. Thanks!

Naman-ntc commented 5 years ago

Closing this for now; feel free to reopen it!