mks0601 / I2L-MeshNet_RELEASE

Official PyTorch implementation of "I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image", ECCV 2020
MIT License
710 stars 130 forks source link

Speeding up network inference questions #44

Closed pablovela5620 closed 3 years ago

pablovela5620 commented 3 years ago

I wanted to ask if you'd done any experimentation with a more mobile-friendly version of i2l.

I could image a few modifications to try to reduce the number of parameters and speed up network inference

  1. Change backbone from Res50 to a smaller version (Res34/18)
  2. Use a MobileNetV2/3backbone
  3. Directly regress mesh lixel heatmaps rather than first getting pose lixel heatmaps -> feeding into another resnet + image feature

Have you done any of the following? It seems like some of these have been tried by implementations such as Mediapipe face mesh regression method

I also wanted to ask if you had any data on how integral regression and lixel based heatmap regression speed compares to traditional heatmap based methods.

Thanks!

mks0601 commented 3 years ago

1,2: No I haven't. 3: There is a table in the paper which compares without and with pose regression.

pablovela5620 commented 3 years ago

Whoops completely forgot that was in there. Had only remembered where you directly fed the ground truth 3d pose and not where you regressed. Thanks for the response. Here's the difference in the paper for anyone else that may wonder image