Questions about kp2gaussian and gaussian2kp

Hi, 1) kp2gaussian(gaussian2kp(x)) != x, is this is what are you asking? This is the case because, gaussian2kp produce unormalized heatmaps. E.g. The heatmaps that does not sum to one, while kp2gaussian requires maps like this. Why do you need to invert it? 2) This operation is called soft-argmax, and it computes the weighted mean of the coordinate grid. More formally heatmap defines a probability distribution over the image coordinates, and we compute the mean coordinate given this probability distribution. 3) Do you have a pytorch landmark detector? In that case follow these steps:

Replace architecture of the keypoint_detector with your architecture. Modify the forward method so that it return a dict with a single value ['mean']. It should have the following shape [bs, num_kp, 2].
Modify the config.yaml, replace num_kp with number of keypoiths in your case and modify the kp_var from 'matrix' to 0.01.
Modify train.py, load your keypoint detector weights and commend lines wich do optimizer_kp_detector.step().

If you don't have acces to the architecrure of detector or it is complicated, you can do the following:

Compute the keypoints for all the videos and all the frames offline and save it to some file.
In frames dataset load this file and for each frame load the appropriate keypoints. Alternatively you can just run you keypoint detector on your frames at this point. Without offline computation and file saving.

Put the keypoints in the out dict under the key 'kp'.

Replace keypoint detector with some dummy class that will just forward you the keypoints. In other words just do out['mean'] = x['kp'].
Modify the config and train as in previous method.

AliaksandrSiarohin / monkey-net

Questions about kp2gaussian and gaussian2kp #20