NVGesture Processing - Githubissues

VIPL-SLP / pointlstm-gesture-recognition-pytorch

This repo holds the codes of paper: An Efficient PointLSTM for Point Clouds Based Gesture Recognition (CVPR 2020).

https://openaccess.thecvf.com/content_CVPR_2020/html/Min_An_Efficient_PointLSTM_for_Point_Clouds_Based_Gesture_Recognition_CVPR_2020_paper.html

Apache License 2.0

117 stars 19 forks source link

NVGesture Processing #30

Open aamzhas opened 11 months ago

aamzhas commented 11 months ago

Hi, I was going through your processing for NVGesture and wanted some clarification regarding a function call.

I noticed that in nvidia_process.py:31 the uvd2xyz_sherc() function is called rather than uvd2xyz_nvidia(). The differences between the two functions being that the parameters for f and the image centers change. Would this mean that the processing was incorrect? Am I misunderstanding something? Any help would be appreciated. Thank you!

ycmin95 commented 11 months ago

Yes, it seems there is a typo here, and the corresponding function should be uvd2xyz_nvidia() , thanks for your correction!

During our experiments, we didn't notice significant performance difference between image coordinate and camera coordinate systems (and we use the default intrinsic matrix of camera). And most experiments are conducted under the image coordinate system as shown in dataloader. However, I believe using the camera coordinate system should be more robust in real-world applications, and it would be a valuable research direction to increase the robustness of point-cloud based method.

NOTGOOOOD commented 2 months ago

Yes, it seems there is a typo here, and the corresponding function should be uvd2xyz_nvidia() , thanks for your correction!

During our experiments, we didn't notice significant performance difference between image coordinate and camera coordinate systems (and we use the default intrinsic matrix of camera). And most experiments are conducted under the image coordinate system as shown in dataloader. However, I believe using the camera coordinate system should be more robust in real-world applications, and it would be a valuable research direction to increase the robustness of point-cloud based method.

Hi, I am confused for converting the point-cloud to 3D space using uvd2xyz_***(). I didn't really find out at what a stage the 3D information works. It seems that point-cloud in training and testing stage both are [batch_size, T, N, 4] by here.