Try to recurrent the result but get lower AP

yokies commented 1 year ago

I just clone the latest code in github and run the step in readme.md file with the following changes(I download pretrained NeRF models and created sample dataset with BlenderProc here):

set the basedir and datadir root
in "run_nerf_noscale.py" file line 7, change "from optimization.utils.load_LINEMOD_noscale ..." to "from utils.load_LINEMOD_noscale ..."
run " python neural_sim_main.py --config ../configs/nerf_param_ycbv_general.txt --object_id 2 --expname exp_ycb_synthetic --psi_pose_cats_mode 5 --test_distribution 'one_1' --n_epochs 50", which append the n_epochs parameter

I think this config is similar to experiment YCB-synthetic dataset in your paper with id2 and no overlap setting. And the result would match the result in Fig.5 (a), which show the AP achieve 95%-100%. However, I get the "AP-2" in "save_result.txt" about 85% after 35 epochs and stop to rise. Do you have any idea to solve this problem?

yokies commented 1 year ago

I find that in your paper Fig 5 (a), two object id is Pitcher and Cheese Box, but in 4.2 Train/test biasness part shows that "We introduce such pose biasness in two of the six objects, pitcher and driller." Which object would match the result in Fig5 (a) id2?

gyhandy commented 1 year ago

Thanks for your question! In the updated version, we fixed a previous OOD problem caused by detection. For your question, please try to use a relatively smaller learning rate: e.g., 5e-5 or 1e-5 for the bi-level optimization. Yes, if you use our provided NeRF model and dataset, the config is similar to the YCB-synthetic dataset, and the results would match Fig.5 (a). We wrote a new wrapper for the main function so that the detailed hyperparameter may need a little adjustment accordingly. (For instance, the initial pose distribution, learning rate, optimizer)

yokies commented 1 year ago

Thanks for your answer. Besides, I find some problem in your dataset here, object 11. It may have some noise in backgroud, so your get_annotation function may return wrong annoation, which may be like: 000 The cv2.threshold result in:

gyhandy commented 1 year ago

Good point! Yes, we find the same issue that the NeRF may not be perfect especially when the training data has heavy shadow. We added some rules to eliminate the noise pixel, while the foreground extraction function may still need some improvement.

gyhandy / Neural-Sim-NeRF

Try to recurrent the result but get lower AP #4