Cannot reproduce paper results using the given ckpt #4

aopolin-lv closed this issue 1 month ago

aopolin-lv commented 11 months ago
Hello, Hiveformer is promising because it uses instruction-driven history information to predict agent actions. However, when I tried to reimplement this work, I ran into an important problem. The results I get on my headless server are as follows:

| seed | pick_and_lift | pick_up_cup | put_knife_on_chopping_board | put_money_in_safe | push_button | reach_target | slide_block_to_target | stack_wine | take_money_out_safe | take_umbrella_out_of_umbrella_stand | Avg. |
|---|---|---|---|---|---|---|---|---|---|---|---|
| seed=0 | 89.20 | 61.00 | 38.20 | 21.60 | 94.60 | 99.80 | 41.80 | 58.40 | 43.00 | 34.20 | 58.26 |
| seed=2 | 89.40 | 70.20 | 63.80 | 21.00 | 98.80 | 99.80 | 76.40 | 59.80 | 52.20 | 41.60 | 67.3 |
| seed=4 | 92.80 | 72.00 | 43.60 | 32.60 | 93.60 | 99.80 | 36.80 | 76.00 | 43.80 | 47.00 | 63.80 |
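
To see how much the runs vary from seed to seed, the per-seed numbers above can be summarized with a short script. This is only a sketch: the names `TASKS`, `RESULTS`, `row_average`, and `per_task_stats` are made up for illustration and are not part of the Hiveformer repo. It recomputes each row's average from the per-task success rates, so it can be compared against the Avg. column, and reports the mean, sample standard deviation, and max-min spread per task.

```python
from statistics import mean, stdev

# Task order follows the results reported above.
TASKS = [
    "pick_and_lift", "pick_up_cup", "put_knife_on_chopping_board",
    "put_money_in_safe", "push_button", "reach_target",
    "slide_block_to_target", "stack_wine", "take_money_out_safe",
    "take_umbrella_out_of_umbrella_stand",
]

# Per-task success rates (%) copied from the results above, keyed by seed.
RESULTS = {
    0: [89.20, 61.00, 38.20, 21.60, 94.60, 99.80, 41.80, 58.40, 43.00, 34.20],
    2: [89.40, 70.20, 63.80, 21.00, 98.80, 99.80, 76.40, 59.80, 52.20, 41.60],
    4: [92.80, 72.00, 43.60, 32.60, 93.60, 99.80, 36.80, 76.00, 43.80, 47.00],
}


def row_average(rates):
    """Average success rate over all tasks for one seed."""
    return mean(rates)


def per_task_stats(results):
    """Map each task to (mean, sample std, max-min spread) across seeds."""
    stats = {}
    for i, task in enumerate(TASKS):
        values = [rates[i] for rates in results.values()]
        stats[task] = (mean(values), stdev(values), max(values) - min(values))
    return stats


for seed, rates in RESULTS.items():
    print(f"seed={seed}: recomputed avg = {row_average(rates):.2f}")

for task, (m, s, spread) in per_task_stats(RESULTS).items():
    print(f"{task:40s} mean={m:6.2f} std={s:6.2f} spread={spread:6.2f}")
```

Tasks with a large spread across seeds are the ones where a single evaluation run says the least.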

Except for the tasks pick_and_lift and reach_target, the results are far from those you report in the paper and repo. Could you give me some advice?

I tried three configs and ckpts obtained from transformer_unet+gripper_attn_multi32_300k. The main command is as follows:

export COPPELIASIM_ROOT=/data/project/CoppeliaSim_Edu_V4_1_0_Ubuntu20_04
export WORLD_SIZE=1
export MASTER_ADDR='localhost'
export MASTER_PORT=10000

export LOCAL_RANK=0 
export RANK=0
export DISPLAY=:0.0


python eval_models.py \
    --exp_config ${outdir}/logs/training_config.yaml \
    --seed 200 \
    --num_demos 500 \
    checkpoint ${outdir}/ckpts/model_step_${step}.pt

CUDA version is: 12.0

cshizhe commented 11 months ago

I would suggest you visualize the rendered images during the evaluation. We noticed that there can be problems when running the RLBench simulator on a headless machine, where the rendered images are not correct.
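
One way to do this check is to dump a few camera frames to disk during evaluation and flag frames that look blank. The following is a minimal, standard-library-only sketch, not code from this repo: `save_ppm` and `looks_degenerate` are hypothetical helpers, and the `min_std` threshold is an arbitrary assumption. It also assumes that a frame is available as raw 8-bit RGB bytes, for example from `obs.front_rgb.tobytes()` if the observation exposes the front camera as a `uint8` array.

```python
import statistics
import tempfile
from pathlib import Path


def save_ppm(rgb: bytes, width: int, height: int, path: Path) -> None:
    """Write raw 8-bit RGB data as a binary PPM (P6) image for inspection."""
    if len(rgb) != width * height * 3:
        raise ValueError("buffer size does not match width * height * 3")
    header = f"P6\n{width} {height}\n255\n".encode("ascii")
    path.write_bytes(header + rgb)


def looks_degenerate(rgb: bytes, min_std: float = 1.0) -> bool:
    """Heuristic (assumption): a frame whose pixel values barely vary
    is probably a blank or failed render."""
    return statistics.pstdev(rgb) < min_std


# Synthetic frames stand in for real camera observations here.
width = height = 16
blank = bytes(width * height * 3)  # all-black frame
textured = bytes(
    (x * 16) % 256
    for _ in range(height)
    for x in range(width)
    for _ in range(3)
)  # horizontal gray gradient

print("blank frame degenerate:", looks_degenerate(blank))
print("textured frame degenerate:", looks_degenerate(textured))

with tempfile.TemporaryDirectory() as tmp:
    out = Path(tmp) / "step_000.ppm"
    save_ppm(textured, width, height, out)
    header = out.read_bytes()[:3]  # PPM magic number plus newline
    print("wrote", out.name, "with header", header)
```

PPM files open in most image viewers, so the saved frames can be copied off the server and inspected by eye. The variance check only catches blank frames; other rendering errors still need a visual look.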