Closed Mealoore closed 1 year ago
Hi,
You can try to change the parameter about the reward in line 35 of the file train_process_s1.py.
For example, to improve the success rate, you can increase the the third value of the reward parameter from 0.0 to 0.3
I have a question about stage_1, why can't the result reach 100%?