Open gauravkuppa opened 2 years ago
Hi Gaurav,
Thank you for bringing this to my attention! Please note that the FIM determinant as the reward function was an experimental work, and I haven't tested it fully. You can use it either by assigning self.info_acc = None
in L132 of envs/tracking_waypoints_env.py, or you can pull the latest version and use the following command: python -m target_localization.train --sess test_session --num_targets 2 --reward_type fim --no_augmented_state
.
I will update the codebase later to handle the errors when using the FIM determinant rewards with the information accumulator.
How can I use FIM reward? I get these errors.
When I run
python -m target_localization.train --sess dynamic_target --num_targets 2
I getWhen I run
python -m target_localization.train --sess dynamic_target --num_targets 2 --no_augmented_state
I getHow can I use FIM reward to train a tracking policy?