BASALT-2022-Karlsruhe / ka-basalt-2022

MIT License
0 stars 0 forks source link

Implement custom models on top of VPT models #83

Closed rk1a closed 2 years ago

rk1a commented 2 years ago

Training reward models, subtask detectors and ESC policies from scratch is probably quite inefficient. It would be better if we could base all of these on a general model backbone that is capable in the MineRL domain. Luckily the VPT models are exactly that. A first step could be to make use of the ImpalaCNN which is the first component of the MineRLAgent policy. Good representations of long term dependencies could also benefit from using the embeddings of the transformer decoder blocks.

Task: Implement custom modules that only load relevant parts of the VPT model for use in

rk1a commented 2 years ago

This is addressed here: https://github.com/BASALT-2022-Karlsruhe/ka-basalt-2022/blob/demo_segmentation/impala_based_models.py