HumanCompatibleAI / imitation

Clean PyTorch implementations of imitation and reward learning algorithms
https://imitation.readthedocs.io/
MIT License
1.22k stars 233 forks source link

Add support for saving videos of policies on a environment for evaluation during and after training #523

Open yawen-d opened 1 year ago

yawen-d commented 1 year ago

Problem

Solution

def record_and_save_video(
    output_dir: str,
    policy: policies.BasePolicy,
    eval_venv: vec_env.VecEnv,
    video_kwargs: Mapping[str, Any],
    logger: Optional[sb_logger.Logger] = None,
) -> None:
    ...
if key != "video":
    self.wandb_module.log({key: value}, step=step)
else:
    self.wandb_module.log({"video": self.wandb_module.Video(value)})
AdamGleave commented 1 year ago

Assigning to @samuelarnesen to get PR https://github.com/HumanCompatibleAI/imitation/pull/524 over finish line once he starts

Rocamonde commented 1 year ago

Sounds good, feel free to request a review @samuelarnesen once you make some progress. I had already reviewed #524 before.