Lightning-AI / pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
https://lightning.ai
Apache License 2.0
28.51k stars 3.39k forks source link

possibilities for Asynchronous (RL online rollout) evaluation during training #15439

Open zcczhang opened 2 years ago

zcczhang commented 2 years ago

🚀 Feature

I am wondering if it is possible to include the asynchronous evaluation during the training process

Motivation

For RL projects (or imitation training + online rollout evaluation), evaluation is really a bottleneck during the training process even the environments are vectorized, especially if we want to evaluate lots of long-horizon episodes. Now the training only can continue after eval is done, but it seems not necessary as the evaluation could be done with weights at that timestep, and does not matter the future training.

Pitch

Option for doing asynchronous evaluation during training

Alternatives

using separate scripts to do this

Additional context

No


If you enjoy Lightning, check out our other projects! âš¡

cc @borda @awaelchli @rohitgr7

rohitgr7 commented 2 years ago

one solution I suggested on slack is to create checkpoints and run another script that monitors these checkpoints and run validation separately.

Borda commented 2 years ago

I think you can do it with Flow/Work? cc: @lantiga

stale[bot] commented 1 year ago

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions - the Lightning Team!