Question: During --train mode, sampling seems to stop, if there's no agent of e.g. g_f left. Any tips how to prevent this / implement many different types of agents which get consumed one by one?
Use case: I tried to add food with different types of value to train_gather.py to see if agents gather the more valuable food first. Unfortunately I'm not able to set this up, because sampling simply stops when one of the food agents has been used up.
Hey, thanks for publishing this great platform!
Question: During
--train
mode, sampling seems to stop, if there's no agent of e.g.g_f
left. Any tips how to prevent this / implement many different types of agents which get consumed one by one?Use case: I tried to add food with different types of value to train_gather.py to see if agents gather the more valuable food first. Unfortunately I'm not able to set this up, because sampling simply stops when one of the food agents has been used up.
Thank you!