Problem in the code.

Hi, I hope you are doing well. Thanks for this repo; it helped me a lot. I wanted to point out a problem in the predict function. When we are getting the action with the highest action value, we use this code: `return random.choice(range(n_power_levels))`. This causes a problem because all the agents will select one specific V2I link all the time. To solve this, we should change it to `return random.choice(range(n_power_levels * n_RB))`.
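To illustrate the difference between the two ranges, here is a minimal sketch. It is not the repo's code: the sizes, the `decode` helper, and the encoding `action = rb * n_power_levels + power` are assumptions made only for this example, and each resource block is taken to correspond to one V2I link.

```python
import random

# Hypothetical sizes, chosen only for illustration.
n_RB = 4
n_power_levels = 3


def decode(action):
    """Split a flat action index into (resource block, power level).

    Assumed encoding: action = rb * n_power_levels + power.
    """
    return divmod(action, n_power_levels)


def sample_power_only(rng):
    # Range reported in this issue: covers only n_power_levels indices.
    return rng.choice(range(n_power_levels))


def sample_joint(rng):
    # Proposed range: covers every (resource block, power level) pair.
    return rng.choice(range(n_power_levels * n_RB))


rng = random.Random(0)
rbs_power_only = {decode(sample_power_only(rng))[0] for _ in range(1000)}
rbs_joint = {decode(sample_joint(rng))[0] for _ in range(1000)}

print("resource blocks reached, current range: ", sorted(rbs_power_only))
print("resource blocks reached, proposed range:", sorted(rbs_joint))
```

Under the assumed encoding, the current range can only ever land on resource block 0, while the proposed range reaches all of them. If the repo encodes actions differently, the affected subset would differ, but the sampled range would still miss most of the joint action space.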

huangkk99 / MARLV2X

Problem in the code. #2