Zainmustafajajja opened 10 months ago
I think what is implemented in the code is right. Let me put it this way: imagine we change the channel state at every time step, and keep doing the same from the beginning of the next episode. Then what is the point of defining multiple episodes during training if the channel state is already updated at every step? We could instead use a single episode with a very large number of steps. Moreover, I think the agent needs a chance to explore the possible actions for a given channel state during training, so that it can find its way toward the optimal solution.
Hi, I do agree with you. Some aspects of the experimentation don't make sense at all. But I would like to remind you that this is an implementation of an article that I did not author. The repo was just for a grad-level course project, and I tried to implement the paper as precisely as possible, adhering to what is reported.
For a way of experimentation that makes sense, I suggest you refer to my own paper's repo.
Thank you for your response, dear Baturay. I also looked into your amazing ICC paper, and I learnt a lot from your implementation of the JSAC paper too. I ran into some questions while reading these two papers, and I would greatly appreciate it if you could share your expert opinion on them:
Hope this helps.
Hello, I would like to ask: in the JSAC method, the neural network is trained with a channel that stays fixed within each episode. Once training is complete, how do we use the trained network in an actual communication environment, where the channel varies at each time step?
We are resetting the channel (H_1, H_2) in the reset() method of the environment. Resetting is done only at the start of each episode. This means the channel changes once per episode and stays constant during all steps of that episode, as there is no command to change the channel at each time step. To my knowledge, the channel should change at each time step. Kindly let me know if I am getting it wrong somewhere.
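To make the two behaviours concrete, here is a minimal sketch of the difference being discussed. It is not the repo's actual environment: `ToyChannelEnv`, the `resample_each_step` flag, the dimensions `M`, `N`, `K`, the i.i.d. complex Gaussian channel model, and the zero placeholder reward are all assumptions made for illustration. Only `reset()`, `H_1`, and `H_2` are names taken from the discussion above.

```python
import numpy as np


class ToyChannelEnv:
    """Hypothetical stand-in for the environment (not the repo's code)."""

    def __init__(self, M=4, N=8, K=2, resample_each_step=False, seed=0):
        self.M, self.N, self.K = M, N, K
        # False: channel is drawn only in reset() (fixed within an episode).
        # True:  channel is also redrawn after every step.
        self.resample_each_step = resample_each_step
        self.rng = np.random.default_rng(seed)
        self.H_1 = None
        self.H_2 = None

    def _complex_gaussian(self, shape):
        # Unit-variance complex Gaussian entries (assumed channel model).
        real = self.rng.standard_normal(shape)
        imag = self.rng.standard_normal(shape)
        return (real + 1j * imag) / np.sqrt(2)

    def _draw_channels(self):
        self.H_1 = self._complex_gaussian((self.N, self.M))
        self.H_2 = self._complex_gaussian((self.N, self.K))

    def _observation(self):
        # Flatten both channels and stack real and imaginary parts.
        flat = np.concatenate([self.H_1.ravel(), self.H_2.ravel()])
        return np.concatenate([flat.real, flat.imag])

    def reset(self):
        self._draw_channels()
        return self._observation()

    def step(self, action):
        reward = 0.0  # placeholder; a real environment would compute a reward
        if self.resample_each_step:
            self._draw_channels()
        return self._observation(), reward, False


def count_distinct_channels(env, episodes, steps):
    """Count how many different channel realisations the agent observes."""
    seen = set()
    for _episode in range(episodes):
        obs = env.reset()
        seen.add(obs.tobytes())
        for _step in range(steps):
            obs, _reward, _done = env.step(action=None)
            seen.add(obs.tobytes())
    return len(seen)


fixed = count_distinct_channels(ToyChannelEnv(resample_each_step=False), 3, 5)
varying = count_distinct_channels(ToyChannelEnv(resample_each_step=True), 3, 5)
print(fixed, varying)
```

With 3 episodes of 5 steps, the per-episode variant exposes the agent to 3 channel realisations, while the per-step variant exposes it to 18 (one from each `reset()` plus one after each step). In the per-step variant the episode boundary no longer changes anything about the channel, which is the point raised about a single long episode being equivalent.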