Open IIIgnac opened 1 year ago
Hi, welcome and good luck with learning.
2.1 Due to some features of neural networks (such as supersaturation, and overfitting), increasing the amount of training does not necessarily guarantee that the learning curve (the reward value) will always rise. By the way, multi-agent learning will cause more situations (such as stability and learning modes) and it normally exacerbates this problem. 2.2 The reinforcement learning algorithms try to find the optimal policy. Here, the aim is that the algorithm tries to find reasonable paths for ships. So, in this case, I need a model which will get the highest score. In other words, a terrible reward curve is acceptable for my project, if this curve is increasing at the beginning and the optimal path set is found. 2.3 If u want to get the perfect model for each scenario, adjusting parameters is necessary. If u just want to find reasonable paths, the model with these given parameters is acceptable.
Hello, as a newcomer to reinforcement learning, I have some questions I would like to ask.