harfang3d / dogfight-sandbox-hg2

Air to air combat sandbox, created in Python 3 using the HARFANG 3D 2 framework.
GNU General Public License v3.0
155 stars 49 forks source link

Problems with IA models being used for long periods of time #84

Open puxd opened 9 months ago

puxd commented 9 months ago

Hi, when I train intelligences for air combat missions using reinforcement learning algorithms, the "enemy aircraft" that the intelligences pair with are outputting maneuvers using your IA model. At the beginning of the training the "enemy aircraft" is able to output normal maneuvers, but as the training time progresses it seems to have a bug and leaves the IA model! The output from the command line looks like the picture below. what can I do to solve this problem? question

muratozbek23 commented 8 months ago

Hello, do you start the plane as IA at the beginning of each episode?

puxd commented 7 months ago

Hello, do you start the plane as IA at the beginning of each episode?

No, I don't turn IA mode on every episode, I turn it on at the beginning of training. After a number of turns of training the airplane leaves IA mode on its own and goes out of control.

muratozbek23 commented 7 months ago

I don't think there will be a problem if you turn on at the beginning of each episode.

puxd commented 7 months ago

I don't think there will be a problem if you turn on at the beginning of each episode. Yes, I turned on IA and trained before the start of each turn of the mission like you wanted. But it is still not tracking after some rounds of training, as shown in the image below, where red represents the algorithm-controlled intelligences and gray represents the IA-controlled aircraft.(The second image represents the trajectory of IA when it goes wrong.)

1

2

cym-cym commented 6 months ago

I don't think there will be a problem if you turn on at the beginning of each episode. Yes, I turned on IA and trained before the start of each turn of the mission like you wanted. But it is still not tracking after some rounds of training, as shown in the image below, where red represents the algorithm-controlled intelligences and gray represents the IA-controlled aircraft.(The second image represents the trajectory of IA when it goes wrong.)

1

2

Is your problem solved? I am also doing reinforcement learning research based on the dogfight platform, using enemy AI to train our own agents. However, a series of problems arise when using enemy AI. For example, enemy aircraft will not track us beyond a certain distance. Aircraft, enemy aircraft will fly near the aircraft carrier to attack the aircraft on the aircraft carrier. I sincerely hope to communicate with you.

puxd commented 6 months ago

如果你在每集的开头打开,我认为不会有问题。是的,我打开了 IA 并在任务的每个回合开始前按照你的意愿进行了训练。但经过几轮训练后,它仍然没有跟踪,如下图所示,其中红色代表算法控制的智能,灰色代表IA控制的飞机。

1 2

您的问题解决了吗?我还在做基于混战平台的强化学习研究,使用敌方人工智能来训练我们自己的智能体。然而,在使用敌方AI时会出现一系列问题。例如,敌机不会在一定距离之外跟踪我们。飞机,敌机会飞到航母附近,攻击航母上的飞机。我真诚地希望与您交流。

I don't think there will be a problem if you turn on at the beginning of each episode. Yes, I turned on IA and trained before the start of each turn of the mission like you wanted. But it is still not tracking after some rounds of training, as shown in the image below, where red represents the algorithm-controlled intelligences and gray represents the IA-controlled aircraft.(The second image represents the trajectory of IA when it goes wrong.)

1 2

Is your problem solved? I am also doing reinforcement learning research based on the dogfight platform, using enemy AI to train our own agents. However, a series of problems arise when using enemy AI. For example, enemy aircraft will not track us beyond a certain distance. Aircraft, enemy aircraft will fly near the aircraft carrier to attack the aircraft on the aircraft carrier. I sincerely hope to communicate with you.

Sure. This is a question I still haven't addressed, and I thought we could communicate about it.You can contact me at the e-mail address on my home page.