BCs for Canons - Githubissues

schrum2 / MM-NEAT

Modular Multiobjective (Hyper) Neuro-Evolution of Augmenting Topologies + MAP-Elites: Java code for evolving intelligent agents in Ms. Pac-Man, Tetris, and more, as well as code for Procedural Content Generation in Mario, Zelda, Minecraft, and more!

http://people.southwestern.edu/~schrum2/re/mm-neat.php


BCs for Canons #906

Closed schrum2 closed 1 year ago

schrum2 commented 1 year ago

Make a binning scheme/behavior characterization (BC) for cannons with two dimensions, number of Obsidian blocks vs. number of TNT blocks. Then make a batch file for it and test it out.
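A minimal sketch of what such a two-dimensional binning could look like, for illustration only. The class name, the block-type strings, and the cap of 10 blocks per type are all assumptions and are not taken from MM-NEAT; the actual binning scheme in the repository may be organized differently.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: none of these names come from MM-NEAT.
final class ObsidianTntBinSketch {
    // Assumed cap on how many blocks of a single type one shape can contain.
    static final int MAX_BLOCKS_PER_TYPE = 10;

    /** Counts how many entries of the shape equal the given block type name. */
    static int count(List<String> shape, String type) {
        int n = 0;
        for (String block : shape) {
            if (block.equals(type)) n++;
        }
        return n;
    }

    /** Flattens (numObsidian, numTnt) into a single row-major archive index. */
    static int binIndex(int numObsidian, int numTnt) {
        int side = MAX_BLOCKS_PER_TYPE + 1; // counts range over 0..MAX inclusive
        int o = Math.min(numObsidian, MAX_BLOCKS_PER_TYPE); // clamp overflow into the last bin
        int t = Math.min(numTnt, MAX_BLOCKS_PER_TYPE);
        return o * side + t;
    }

    public static void main(String[] args) {
        List<String> shape = Arrays.asList("OBSIDIAN", "TNT", "TNT", "AIR", "OBSIDIAN", "TNT");
        int o = count(shape, "OBSIDIAN");
        int t = count(shape, "TNT");
        System.out.println("obsidian=" + o + " tnt=" + t + " bin=" + binIndex(o, t));
    }
}
```

Clamping counts above the cap into the last bin keeps the archive a fixed size; whether that is preferable to discarding such shapes is a design choice this sketch does not settle.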

TjRaffert commented 1 year ago

Seems to be working properly. I had to reduce the number of threads to three because TNT causes a lot of strain on the server.

schrum2 commented 1 year ago

Let's let it run to completion before closing this

TjRaffert commented 1 year ago

Dropping the threads down to 3 has slowed down the tests. Both are only a quarter of the way done. They should be done by Tuesday when we get back.

TjRaffert commented 1 year ago

This is for the target to the side.

schrum2 commented 1 year ago

It's good that it's not crashing though. The next time we test, we may try ramping up to 5 threads or more.

One thing I'm concerned with here is how consistent scores are. I suspect they are not.

Can you take a high-scoring result from this, evaluate it multiple times in a row, and report the scores in this issue thread?
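One way to summarize such repeated evaluations is the mean and sample standard deviation of the scores. The sketch below assumes the evaluation can be wrapped as a `DoubleSupplier`; the class and method names are hypothetical, and the random stand-in in `main` only imitates a noisy fitness, not the real Minecraft evaluation.

```java
import java.util.Random;
import java.util.function.DoubleSupplier;

// Hypothetical sketch: not part of MM-NEAT.
final class RepeatedEvalSketch {
    /** Runs the evaluation the given number of times and returns the raw scores. */
    static double[] evaluateRepeatedly(DoubleSupplier evaluation, int trials) {
        double[] scores = new double[trials];
        for (int i = 0; i < trials; i++) {
            scores[i] = evaluation.getAsDouble();
        }
        return scores;
    }

    /** Arithmetic mean of the scores. */
    static double mean(double[] xs) {
        double sum = 0;
        for (double x : xs) sum += x;
        return sum / xs.length;
    }

    /** Sample standard deviation (divides by n - 1); 0 for fewer than two scores. */
    static double sampleStdDev(double[] xs) {
        if (xs.length < 2) return 0;
        double m = mean(xs);
        double sumSq = 0;
        for (double x : xs) sumSq += (x - m) * (x - m);
        return Math.sqrt(sumSq / (xs.length - 1));
    }

    public static void main(String[] args) {
        Random rng = new Random(42); // stand-in for a noisy evaluation
        double[] scores = evaluateRepeatedly(() -> 100 + 10 * rng.nextGaussian(), 10);
        System.out.println("mean=" + mean(scores) + " stddev=" + sampleStdDev(scores));
    }
}
```

A standard deviation that is large relative to the mean would support the suspicion that a single evaluation is not a reliable score for a shape.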

TjRaffert commented 1 year ago

One thing to note about these cannons in general: I think they are only possible because of how we spawn them in. We use loops to spawn them, so every shape is spawned at a different time. This allows for variation in when the TNT activates. If every block were somehow spawned at exactly the same time, they would probably all explode without being able to push any of the other TNT.

schrum2 commented 1 year ago

I don't think this is accurate. Although a loop adds each block to a block list, the blocks themselves are spawned via a single API call to the spawnBlocks command in the MinecraftClient, which corresponds to a similar command on the Python side. So, all blocks should be spawned at the same time. However, the timing of the snapshots is definitely nondeterministic. Also, even though Minecraft itself is supposed to be deterministic, spawning the same shape in different places or at different times could easily be enough on its own to affect the random number generator, and thus lead to the exact same shape exhibiting different behavior across multiple evaluations.

Regardless, observation is needed to track the variation in scores.
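The distinction between building the block list in a loop and spawning it in one call can be sketched as follows. Everything here is a hypothetical stand-in: `Block`, `CountingClient`, and this `spawnBlocks` signature are assumptions made for illustration, not the actual MinecraftClient API.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: these types only imitate the pattern described above.
final class BatchedSpawnSketch {
    /** Hypothetical block: a type name at integer coordinates. */
    static final class Block {
        final String type;
        final int x, y, z;

        Block(String type, int x, int y, int z) {
            this.type = type;
            this.x = x;
            this.y = y;
            this.z = z;
        }
    }

    /** Hypothetical stand-in for the client; it only counts calls and blocks. */
    static final class CountingClient {
        int calls = 0;
        int blocksSpawned = 0;

        void spawnBlocks(List<Block> blocks) {
            calls++;
            blocksSpawned += blocks.size();
        }
    }

    /** The loop only fills a list; nothing is sent to the client here. */
    static List<Block> buildColumn(String type, int height) {
        List<Block> blocks = new ArrayList<>();
        for (int y = 0; y < height; y++) {
            blocks.add(new Block(type, 0, y, 0));
        }
        return blocks;
    }

    public static void main(String[] args) {
        CountingClient client = new CountingClient();
        List<Block> column = buildColumn("TNT", 4);
        client.spawnBlocks(column); // one request carries the whole shape
        System.out.println("calls=" + client.calls + " blocks=" + client.blocksSpawned);
    }
}
```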

TjRaffert commented 1 year ago

The two tests I am running are using the new bin labels so they can't use postSpawnMinecraftEvaluateBlocksMissile.bat. Should I run an old test so we can examine it tomorrow?

TjRaffert commented 1 year ago

With shapesIsWorthSaving implemented, it is clear that the scores aren't consistent. It is also clearly still evolving and getting better.