Closed jan-christiansen closed 5 years ago
Another issue why training with long programs might not improve the detection, is that while the programs themselves might be near maximum size, there is no way to guarantee that the instruction which invalidate the program are also near the end of the program.
Is the net able to generalise the behaviour for short programs, if we train it with long programs only? Train the net with programs that are near the maximal number of tokens. Save the resulting weights into the
models
folder. Please try to improve the naming scheme of the files with pertained weights.