I notice that the model does a good job of predicting a repeat but struggles to replicate the sequence. Here are the first parts of the subsequences from this sample prediction:
I realize that we don't need the model to do this, since for these subsequences we only need to know that no repeat is present. Would it make sense, then, to predict a single additional class for bases in non-repeat subsequences, so that the model's prediction and output look like this?
(Or any other character to represent the absence of a repeat.)
Would that be easy to test?
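One quick way to try this might be to relabel the training targets before they reach the model, leaving the architecture alone apart from one extra output class. The sketch below is only an illustration under assumptions that are not confirmed here: targets are plain strings of bases, a per-base boolean mask marks which positions lie inside a repeat, and repeat positions keep whatever label they have now. The names `collapse_non_repeat`, `build_vocab`, and `repeat_mask` are hypothetical.

```python
def collapse_non_repeat(target, repeat_mask, placeholder="-"):
    """Replace every base outside a repeat with a single placeholder symbol.

    Assumption: repeat_mask[i] is True when position i lies inside a repeat.
    """
    if len(target) != len(repeat_mask):
        raise ValueError("target and repeat_mask must have the same length")
    return "".join(
        base if in_repeat else placeholder
        for base, in_repeat in zip(target, repeat_mask)
    )


def build_vocab(targets, placeholder="-"):
    """Map each symbol to a class index; the placeholder gets the last index."""
    symbols = sorted({s for t in targets for s in t if s != placeholder})
    vocab = {s: i for i, s in enumerate(symbols)}
    vocab[placeholder] = len(vocab)  # the single additional class
    return vocab


# Toy example (made-up data): positions 3..7 form a repeat of "T".
example_target = "ACGTTTTTGA"
example_mask = [False, False, False, True, True, True, True, True, False, False]

collapsed = collapse_non_repeat(example_target, example_mask)
print(collapsed)               # ---TTTTT--
print(build_vocab([collapsed]))  # {'T': 0, '-': 1}
```

With targets relabeled this way, the same training loop could be rerun with the output layer widened by one class, and the results on repeat positions compared against the current model.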