Closed yangzhao1230 closed 11 months ago
I only raise the issue when using NTs, as NTs use non-overlapping 6-mer tokenzier.
Thanks for reporting this! It should be fixable by using upsampling also for NT models. I'll investigate whether results are affected by this and fix it in a PR.
I noticed you pre pre-set embedding_idx in variant taks. However, there exists 'N' in such datasets, which may alter the embedding_idx, because the 'N' occupied a whole token.