MAGICS-LAB / DNABERT_2

[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome
Apache License 2.0

token split issue #62

Open HITzhongyu opened 9 months ago

HITzhongyu commented 9 months ago

Hi, thanks for the pretrained model. If all of my sequences have the same length (for example, 70), will they all be split into the same number of tokens? Also, I am trying to use the pretrained model as a feature extractor: will its output have the same length for every sequence?

jaouiwassim commented 9 months ago

Could you rephrase the question, please? I didn't get what you mean by "split".
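
For context, "split" here presumably refers to how the tokenizer segments a sequence into tokens. DNABERT-2 uses a BPE tokenizer, so tokens can span different numbers of bases, and two sequences of equal length need not produce the same number of tokens. The sketch below is a toy illustration only: `VOCAB`, `tokenize` (a greedy longest-match stand-in, not real BPE), and `fake_embed` are hypothetical and are not part of DNABERT-2. It shows the token count varying while mean pooling still gives a fixed-size feature vector.

```python
# Toy illustration only. VOCAB is a hypothetical vocabulary, not DNABERT-2's,
# and greedy longest-match is a simplified stand-in for BPE segmentation.
VOCAB = {"A", "C", "G", "T", "AT", "CG", "ATCG", "GGA", "TTTT"}
MAX_TOKEN_LEN = max(len(token) for token in VOCAB)


def tokenize(seq):
    """Split seq into the longest vocabulary entries, scanning left to right."""
    tokens = []
    i = 0
    while i < len(seq):
        for size in range(min(MAX_TOKEN_LEN, len(seq) - i), 0, -1):
            piece = seq[i:i + size]
            if piece in VOCAB:
                tokens.append(piece)
                i += size
                break
        else:
            raise ValueError(f"cannot tokenize character {seq[i]!r}")
    return tokens


def fake_embed(token):
    """Hypothetical 2-dimensional per-token vector standing in for model output."""
    return [float(len(token)), float(token.count("G"))]


def mean_pool(token_vectors):
    """Average per-token vectors into one vector of fixed dimension."""
    dim = len(token_vectors[0])
    count = len(token_vectors)
    return [sum(vec[d] for vec in token_vectors) / count for d in range(dim)]


seq_a = "ATCGATCG"  # 8 bases
seq_b = "GACTGACT"  # also 8 bases

tokens_a = tokenize(seq_a)
tokens_b = tokenize(seq_b)

# Same sequence length, different number of tokens.
print(len(tokens_a), tokens_a)
print(len(tokens_b), tokens_b)

# Pooling over the token axis yields the same feature size for both.
pooled_a = mean_pool([fake_embed(t) for t in tokens_a])
pooled_b = mean_pool([fake_embed(t) for t in tokens_b])
print(len(pooled_a), len(pooled_b))
```

With a real model, the per-token output would likewise have one row per token, so a pooling step (or padding to a common length) would be one way to obtain equally sized features; whether that suits a given task is left open here.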