How to deal with the sequence positions of the reference genome?

instadeepai / nucleotide-transformer

🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

Hi dallatt,

Thanks for your work and for providing such a great tool.

I would like to know how you handle the position information of sequences from the reference genome. I saw in the article that, during the data preparation stage, the variants carried by each individual were substituted into the reference sequence at the corresponding positions. I can't work out how this step is implemented, because I don't see any position-related input in your code.
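To make the question concrete, here is a minimal sketch of what I imagine that step looks like. It is only my guess, not something taken from your code: the function name `apply_snvs`, the `(position, ref, alt)` tuple layout, and the `region_start` parameter are all hypothetical, and it only handles single-nucleotide substitutions.

```python
from typing import Iterable, Tuple

# Hypothetical variant record: (1-based genomic position, reference base, alternate base).
Variant = Tuple[int, str, str]


def apply_snvs(reference: str, variants: Iterable[Variant], region_start: int = 1) -> str:
    """Return a copy of `reference` with single-nucleotide variants substituted in.

    `region_start` is the 1-based genomic coordinate of reference[0], so the
    position is only used here to locate the base to replace.
    """
    seq = list(reference)
    for pos, ref, alt in variants:
        idx = pos - region_start  # convert genomic coordinate to string index
        if not 0 <= idx < len(seq):
            continue  # variant lies outside this region, skip it
        if seq[idx] != ref:
            raise ValueError(f"reference mismatch at {pos}: expected {ref}, found {seq[idx]}")
        seq[idx] = alt
    return "".join(seq)


# Toy region covering genomic positions 101..110.
reference = "ACGTACGTAC"
variants = [(102, "C", "T"), (107, "G", "A")]
individual = apply_snvs(reference, variants, region_start=101)
print(individual)  # ATGTACATAC
```

If this is roughly what happens, the positions would be consumed during preprocessing and the model would only ever receive the resulting sequence string, which would explain why I can't find a position input. Is that understanding correct?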

Looking forward to your reply, thanks!

How to deal with the sequence positions of the reference genome? #63