Closed LiineKasak closed 2 years ago
thought a oneliner function doesn't need documentation, but I'll add it for good measure :) there are a lot of Ns in the ref sequence, typically at the start and end of chromosomes I think. for example the first 10 000 nucleotides in chr1 are Ns iirc. But for the given genes I doubt there will be any Ns. Just added encoding for it in case
@Sum02dean I just use the built in auto generation of docstring stubs in pycharm (jetbrains premium for all uni students). this is how it works: https://www.jetbrains.com/help/pycharm/using-docstrings-to-specify-types.html
Notable things:
https://s3.amazonaws.com/igv.broadinstitute.org/genomes/seq/hg38/hg38.fa
Otherwise, think this is the best way to load sequence, afterwards can add logic due to window and other shifts