Non-indexed FASTA: access sequence by genomic region

BioJulia / FASTX.jl

Parse and process FASTA and FASTQ formatted files of biological sequences.

MIT License

61 stars 20 forks source link

Hi,

Indexed FASTA have extract to quickly get a sequence. I could not find a way to do the same without an index. Could such a method be added ?

Expected Behavior

For example, with sequence("chr1", 10, 1000, reader)

Possible Solution / Implementation

My attempt was

function sequence(chrom, x, y, reader)
    r= first(reader)
    seq = nothing
    while iterate(reader) != nothing
        if identifier(r) == chrom
            seq = FASTX.sequence(r)[x:y]
            break
        end
    end
    return seq
end

Your Environment

Package Version used: 2.1.2
Julia Version used: 1.9.2
Operating System and version (desktop or mobile): Gentoo

Thanks,

BioJulia / FASTX.jl

Non-indexed FASTA: access sequence by genomic region #110

Expected Behavior

Possible Solution / Implementation

Your Environment