I was looking through the BAM reader section of your code and saw that you're using samtools view to load all of the lines of the file at once.
This is fine when the files are small, but it won't work on sequencing files that are hundreds of gigabytes, since everything ends up in memory. Do you want me to adapt this? I know you had mentioned that you might adapt this section to work in parallel anyway...
Yes, this is just a placeholder. I am going to work on this today and will use the index files to read the lines, so that I don't run out of memory when reading big files.
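For reference, here is a rough sketch of the kind of thing I have in mind, not the final implementation. It assumes `samtools` is on the PATH and that the BAM file is sorted and has an index next to it, so that `samtools view` can be given a region. The function names `build_view_command` and `stream_lines` are made up for illustration. The idea is to iterate over the output of the subprocess lazily instead of collecting it, so only one line is held in memory at a time.

```python
import subprocess
from typing import Iterator, List, Optional


def build_view_command(bam_path: str, region: Optional[str] = None) -> List[str]:
    """Build a `samtools view` command, optionally restricted to one region.

    Passing a region (e.g. "chr1:1-1000000") assumes the BAM file is
    coordinate-sorted and indexed; without an index samtools will fail.
    """
    cmd = ["samtools", "view", bam_path]
    if region is not None:
        cmd.append(region)
    return cmd


def stream_lines(cmd: List[str]) -> Iterator[str]:
    """Yield the command's stdout one line at a time.

    The pipe is read lazily, so memory use does not grow with the
    size of the output.
    """
    with subprocess.Popen(cmd, stdout=subprocess.PIPE, text=True) as proc:
        assert proc.stdout is not None
        for line in proc.stdout:
            yield line.rstrip("\n")
    # Popen's context manager has waited for the process by this point.
    if proc.returncode != 0:
        raise subprocess.CalledProcessError(proc.returncode, cmd)
```

Usage would be something like `for line in stream_lines(build_view_command("reads.bam", "chr1:1-1000000")): ...`, where `reads.bam` is a placeholder path. Since each region is an independent call, this should also make it easier to hand different regions to different workers when we parallelize.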
@mukarramtahir