iqbal-lab / Mykrobe-predictor

Antibiotic resistance predictions in minutes on a laptop
Other
50 stars 19 forks source link

paired-end reads #106

Closed afkoeppel closed 8 years ago

afkoeppel commented 8 years ago

I'm working with some TB data that I'd like to analyze with Mykrobe, but my reads are paired-end. Is there an option in mykrobe for handling paired-end reads as input? I'm not seeing it in the documentation so I apologize if I just missed it.
I could align them to a TB reference and use a bam file as input, but since I have several samples from different TB strains I'd prefer to use the original fastq files. I've considered just combining the R1s and R2s into one file and just treating them as single-end reads, or concatenating the sequences with a single N between the R1 and R2 reads, but I'm not sure which is preferable.
Does anyone have a recommendation for the best way to handle this?

Phelimb commented 8 years ago

We don't explicitly used the paired end information. You can either concatenate the reads as you describe above or run with two sequence files at runtime.

e.g.

mykrobe predict ERR117639 tb -1 /download/ena/ERR117639_R1.gz /download/ena/ERR117639_R2.gz