zavolanlab / htsinfer

Infer metadata for your downstream analysis straight from your RNA-seq data
Apache License 2.0
10 stars 22 forks source link

bug: STAR using uncompressed sample files when mate relationship is not #149

Closed balajtimate closed 1 year ago

balajtimate commented 1 year ago

Describe the bug With the revised mapping logic from #131, when the input samples are not mates and the mate relationship needs to be determined by mapping the reads against the transcripts, STAR uses the original input sample files instead of the uncompressed subset of the samples, resulting in an error, as the STAR command is currently not built to handle .gz files.

To Reproduce Run HTSinfer in paired-end mode on files from different mates, one of them compressed, e.g.

htsinfer tests/files/first_mate.fastq tests/files/mixed_mates_compressed.fastq.gz

Expected behavior Add the paths to the subset of samples as argument when calling to the mapping function in get_library_type.py