SystemsGenetics / GEMmaker

A workflow for construction of Gene Expression count Matrices (GEMs). Useful for Differential Gene Expression (DGE) analysis and Gene Co-Expression Network (GCN) construction.
https://gemmaker.readthedocs.io/en/latest/
MIT License

Hisat2 unmapped read optional cleanup #255

Open JohnHadish opened 2 years ago

JohnHadish commented 2 years ago

Hisat2 can write unmapped reads to a file of the user's choice via the --un parameter. In GEMmaker, this file is ${sample_id}_un.fastq.

Currently these files are kept, and nothing cleans them up. They are usually fairly small, but in an edge case they could grow large.

This is a low-priority request to add a parameter that enables cleanup of these files.
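
For illustration only, here is a rough sketch of the opt-in behavior being requested. It is written in Python rather than in the workflow's own language, and the function name `cleanup_unmapped_reads` and the `keep_unmapped` flag are hypothetical, not existing GEMmaker parameters. The only detail taken from GEMmaker is the `${sample_id}_un.fastq` file name. The flag defaults to keeping the file, which matches the current behavior.

```python
from pathlib import Path


def cleanup_unmapped_reads(work_dir, sample_id, keep_unmapped=True):
    """Remove <sample_id>_un.fastq from work_dir unless keep_unmapped is True.

    Returns the Path that was deleted, or None if nothing was removed.
    Both the function name and the keep_unmapped flag are hypothetical.
    """
    unmapped = Path(work_dir) / f"{sample_id}_un.fastq"

    # Default: keep the file, as GEMmaker does today.
    if keep_unmapped:
        return None

    # Nothing to do if Hisat2 never wrote the file for this sample.
    if not unmapped.is_file():
        return None

    unmapped.unlink()
    return unmapped
```

Keeping the default as "do not delete" means existing runs would be unaffected unless the user explicitly turns cleanup on.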