AdamaJava / adamajava

Other
14 stars 4 forks source link

create tool to rescue fastq reads with empty sequence #321

Closed ChristinaXu2017 closed 2 years ago

ChristinaXu2017 commented 2 years ago

Description

A Fastq file may contain an empty sequence after cutAdapter, Here A base N is set to an empty sequence with 0 base quality value". This tool takes fastq file as input and output, but also can read from stdin and writer to stdout, so it can insert into the pipe on our new WDL script.

Type of change

Please delete options that are not relevant.

How Has This Been Tested?

unit tests is added, tested on small real data. It will also run by new WDL

Are WDL Updates Required?

It is an independent new took, and not yet used by any current WDL pipeline. Any current WDL pipeline won't be affected. A new WDL script is created, which implements this tool after cutAdapter, and pass output fastq to BWA mapping, and also creates an uBAM (unmapped BAM). Thi new WDL script will be tested after its branch is merged into the master. This new WDL script named fasqTobam.wdl may replace the current mapPairedChunks.wdl.

Checklist:

ChristinaXu2017 commented 2 years ago

yes, agree and fixed