Plant-Food-Research-Open / assemblyqc

A Nextflow pipeline for evaluating assembly quality
https://plant-food-research-open.github.io/assemblyqc/
MIT License
27 stars 3 forks source link

samblaster not found #161

Open CeciliaDeng opened 3 days ago

CeciliaDeng commented 3 days ago

Description of the bug

Hi @GallVp , AssemblyQC successfully completed assessing an assembly with HiC data. When I check the HiC QC report, it mentioned [forward/reverse] Hi-C reads not found, as well as 'samblaster command not found'.

image image

Is somehow samblaster not loaded? Thank you.

Command used and terminal output

sbatch pfr_assemblyqc
Submitted batch job 7756409

Relevant files

No response

System information

No response

GallVp commented 3 days ago

Thank you for submitting an issue on this. I have looked at the report for the pipeline test data and it also has the same items missing.

I think this information is missing because we don't provide the path to the raw reads to the HICQC module. I'll investigate and see if this can be fixed.

Nonetheless, if Subjective Hi-C library judgment is given along with other statistics that means that HICQC had enough information to reach a judgement on the quality of the library. See the following link for interpreting the judgement: https://phasegenomics.github.io/2019/09/19/hic-alignment-and-qc.html

GallVp commented 2 days ago

Versions

assemblyqc version: 2.1.1 hic_qc version: https://github.com/phasegenomics/hic_qc/tree/6881c3390fd4afb85009a52918b4d068100c58b4 hic_qc container: https://github.com/GallVp/docker-images/blob/76f2da6af94c2899849ac13fd82decc43401bed5/hic_qc/6881c33_ps/Dockerfile#L9

Problems

Solution

GallVp commented 2 days ago

The results of the report are correct. Only information regarding reference and FastQ file names is missing. The solution requires changes in the tool repo. Therefore, I am placing this issue on backlog and deprioritising it.