icgc-argo / seq-tools

Command line tools for ARGO sequencing data validation
https://github.com/icgc-argo/seq-tools
GNU Affero General Public License v3.0

New feature: add a flag to skip md5sum check #72

Closed: junjun-zhang closed this issue 3 years ago

junjun-zhang commented 3 years ago

For WGS data, the md5sum check takes a long time to complete. Because all other checks are very fast, the md5sum check may account for over 90% of the execution time. To allow much quicker iteration on checking and fixing metadata, the user should be able to temporarily skip the md5sum check until all other checks have passed.
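A minimal sketch of how such a flag could be wired up, for illustration only. The flag name `--skip-md5sum-check`, the `build_parser` and `select_checks` helpers, and the idea of filtering checks by name are all assumptions made here, not the actual seq-tools command line interface.

```python
import argparse


def build_parser():
    """Build a toy parser with a hypothetical flag for skipping the md5sum check."""
    parser = argparse.ArgumentParser(prog="validate-sketch")
    parser.add_argument("metadata_file", help="path to the metadata file to validate")
    parser.add_argument(
        "--skip-md5sum-check",  # hypothetical flag name, not taken from seq-tools
        action="store_true",
        help="skip the slow md5sum check while iterating on metadata fixes",
    )
    return parser


def select_checks(all_checks, skip_md5sum):
    """Return the check names to run, dropping md5sum checks when asked to."""
    if not skip_md5sum:
        return list(all_checks)
    return [name for name in all_checks if "md5sum" not in name.lower()]
```

With this shape, the fast checks can be rerun repeatedly with the flag set, and the flag dropped for a final full validation once they all pass.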

Since the c680 check covers both fileSize and fileMd5sum, it would be good to split it into two separate checks: c681_fileSize_match and c683_fileMd5sum_match.
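The proposed split could look roughly like the sketch below. Only the two check names come from this issue; the function names, their signatures, and the `skip_md5sum` parameter are hypothetical and do not reflect how seq-tools actually structures its checks.

```python
import hashlib
import os


def file_size_matches(path, expected_size):
    """Cheap check: compare the size on disk with the size in the metadata."""
    return os.path.getsize(path) == expected_size


def file_md5sum_matches(path, expected_md5, chunk_size=1024 * 1024):
    """Expensive check: read the whole file in chunks and compare its MD5 digest."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_md5.lower()


def run_file_checks(path, expected_size, expected_md5, skip_md5sum=False):
    """Run the two checks independently so the slow one can be skipped."""
    results = {"c681_fileSize_match": file_size_matches(path, expected_size)}
    if not skip_md5sum:
        results["c683_fileMd5sum_match"] = file_md5sum_matches(path, expected_md5)
    return results
```

Keeping the size check separate means a wrong fileSize in the metadata is still caught immediately, even when the md5sum check is skipped.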

junjun-zhang commented 3 years ago

Addressed by #74.