Closed Faizal-Eeman closed 1 year ago
Thanks for reporting this Faizal - We recently performed a metadata collection/analysis regarding all fastqs, involving gunzip/gzip - this may produce different md5s (from different gz file header if not using gzip -n for example). However, the uncompressed file (fastq file) are unchanged with identical md5. The sequence.index files will need to be updated accordingly.
Thanks for confirming @chunlinxiao! Please let me know when sequence.index files are updated with the correct checksums.
Hi @Faizal-Eeman, the md5s were updated: you can follow the link of sequence.index.AJtrio_Illumina300X_wgs_07292015_updated.
thanks
Great, they now match. Thanks a lot!
I've tried downloading a few FASTQ files listed in sequence.index.AJtrio_Illumina300X_wgs_07292015.HG002 and found the MD5 checksums listed here don't match with the downloaded files.
commands used:
MD5 checksum listed for the same file from the same library is
48e52acfce7548bddad2b3f89e8e0348
https://github.com/genome-in-a-bottle/giab_data_indexes/blob/d3c9afd4c08d9df5b2a6e94fe0692a11def4fe50/AshkenazimTrio/sequence.index.AJtrio_Illumina300X_wgs_07292015.HG002#L2Can you please verify this?
Best, Faizal