xinyixinyijiang closed this issue 7 months ago
Thanks for bringing this to our attention. We have identified that many of the additive files were updated prior to the migration to AWS, but their MD5s were not updated to match. We are working on regenerating the MD5s and should have this completed by the end of the week.
Thanks for your patience here. We've updated the file list and generated the complete list of MD5 checksums for all files in this repository. It can be found in 2018_gwas_imputed_md5.20240212.txt. Sorry for any confusion this may have created.
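For anyone re-checking their downloads against the new list, here is a minimal sketch of how that comparison could be scripted. It assumes the MD5 file uses the common `md5sum` layout (checksum first, file name last on each line); that layout is an assumption and has not been confirmed for 2018_gwas_imputed_md5.20240212.txt. The function names `md5_of_file`, `read_manifest`, and `find_mismatches` are hypothetical helpers, not part of any provided tooling.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def read_manifest(manifest_path):
    """Parse a checksum list into {file name: expected md5}.

    ASSUMPTION: each line looks like '<md5>  <file name>' (md5sum layout).
    Adjust the parsing if the real file is laid out differently.
    """
    expected = {}
    for line in Path(manifest_path).read_text().splitlines():
        parts = line.split()
        if len(parts) >= 2:
            # Keep only the base name so paths or URLs in the list still match.
            name = Path(parts[-1].lstrip("*")).name
            expected[name] = parts[0].lower()
    return expected


def find_mismatches(download_dir, manifest_path):
    """Return {file name: (expected, actual)} for files whose MD5 differs."""
    expected = read_manifest(manifest_path)
    mismatches = {}
    for name, want in sorted(expected.items()):
        path = Path(download_dir) / name
        if not path.is_file():
            continue  # not downloaded locally; skip instead of reporting
        got = md5_of_file(path)
        if got != want:
            mismatches[name] = (want, got)
    return mismatches
```

Calling `find_mismatches("downloads", "2018_gwas_imputed_md5.20240212.txt")` (the `downloads` directory is a placeholder) would return the files that still disagree with the list. As this thread shows, a mismatch does not always mean a corrupted download; the published checksum itself may be out of date.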
Details about these GWAS sumstats in question are here
Hi!
I recently downloaded GWAS summary statistics for almost all UKBB phenotypes (both sexes only) via the AWS links found in the provided Google document. To ensure data integrity, I ran an MD5 checksum verification (on a Linux system). However, 22 specific phenotypes failed the MD5 checksum validation, and they failed consistently across two separate tests.
For example, for phenotype ID 1220, accessed through the link
https://broad-ukb-sumstats-us-east-1.s3.amazonaws.com/round2/additive-tsvs/1220.gwas.imputed_v3.both_sexes.tsv.bgz
the MD5 checksum I computed was 5ee1df62d2a6608c942ec85fd712bf9a. This differs from the expected checksum provided, which is 80d2c21a425aee154d585cd20ffa1e8c.
I have listed all affected phenotypes below for your reference. Could you please investigate this discrepancy?
Thank you for your attention to this matter!