Closed ttbek closed 3 years ago
This probably is the issue, I see that the first download attempt also had a different size:
-rw-r--r-- 1 root root 7878918094 Jan 24 19:57 dbsnp-b150-GRCh37.gz
Is there a more robust way to download this file? I don't usually have any trouble with downloads via, e.g. Firefox or wget. Python script downloads have been a bane in the past, not sure why they are so unreliable, but they've messed me up for weeks before (from other projects).
Found the URL in the code, I'm downloading now with wget instead, will let you know how it goes.
That method of downloading resources was unreliable. I've changed pheweb in the hg38
branch to download from our own server instead. I'm hoping to test that code a bit more and then merge those changes into master
and make a new release in the next couple days. In the meantime I recommend using the new code, especially if you're on hg38.
Thanks. We actually need hg37 for our data, I had just grabbed the FinnGen data quickly to test with. The wget of the file is the correct size and it seems to be proceeding now. Are there other resource downloads I should consider suspect?
For anyone that encounters the same, they can get the file as:
wget https://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606_b150_GRCh37p13/VCF/00-All.vcf.gz
And then it needs to be renamed:
mv 00-All.vcf.gz dbsnp-b150-GRCh37.gz
The file should be in ./generated-by-pheweb/sites/dbSNP/
Then the previously failing command can be run again.
I'll close the issue since going forward hg38 will of course be the standard more and more frequently (well, until the next one of course).
When running: pheweb add-rsids
The exception says:
Which doesn't tell me too much more. Taking a look at the file:
It's not a permissions issue, it's in a Docker container and everything is root. The size of the downloaded file seems wrong? Tried downloading twice, same. This error isn't related to the user input data, right? Because I was using a GRCh38 based file just to try things out, but I think this is in regards to the downloaded data.