Closed PhilPalmer closed 5 years ago
Hello,
I am running this on an AWS EC2 instance which has plently of disk space and yet I am still getting this error.
note that by default temporary files are created on /tmp
, if this /tmp
doesn't have much space you will get these errors, to be sure try using the --tmpdir
parameter to set the tmp dir explicitly.
I am not sure if it is because parallel-fastq-dump is mounting volumes and running it there
neither parallel-fastq-dump or fastq-dump does anything like that.
Also do you know how I can get it to run any faster. For example do you know how parallel-fastq-dump compares to fasterq-dump
I haven't had the time to test fasterq-dump extensively, but if space is a issue fasterq will be a problem because it doesn't support compressing the output on-the-fly.
If you haven't downloaded your target SRA prior to running fastq-dump you should, it speed things up considerably specially using aspera for the download. The optimum number of threads will depend on the number of reads of the SRA file. a small file won't benefit from 40 threads. Think in terms of # of reads per thread.
@rvalieris that's great, thank you for your quick reply. Still not exactly sure what was causing the error but I will definitely try using the --tmpdir
option.
Also, I sometimes get connection issues, eg:
sys: connection failed while opening file within cryptographic module - mbedtls_ssl_handshake returned -76 ( NET - Reading information from the socket failed )
Do you know what is causing this? Could it be from parallel-fastq-dump
making too many requests or is it more likely a problem in general with the connection to ncbi? The latter seems more likely as other people seem to be experiencing the same issue
Thanks again
yes, that connection error is from fastq-dump connecting to the server. its usually benign (because it will retry the connection automatically), unless you are getting too many of these errors at once, in this case it could mean you have too many connections open, so try reducing the number of threads.
Okay perfect, thanks for your help
Hi,
I am trying to download several reads. Eg using the following command:
But when I do, I get this error message:
I am running this on an AWS EC2 instance which has plently of disk space and yet I am still getting this error. However, I am not sure if it is because
parallel-fastq-dump
is mounting volumes and running it there which have less disk space. It looks like these volumes are mounted on the instance and I am not sure how else I might have created them.Do you know if this is the case and if so how I can change the location where the command is run to prevent this error.
Also do you know how I can get it to run any faster. For example do you know how
parallel-fastq-dump
compares tofasterq-dump
. I tried running that instead and it seems like it may be a bit slower. Or what is the optimum value to set--threads
to? For example, I knowfasterq-dump
for the value of th has diminishing returns, doesparallel-fastq
also have the same?