ropensci / onekp

Access sequences from the 1000 Plant Initiative (1KP)
https://docs.ropensci.org/onekp
Other
12 stars 4 forks source link

Error while downloading .fna files after filter_by_code #4

Closed StevinWilson closed 4 years ago

StevinWilson commented 4 years ago

Only a fraction of the .fna files (the smaller ones) gets downloaded when I run the following commands. I believe the links for the larger files get redirected to the Google Drive webpage to notify the user about the file being too big for an anti-virus scan.

Commands run

library(onekp)
onekp <- retrieve_onekp()
seqs <- filter_by_code(onekp, c('URDJ','WTKZ','PZRT','ROAP','VZCI',
                                'NWMY','FZJL','NPND','DDEV','IFCJ','WKSU',
                                'WAIL','FALI','WPHN','MAQO','BSVG','BCGB'))
download_nucleotides(seqs, 'oneKP/nuc')

Error message

trying URL 'https://drive.google.com/uc?export=download&id=1rOA7Iec-BN6ymGdnlm4B-8ZDuLD7igMM'
downloaded 3253 bytes

tar (child): oneKP/nuc/URDJ.fna.tar.bz2: Cannot open: No such file or directory
tar (child): Error is not recoverable: exiting now
/usr/bin/tar: Child returned status 2
/usr/bin/tar: Error is not recoverable: exiting now
tar (child): oneKP/nuc/URDJ.fna.tar.bz2: Cannot open: No such file or directory
tar (child): Error is not recoverable: exiting now
/usr/bin/tar: Child returned status 2
/usr/bin/tar: Error is not recoverable: exiting now
trying URL 'https://drive.google.com/uc?export=download&id=1w-snIZ-WIc8VeTep6Y48EbzLwwfwLZgJ'
downloaded 3253 bytes

bzip2: (stdin) is not a bzip2 file.
/usr/bin/tar: Child returned status 2
/usr/bin/tar: Error is not recoverable: exiting now
bzip2: (stdin) is not a bzip2 file.
/usr/bin/tar: Child returned status 2
/usr/bin/tar: Error is not recoverable: exiting now
trying URL 'https://drive.google.com/uc?export=download&id=1h1K4mX8ksZfRxs6Y8tSDDQOI2uRF9CE1'

Warning messages:
1: untar(compressed=) is deprecated 
2: In system(cmd, intern = TRUE) :
  running command '/usr/bin/tar -jtf 'oneKP/nuc/URDJ.fna.tar.bz2'' had status 2
3: untar(compressed=) is deprecated 
4: In untar(path, compressed = "bzip2", exdir = dir) :
  ‘/usr/bin/tar -jxf 'oneKP/nuc/URDJ.fna.tar.bz2' -C '/tmp/RtmpjP61OC/onekp_sequences'’ returned error code 2
5: In file.remove(path) :
  cannot remove file 'oneKP/nuc/URDJ.fna.tar.bz2', reason 'No such file or directory'
6: untar(compressed=) is deprecated 
7: In system(cmd, intern = TRUE) :
  running command '/usr/bin/tar -jtf 'oneKP/nuc/WTKZ.fna.tar.bz2'' had status 2
8: untar(compressed=) is deprecated 
9: In untar(path, compressed = "bzip2", exdir = dir) :
  ‘/usr/bin/tar -jxf 'oneKP/nuc/WTKZ.fna.tar.bz2' -C '/tmp/RtmpjP61OC/onekp_sequences'’ returned error code 2
Session Info ``` R version 3.6.1 (2019-07-05) Platform: x86_64-pc-linux-gnu (64-bit) Running under: Manjaro Linux Matrix products: default BLAS: /usr/lib/libopenblasp-r0.3.7.so LAPACK: /usr/lib/liblapack.so.3.8.0 locale: [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C LC_TIME=en_US.UTF-8 [4] LC_COLLATE=en_US.UTF-8 LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8 [7] LC_PAPER=en_US.UTF-8 LC_NAME=C LC_ADDRESS=C [10] LC_TELEPHONE=C LC_MEASUREMENT=en_IN LC_IDENTIFICATION=C attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] onekp_0.2.2 loaded via a namespace (and not attached): [1] Rcpp_1.0.2 rstudioapi_0.10 xml2_1.2.2 [4] magrittr_1.5 rappdirs_0.3.1 bit_1.1-14 [7] tidyselect_0.2.5 rvest_0.3.4 R6_2.4.0 [10] rlang_0.4.1 hoardr_0.5.2 blob_1.2.0 [13] stringr_1.4.0 httr_1.4.1 dplyr_0.8.3 [16] tools_3.6.1 DBI_1.0.0 dbplyr_1.4.2 [19] selectr_0.4-1 taxizedb_0.1.9.9130 digest_0.6.22 [22] bit64_0.9-7 assertthat_0.2.1 tibble_2.1.3 [25] crayon_1.3.4 purrr_0.3.3 vctrs_0.2.0 [28] zeallot_0.1.0 curl_4.2 memoise_1.1.0 [31] glue_1.3.1 RSQLite_2.1.2 stringi_1.4.3 [34] compiler_3.6.1 pillar_1.4.2 backports_1.1.5 [37] pkgconfig_2.0.3 ```
arendsee commented 4 years ago

@StevinWilson Thanks for the report and apologies for the very late reply. This last commit should have fixed it. You get a free cookie : )