Open nemitheasura opened 2 years ago
Hi @nemitheasura ,
Hopefully some of the above points would provide clues to what is going on... I have not worked with R myself, but hdf should work transparently... Do you know what version of hdf is actualy used internally by the R package?
Thanks,
Hi @jorj1988 ,
Ad1. The error message is rather vague, I'd say (screen attached).
Ad2. I tried all 3 releases from the repo (1.0.0, 1.0.1 and 1.0.2) with the same result.
Ad3. Hdfview shows typical tree-like structure of multifast5 file. The vbz compressed signal data can be displayed too (screens attached).
Ad4. normal, ordinary output. Nothing suspicious (screen attached, dumped with -H option)
Ad5. yes
Ad.6 Currently, I use HDF5 1.12.2, but I also tested all releases from 1.10 to the newest one, hoping that it would help somehow. Nothing helped.
EDIT: SOLUTION
While attempting to debug this, I encountered a following thread on SO: https://stackoverflow.com/questions/72067335/r-session-aborted-and-r-encountered-a-fatal-error
The problem described here has nothing in common with the plugin and nanopore data, however this also concerns fatal error produced by R/Rstudio. In all previous attempts I used official R releases for Windows from recommended installation packages with default options. This time I gave a try and installed patched R 4.2.1 build for Windows from the following link mentioned in this thread: https://cran.r-project.org/bin/windows/base/rpatched.html
Then I installed compatible Rtools and devtools and rhdf5 package from bioconductor (as it is absent from CRAN for some reason). The session_info() showed me that there is some issue with rhdf5filters (md5sum differs) so I went to the github repository and installed rhdf5filters directly from there (https://github.com/grimbough/rhdf5filters).
After all of those steps I went to the plugin directory, copied the plugin and library files and placed them into the lib directory within rhdf5filters dir (in my case it was C:\Users\nemitheasura\AppData\Local\R\win-library\4.2\rhdf5filters\lib\x64)
This finally solved the issue. The plugin is now recognized properly by R and the fast5 files are completely browsable. I post it here in case if someone would encounter the same issue while working with vbz compressed nanopore data in R.
Best, N.
Hello, I am asking you for troubleshooting regarding using vbz plugin & R.
I want to access raw signal data stored within fast5 files and plot the signals in R (more specific: I want to create a Rmarkdown document with some plots). I have a vast collection of fast5 files- some are older, some are new, some were compressed with vbz algo and some are using older compression methods.
Before I decided to post an issue, I did my research - e.g. followed the hints posted there: https://github.com/nanoporetech/vbz_compression/issues/20 https://github.com/adnaniazi/tailfindr/issues/12
I want my document to be usable on Linux and Win operating systems. I have Linux Mint Vanessa and Win11.
All necessary dependencies to access and browse hdf5 files are installed on both systems - according to tutorials provided and to aforementioned threads (above). The hdf5 libraries, R packages (rhdf5, hdf5filters, Rhdf5lib) and Rtools in case of Windows are present. Off corse the plugin itself too. The plugin file was also copied to the rhdf5filters directory, so it would be accessible from within R. All software was installed in standard directories, no custom dirs were used.
This strategy allows me to access & plot signals from vbz compressed fast5 files only on Linux. On Windows it crashes R session immediately throwing a fatal error.
I am even unable to access the signal data from within R on Windows11, not to mention plotting. Just accessing & trying to store them as vec. Not a chance - fatal error, session restart.
Interestingly, the fast5 file itself is accessible, I can browse sections which were not subjected to vbz compression, so for instance, I can access and display configuration data, sampling rate, minKNOW version etc. I am only unable to access the signal data.
Off course I can change the compression algo and then plot my files under Win11 OS, but this is additional step, which I would like to avoid since there is a plugin which should solve the problem without using h5repack and convert the data back and forth with ont-fast5-api.
Also, solution provided in this thread https://github.com/adnaniazi/tailfindr/issues/12 allows me to produce a nonempty csv file using test data provided by tailfindr maintainer, while on my data, on Windows, the produced file is blank.
This is not a problem on Linux Mint and a few other Linux distros I have already tested. The problem seems to appear only on Windows. It is just like this plugin would require other actions to be recognizable/accessible by R.
Here is my session_info() on Windows (this was initially a Win10 upgraded to 11).
Session info ------------------------------------------------------------------------------------------------------- setting value version R version 4.1.3 (2022-03-10) os Windows 10 x64 (build 22000) system x86_64, mingw32 ui RStudio language (EN) collate Polish_Poland.1250 ctype Polish_Poland.1250 tz Europe/Warsaw date 2022-08-25 rstudio 2022.07.1+554 Spotted Wakerobin (desktop) pandoc 2.18 @ C:/Program Files/RStudio/bin/quarto/bin/tools/ (via rmarkdown)
Packages ----------------------------------------------------------------------------------------------------------- package version date (UTC) lib source assertive 0.3-6 2020-08-01 [1] CRAN (R 4.1.3) assertive.base 0.0-9 2021-02-08 [1] CRAN (R 4.1.3) assertive.code 0.0-3 2018-10-21 [1] CRAN (R 4.1.3) assertive.data 0.0-3 2018-11-21 [1] CRAN (R 4.1.3) assertive.data.uk 0.0-2 2018-10-21 [1] CRAN (R 4.1.3) assertive.data.us 0.0-2 2018-10-21 [1] CRAN (R 4.1.3) assertive.datetimes 0.0-3 2020-07-30 [1] CRAN (R 4.1.3) assertive.files 0.0-2 2016-05-10 [1] CRAN (R 4.1.3) assertive.matrices 0.0-2 2018-11-20 [1] CRAN (R 4.1.3) assertive.models 0.0-2 2018-10-21 [1] CRAN (R 4.1.3) assertive.numbers 0.0-2 2016-05-09 [1] CRAN (R 4.1.3) assertive.properties 0.0-5 2022-04-21 [1] CRAN (R 4.1.3) assertive.reflection 0.0-5 2020-07-31 [1] CRAN (R 4.1.3) assertive.sets 0.0-3 2016-12-30 [1] CRAN (R 4.1.3) assertive.strings 0.0-3 2016-05-10 [1] CRAN (R 4.1.3) assertive.types 0.0-3 2016-12-30 [1] CRAN (R 4.1.3) assertthat 0.2.1 2019-03-21 [1] CRAN (R 4.1.3) base64enc 0.1-3 2015-07-28 [1] CRAN (R 4.1.1) bit 4.0.4 2020-08-04 [1] CRAN (R 4.1.3) bit64 4.0.5 2020-08-30 [1] CRAN (R 4.1.3) cachem 1.0.6 2021-08-19 [1] CRAN (R 4.1.3) callr 3.7.2 2022-08-22 [1] CRAN (R 4.1.3) cli 3.3.0 2022-04-25 [1] CRAN (R 4.1.3) codetools 0.2-18 2020-11-04 [2] CRAN (R 4.1.3) crayon 1.5.1 2022-03-26 [1] CRAN (R 4.1.3) curl 4.3.2 2021-06-23 [1] CRAN (R 4.1.3) DBI 1.1.3 2022-06-18 [1] CRAN (R 4.1.3) devtools 2.4.4 2022-07-20 [1] CRAN (R 4.1.3) digest 0.6.29 2021-12-01 [1] CRAN (R 4.1.3) doSNOW 1.0.20 2022-02-04 [1] CRAN (R 4.1.3) dplyr 1.0.9 2022-04-28 [1] CRAN (R 4.1.3) ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.1.3) evaluate 0.16 2022-08-09 [1] CRAN (R 4.1.3) fansi 1.0.3 2022-03-24 [1] CRAN (R 4.1.3) fastmap 1.1.0 2021-01-25 [1] CRAN (R 4.1.3) foreach 1.5.2 2022-02-02 [1] CRAN (R 4.1.3) fs 1.5.2 2021-12-08 [1] CRAN (R 4.1.3) generics 0.1.3 2022-07-05 [1] CRAN (R 4.1.3) glue 1.6.2 2022-02-24 [1] CRAN (R 4.1.3) htmltools 0.5.3 2022-07-18 [1] CRAN (R 4.1.3) htmlwidgets 1.5.4 2021-09-08 [1] CRAN (R 4.1.3) httpuv 1.6.5 2022-01-05 [1] CRAN (R 4.1.3) iterators 1.0.14 2022-02-05 [1] CRAN (R 4.1.3) jsonlite 1.8.0 2022-02-22 [1] CRAN (R 4.1.3) keras 2.9.0 2022-05-23 [1] CRAN (R 4.1.3) knitr 1.40 2022-08-24 [1] CRAN (R 4.1.3) later 1.3.0 2021-08-18 [1] CRAN (R 4.1.3) lattice 0.20-45 2021-09-22 [2] CRAN (R 4.1.3) lifecycle 1.0.1 2021-09-24 [1] CRAN (R 4.1.3) magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.1.3) Matrix 1.4-0 2021-12-08 [2] CRAN (R 4.1.3) memoise 2.0.1 2021-11-26 [1] CRAN (R 4.1.3) mime 0.12 2021-09-28 [1] CRAN (R 4.1.1) miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.1.3) ninetails 0.3.0 2022-08-24 [1] Github (LRB-IIMCB/ninetails@58a507c) pillar 1.8.1 2022-08-19 [1] CRAN (R 4.1.3) pkgbuild 1.3.1 2021-12-20 [1] CRAN (R 4.1.3) pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.1.3) pkgload 1.3.0 2022-06-27 [1] CRAN (R 4.1.3) png 0.1-7 2013-12-03 [1] CRAN (R 4.1.1) prettyunits 1.1.1 2020-01-24 [1] CRAN (R 4.1.3) processx 3.7.0 2022-07-07 [1] CRAN (R 4.1.3) profvis 0.3.7 2020-11-02 [1] CRAN (R 4.1.3) promises 1.2.0.1 2021-02-11 [1] CRAN (R 4.1.3) ps 1.7.1 2022-06-18 [1] CRAN (R 4.1.3) purrr 0.3.4 2020-04-17 [1] CRAN (R 4.1.3) R6 2.5.1 2021-08-19 [1] CRAN (R 4.1.3) Rcpp 1.0.9 2022-07-08 [1] CRAN (R 4.1.3) remotes 2.4.2 2021-11-30 [1] CRAN (R 4.1.3) reticulate 1.25 2022-05-11 [1] CRAN (R 4.1.3) rhdf5 2.38.1 2022-03-10 [1] Bioconductor rhdf5filters 1.9.0 2022-08-25 [1] Github (grimbough/rhdf5filters@ee10d4f) Rhdf5lib 1.16.0 2021-10-26 [1] Bioconductor rlang 1.0.4 2022-07-12 [1] CRAN (R 4.1.3) rmarkdown 2.16 2022-08-24 [1] CRAN (R 4.1.3) rprojroot 2.0.3 2022-04-02 [1] CRAN (R 4.1.3) rstudioapi 0.14 2022-08-22 [1] CRAN (R 4.1.3) sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.1.3) shiny 1.7.2 2022-07-19 [1] CRAN (R 4.1.3) snow 0.4-4 2021-10-27 [1] CRAN (R 4.1.1) stringi 1.7.6 2021-11-29 [1] CRAN (R 4.1.2) stringr 1.4.1 2022-08-20 [1] CRAN (R 4.1.3) tailfindr 1.3 2022-08-22 [1] Github (adnaniazi/tailfindr@7d17cc2) tensorflow 2.9.0 2022-05-21 [1] CRAN (R 4.1.3) tfruns 1.5.0 2021-02-26 [1] CRAN (R 4.1.3) tibble 3.1.8 2022-07-22 [1] CRAN (R 4.1.3) tidyselect 1.1.2 2022-02-21 [1] CRAN (R 4.1.3) tzdb 0.3.0 2022-03-28 [1] CRAN (R 4.1.3) urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.1.3) usethis 2.1.6 2022-05-25 [1] CRAN (R 4.1.3) utf8 1.2.2 2021-07-24 [1] CRAN (R 4.1.3) vctrs 0.4.1 2022-04-13 [1] CRAN (R 4.1.3) vroom 1.5.7 2021-11-30 [1] CRAN (R 4.1.3) whisker 0.4 2019-08-28 [1] CRAN (R 4.1.3) withr 2.5.0 2022-03-03 [1] CRAN (R 4.1.3) xfun 0.32 2022-08-10 [1] CRAN (R 4.1.3) xtable 1.8-4 2019-04-21 [1] CRAN (R 4.1.3) yaml 2.3.5 2022-02-21 [1] CRAN (R 4.1.2) zeallot 0.1.0 2018-01-28 [1] CRAN (R 4.1.3)
[1] D:/Dokumenty/R/win-library/4.1 [2] C:/Program Files/R/R-4.1.3/library