Closed laiajulianamu closed 4 months ago
Dear Laia, did you check the troubleshooting section ? I think this error is documented.
I'm unable to run the data you sent (even if the strata csv is converted to tsv)
Dear Thierry, Thank you so much for your response. Yes, I did check the the troubleshooting section and I added to my code the parallel.core =1. However, After that I am having another error
Step 1. Coverage visualization and helper table
Generating statistics Error in UseMethod("mutate") : no applicable method for 'mutate' applied to an object of class "list" In addition: There were 50 or more warnings (use warnings() to see the first 50) ✖ Writing files [61ms]
I already updated tidyverse following some suggestion from google butI am still traped in this point. Sadly I was not able to upload my data base because it has more than 178 MB. Thank you so muchchecked the troubleshooting section and added the parallel to my code, Laia
Send the real data by email I'll have a look
Don't share full dataset to the public
If not possible try making a smaller dataset that generate the same error so that I can reproduce
With my dataset I don't have any...
First test with your data works
test <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows
fstcore package v0.9.18
(OpenMP detected, using 56 threads)
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240514@1226.gds.rad
Number of chrom: 1
Number of locus: 45544
Number of SNPs: 76246
Number of strata: 6
Number of individuals: 479
Number of ind/strata:
PER = 47
MEX = 99
GAL = 216
COL = 32
GM = 11
ITW = 74
Number of duplicate id: 0
Computation time, overall: 102 sec
Using radiator::filter_rad
I do get an error, I think it's link to using DArT SNP format other than count (which I no longer use, favouring counts). I'll have a look for a fix.
This should not take long
Ok so I see the problem, it's during the coverage filter, when using DArT data without counts, the coverage information is based on average and the dataset provided by DArT.
I don't filter coverage when using 1 row and 2 rows provided by DArT, for several reasons that were mentioned during the workshop you attended. Simplify: By the time I am ready to look at the dataset coverage, 100% of the time I have already discarded samples. The statistics provided by DArT becomes obviously biased because it's using the full dataset...
I don't like that so I prefer not to filter it and advertise the idea to 1. get the FQ files for good bookkeeping 2. get the count files...
I will still fix the issue not to throw an error
Dear Thierry,
Thank you so much. Yes, I understand that I should avoid the coverage filter, however, when I tried to exclude (using filter_coverage = FALSE), the code did not run exclueded it.
Unknowned arguments identified inside "...": filter_coverage
Thank you again
In the earlier days I was trying to accommodate all format, now I'm more picky. I had plan to re-integrate the coverage statistics generated by DArT for 1row and 2rows format but I slept on it and I really thing it's almost 100% of the time biased with the dataset that I have seen, for the single reason that samples will eventually get blacklisted (removed) and the stats becomes biased. Request DArT count file the next time. Now the function will bypass the coverage for those files.
Coverage statistics information for 2rows and 1row DArT format is ignored
radiator cannot generate coverage statistics from source
potential blacklisted individuals bias the statistics generated by DArT
request DArT count format from DArT
Coverate information is not available for this dataset, returning GDS...
Latest radiator version should fix your problem
For your info... For your dataset exploration here is the filters thresholds I used:
filter_rad
without removing the samples with weird heterozygosity you end up creating false close-kin (it's the next filter that looks for duplicated samples, but provide the figure that shows distances). Removing those 57 samples really end up with a good dataset and highlight real relationships.Removing
Not removing
At the end of this first filtering exploration you have 18 427 SNPs and 387 samples
Hope this helps, re-open an issue if you are still having problems
In case you are wondering some of the differences between radiator and dartR...
Quickly if you want to test the mixed genomes (individual heterozygosity) or duplicated genomes (duplicate samples and close kin detection) without any filters:
data <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
#103 sec on my computer
mix <- radiator::detect_mixed_genomes(data = data)
#54 sec
dup <- radiator::detect_duplicate_genomes(data = data)
#47 sec
tictoc::tic()
data.gl <- dartR.base::gl.read.dart(filename = "SNPS1.csv", ind.metafile = "strata1_dartR.csv")
tictoc::toc()
# 167.62 sec elapsed
# longer to process and file is not saved...
tictoc::tic()
test.dartr.het <- dartR.base::gl.report.heterozygosity(x = data.gl, save2tmp = TRUE)
tictoc::toc()
# 104.738 sec elapsed
With radiator in less than 5 minutes you have the dataset imported in R and saved in the directory + 2 analysis that gives a pretty good idea of what filters will be required for your dataset and the problem encountered in the field or wet lab
dartR
gl.report.parent.offspring
function covers really what I want to see in this type of analysis. For parent/offspring analysis they are far better approaches. Here, it's all about duplicates and artifact samples.radiator:: detect_duplicate_genomes
you really see the impact of letting individual heterozygosity unchecked. The impact of biased close kin or artefactual close kin is quite impressive (the pair of samples around the 0.5 threshold on the figure...)Hope this helps highlight the differences in the 2 packages
Dear Thierry,
It is awesome, thank you so much; I also ran dartR and am seeing the differences. Sadly, I am struggling with a new error with the radiator righ now; it sounds like it is related with SNPrate.
Computation time, overall: 241 sec
######################## completed detect_mixed_genomes ########################
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240515@1021
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240515@1021.tsv
File written: random.seed (306779)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)
Could you post the exact command that you used, I've ran the analysis 4-5 times on your data without problems first try this:
data <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals
dup <- radiator::detect_duplicate_genomes(data = data)
does it work?
I am running the command below following all your instructions. However, I was not able to finish with the two last filters.
data <- radiator::filter_rad( data = "SNPS1.csv", strata = "stratayft.tsv", output = c("genind", "stockr", "genpop", "genlight"), parallel.core = 1 )
Below are the lines that you asked me to run: ###################### completed detect_duplicate_genomes ######################
data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv") Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows Generating genotypes and calibrating REF/ALT alleles... Number of markers recalibrated based on counts of allele: 11707 Generating GDS... File written: radiator_20240515@1113.gds.rad
Number of chrom: 1 Number of locus: 45544 Number of SNPs: 76246 Number of strata: 6 Number of individuals: 479
Number of ind/strata: PER = 47 MEX = 99 GAL = 216 COL = 32 GM = 11 ITW = 74
Number of duplicate id: 0
Computation time, overall: 90 sec
mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals ################################################################################ ######################## radiator::detect_mixed_genomes ######################## ################################################################################ Execution date@time: 20240515@1114 Folder created: 09_detect_mixed_genomes_20240515@1114 File written: radiator_detect_mixed_genomes_args_20240515@1114.tsv
Filters parameters file generated: filters_parameters_20240515@1114.tsv Error indplyr::mutate()
: ℹ In argument:MISSING_PROP = round(...)
. Caused by error in.DynamicClusterCall()
: ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1113\radiator_20240515@1113.gds.rad'. The process cannot access the file because it is being used by another process. Runrlang::last_trace()
to see where the error occurred.
Computation time, overall: 10 sec ######################## completed detect_mixed_genomes ########################
dup <- radiator::detect_duplicate_genomes(data = data) ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240515@1115 Folder created: 10_detect_duplicate_genomes_20240515@1115 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240515@1115.tsv File written: random.seed (747612)
Filters parameters file generated: filters_parameters_20240515@1115.tsv Error indplyr::mutate()
: ℹ In argument:MISSING_PROP = round(...)
. Caused by error in.DynamicClusterCall()
: ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1113\radiator_20240515@1113.gds.rad'. The process cannot access the file because it is being used by another process. Runrlang::last_trace()
to see where the error occurred.
Computation time, overall: 10 sec ###################### completed detect_duplicate_genomes ######################
I restarted everything, but I am still having the same problem
data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv")
Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows
fstcore package v0.9.18
(OpenMP detected, using 16 threads)
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240515@1528.gds.rad
Number of chrom: 1 Number of locus: 45544 Number of SNPs: 76246 Number of strata: 6 Number of individuals: 479
Number of ind/strata: PER = 47 MEX = 99 GAL = 216 COL = 32 GM = 11 ITW = 74
Number of duplicate id: 0
Computation time, overall: 118 sec Warning message: package ‘fstcore’ was built under R version 4.3.3
mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals ################################################################################ ######################## radiator::detect_mixed_genomes ######################## ################################################################################ Execution date@time: 20240515@1530 Folder created: 12_detect_mixed_genomes_20240515@1530 File written: radiator_detect_mixed_genomes_args_20240515@1530.tsv
Filters parameters file generated: filters_parameters_20240515@1530.tsv Error indplyr::mutate()
: ℹ In argument:MISSING_PROP = round(...)
. Caused by error in.DynamicClusterCall()
: ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1528\radiator_20240515@1528.gds.rad'. The process cannot access the file because it is being used by another process. Runrlang::last_trace()
to see where the error occurred.
Computation time, overall: 11 sec ######################## completed detect_mixed_genomes ########################
dup <- radiator::detect_duplicate_genomes(data = data) ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240515@1530 Folder created: 13_detect_duplicate_genomes_20240515@1530 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240515@1530.tsv File written: random.seed (622500)
Filters parameters file generated: filters_parameters_20240515@1530.tsv Error indplyr::mutate()
: ℹ In argument:MISSING_PROP = round(...)
. Caused by error in.DynamicClusterCall()
: ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1528\radiator_20240515@1528.gds.rad'. The process cannot access the file because it is being used by another process. Runrlang::last_trace()
to see where the error occurred.
And I am still with the same error :(
Computation time, overall: 200 sec
######################## completed detect_mixed_genomes ########################
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240515@1552
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240515@1552.tsv
File written: random.seed (891654)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)
Computation time, overall: 1 sec ###################### completed detect_duplicate_genomes ######################
Write down below the output of this command: devtools::session_info()
devtools::session_info() ─ Session info ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── setting value version R version 4.3.2 (2023-10-31 ucrt) os Windows 11 x64 (build 22631) system x86_64, mingw32 ui RStudio language (EN) collate English_United States.utf8 ctype English_United States.utf8 tz America/Chicago date 2024-05-16 rstudio 2023.12.1+402 Ocean Storm (desktop) pandoc NA
─ Packages ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── package version date (UTC) lib source ade4 1.7-22 2023-02-06 [1] CRAN (R 4.3.3) adegenet 2.1.10 2023-01-26 [1] CRAN (R 4.3.3) ape 5.8 2024-04-11 [1] CRAN (R 4.3.3) BiocManager 1.30.23 2024-05-04 [1] CRAN (R 4.3.3) boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3) cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2) class 7.3-22 2023-05-03 [1] CRAN (R 4.3.2) classInt 0.4-10 2023-09-05 [1] CRAN (R 4.3.3) cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2) cluster 2.1.6 2023-12-01 [1] CRAN (R 4.3.3) codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3) colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2) crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2) dartR.base 0.88 2024-04-10 [1] Github (green-striped-gecko/dartR.base@c5ebf2a) dartR.data 1.0.4 2024-04-10 [1] Github (green-striped-gecko/dartR.data@4f93e38) dartRverse 0.51 2023-12-01 [1] CRAN (R 4.3.3) data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2) DBI 1.2.2 2024-02-16 [1] CRAN (R 4.3.2) devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3) digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2) doParallel 1.0.17 2022-02-07 [1] CRAN (R 4.3.2) dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2) e1071 1.7-14 2023-12-06 [1] CRAN (R 4.3.3) ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2) fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2) fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2) foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2) fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2) gdistance 1.6.4 2023-06-19 [1] CRAN (R 4.3.3) gdsfmt 1.39.3 2024-05-15 [1] Github (zhengxwen/gdsfmt@c15b23e) generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2) ggplot2 3.5.1.9000 2024-05-14 [1] Github (tidyverse/ggplot2@54cbb98) glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2) gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3) gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3) hierfstat 0.5-11 2022-05-05 [1] CRAN (R 4.3.3) htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2) htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2) httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2) igraph 2.0.3 2024-03-13 [1] CRAN (R 4.3.3) iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2) KernSmooth 2.23-22 2023-07-10 [1] CRAN (R 4.3.2) later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2) lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2) LEA 3.14.0 2023-10-24 [1] Bioconductor lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2) magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2) MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2) Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2) melfuR 1.1 2024-04-06 [1] Github (pygmyperch/melfuR@5d6bafa) memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2) mgcv 1.9-1 2023-12-21 [1] CRAN (R 4.3.3) mime 0.12 2021-09-28 [1] CRAN (R 4.3.1) miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2) munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3) nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2) pak 0.7.2 2024-03-17 [1] CRAN (R 4.3.3) patchwork 1.2.0 2024-01-08 [1] CRAN (R 4.3.3) pegas 1.3 2023-12-13 [1] CRAN (R 4.3.3) permute 0.9-7 2022-01-27 [1] CRAN (R 4.3.3) pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2) pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3) pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2) pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2) plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2) polysat 1.7-7 2022-08-23 [1] CRAN (R 4.3.3) poppr 2.9.6 2024-03-15 [1] CRAN (R 4.3.3) profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2) promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2) proxy 0.4-27 2022-06-09 [1] CRAN (R 4.3.3) purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2) R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2) radiator 1.3.3 2024-05-15 [1] Github (thierrygosselin/radiator@a2d63d5) raster 3.6-26 2023-10-14 [1] CRAN (R 4.3.3) Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2) remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3) reshape2 1.4.4 2020-04-09 [1] CRAN (R 4.3.2) rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2) rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3) scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2) seqinr 4.2-36 2023-12-08 [1] CRAN (R 4.3.3) sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2) sf 1.0-16 2024-03-24 [1] CRAN (R 4.3.3) shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3) SNPRelate 1.37.5 2024-05-15 [1] Github (zhengxwen/SNPRelate@0996a58) sp 2.1-4 2024-04-30 [1] CRAN (R 4.3.3) StAMPP 1.6.3 2021-08-08 [1] CRAN (R 4.3.3) stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2) stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2) terra 1.7-71 2024-01-31 [1] CRAN (R 4.3.3) tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2) tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2) tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2) units 0.8-5 2023-11-28 [1] CRAN (R 4.3.3) urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2) usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3) utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2) vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2) vegan 2.6-6 2024-05-14 [1] CRAN (R 4.3.2) xml2 1.3.6 2023-12-04 [1] CRAN (R 4.3.2) xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2)
[1] C:/Program Files/R/R-4.3.2/library
Dear Thierry,
I hope you are doing great. I wonder if you have updates about my last bug. I am still having the same problem, and I definitely want to use Radiator for my data analyses. Thank you so much, Laia
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240531@1825
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240531@1825.tsv
File written: random.seed (118851)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)
Computation time, overall: 2 sec ###################### completed detect_duplicate_genomes ######################
Computation time, overall: 412 sec ############################# completed filter_rad #############################
Try the last version, re-open if problem still persist
Dear Thierry,
I am so sorry, but the problem persists. I tried with two different computers.
Computation time, overall: 21 sec
######################## completed detect_mixed_genomes ########################
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240605@1634
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240605@1634.tsv
File written: random.seed (16040)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)
Computation time, overall: 1 sec ###################### completed detect_duplicate_genomes ######################
Computation time, overall: 307 sec ############################# completed filter_rad #############################
devtools::session_info() ─ Session info ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── setting value version R version 4.3.2 (2023-10-31 ucrt) os Windows 11 x64 (build 22631) system x86_64, mingw32 ui RStudio language (EN) collate English_United States.utf8 ctype English_United States.utf8 tz America/Chicago date 2024-06-05 rstudio 2024.04.1+748 Chocolate Cosmos (desktop) pandoc NA
─ Packages ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── package version date (UTC) lib source ade4 1.7-22 2023-02-06 [1] CRAN (R 4.3.3) adegenet 2.1.10 2023-01-26 [1] CRAN (R 4.3.3) ape 5.8 2024-04-11 [1] CRAN (R 4.3.3) backports 1.5.0 2024-05-23 [1] CRAN (R 4.3.3) BiocGenerics 0.48.1 2023-11-01 [1] Bioconductor BiocManager 1.30.23 2024-05-04 [1] CRAN (R 4.3.3) Biostrings 2.70.3 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3) bit 4.0.5 2022-11-15 [1] CRAN (R 4.3.2) bit64 4.0.5 2020-08-30 [1] CRAN (R 4.3.2) bitops 1.0-7 2021-04-24 [1] CRAN (R 4.3.1) boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3) broom 1.0.6 2024-05-17 [1] CRAN (R 4.3.3) cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2) class 7.3-22 2023-05-03 [1] CRAN (R 4.3.2) classInt 0.4-10 2023-09-05 [1] CRAN (R 4.3.3) cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2) cluster 2.1.6 2023-12-01 [1] CRAN (R 4.3.3) codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3) colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2) crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2) curl 5.2.1 2024-03-01 [1] CRAN (R 4.3.3) dartR.base 0.88 2024-04-10 [1] Github (green-striped-gecko/dartR.base@c5ebf2a) dartR.data 1.0.4 2024-04-10 [1] Github (green-striped-gecko/dartR.data@4f93e38) dartRverse 0.51 2023-12-01 [1] CRAN (R 4.3.3) data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2) DBI 1.2.3 2024-06-02 [1] CRAN (R 4.3.3) devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3) digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2) doParallel 1.0.17 2022-02-07 [1] CRAN (R 4.3.2) dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2) e1071 1.7-14 2023-12-06 [1] CRAN (R 4.3.3) ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2) fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2) farver 2.1.2 2024-05-13 [1] CRAN (R 4.3.3) fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2) foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2) fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2) fst 0.9.8 2022-02-08 [1] CRAN (R 4.3.3) fstcore 0.9.18 2023-12-02 [1] CRAN (R 4.3.3) gdistance 1.6.4 2023-06-19 [1] CRAN (R 4.3.3) gdsfmt 1.39.3 2024-05-15 [1] Github (zhengxwen/gdsfmt@c15b23e) generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2) GenomeInfoDb 1.38.8 2024-03-15 [1] Bioconductor 3.18 (R 4.3.3) GenomeInfoDbData 1.2.11 2024-04-09 [1] Bioconductor GenomicRanges 1.54.1 2023-10-29 [1] Bioconductor ggplot2 3.5.1 2024-04-23 [1] CRAN (R 4.3.3) glmnet 4.1-8 2023-08-22 [1] CRAN (R 4.3.3) glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2) gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3) gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3) HardyWeinberg 1.7.8 2024-04-06 [1] CRAN (R 4.3.3) hierfstat 0.5-11 2022-05-05 [1] CRAN (R 4.3.3) hms 1.1.3 2023-03-21 [1] CRAN (R 4.3.2) htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2) htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2) httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2) igraph 2.0.3 2024-03-13 [1] CRAN (R 4.3.3) IRanges 2.36.0 2023-10-24 [1] Bioconductor iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2) jomo 2.7-6 2023-04-15 [1] CRAN (R 4.3.3) KernSmooth 2.23-22 2023-07-10 [1] CRAN (R 4.3.2) labeling 0.4.3 2023-08-29 [1] CRAN (R 4.3.1) later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2) lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2) LEA 3.14.0 2023-10-24 [1] Bioconductor lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2) lme4 1.1-35.3 2024-04-16 [1] CRAN (R 4.3.3) magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2) MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2) Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2) melfuR 1.1 2024-04-06 [1] Github (pygmyperch/melfuR@5d6bafa) memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2) mgcv 1.9-1 2023-12-21 [1] CRAN (R 4.3.3) mice 3.16.0 2023-06-05 [1] CRAN (R 4.3.3) mime 0.12 2021-09-28 [1] CRAN (R 4.3.1) miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2) minqa 1.2.7 2024-05-20 [1] CRAN (R 4.3.3) mitml 0.4-5 2023-03-08 [1] CRAN (R 4.3.3) munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3) nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2) nloptr 2.0.3 2022-05-26 [1] CRAN (R 4.3.2) nnet 7.3-19 2023-05-03 [1] CRAN (R 4.3.2) pak 0.7.2 2024-03-17 [1] CRAN (R 4.3.3) pan 1.9 2023-08-21 [1] CRAN (R 4.3.3) patchwork 1.2.0 2024-01-08 [1] CRAN (R 4.3.3) pegas 1.3 2023-12-13 [1] CRAN (R 4.3.3) permute 0.9-7 2022-01-27 [1] CRAN (R 4.3.3) pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2) pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3) pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2) pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2) plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2) polysat 1.7-7 2022-08-23 [1] CRAN (R 4.3.3) poppr 2.9.6 2024-03-15 [1] CRAN (R 4.3.3) profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2) promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2) proxy 0.4-27 2022-06-09 [1] CRAN (R 4.3.3) purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2) R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2) radiator * 1.3.4 2024-06-05 [1] Github (thierrygosselin/radiator@3a8cb4f) ragg 1.3.2 2024-05-15 [1] CRAN (R 4.3.3) raster 3.6-26 2023-10-14 [1] CRAN (R 4.3.3) Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2) RCurl 1.98-1.14 2024-01-09 [1] CRAN (R 4.3.2) readr 2.1.5 2024-01-10 [1] CRAN (R 4.3.2) remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3) reshape2 1.4.4 2020-04-09 [1] CRAN (R 4.3.2) rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2) rpart 4.1.23 2023-12-05 [1] CRAN (R 4.3.3) Rsolnp 1.16 2015-12-28 [1] CRAN (R 4.3.3) rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3) S4Vectors 0.40.2 2023-11-23 [1] Bioconductor scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2) SeqArray 1.42.4 2024-04-03 [1] Bioconductor 3.18 (R 4.3.3) seqinr 4.2-36 2023-12-08 [1] CRAN (R 4.3.3) sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2) sf 1.0-16 2024-03-24 [1] CRAN (R 4.3.3) shape 1.4.6.1 2024-02-23 [1] CRAN (R 4.3.2) shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3) SNPRelate 1.37.5 2024-05-15 [1] Github (zhengxwen/SNPRelate@0996a58) sp 2.1-4 2024-04-30 [1] CRAN (R 4.3.3) StAMPP 1.6.3 2021-08-08 [1] CRAN (R 4.3.3) stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2) stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2) survival 3.6-4 2024-04-24 [1] CRAN (R 4.3.3) systemfonts 1.1.0 2024-05-15 [1] CRAN (R 4.3.3) terra 1.7-71 2024-01-31 [1] CRAN (R 4.3.3) textshaping 0.4.0 2024-05-24 [1] CRAN (R 4.3.3) tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2) tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2) tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2) truncnorm 1.0-9 2023-03-20 [1] CRAN (R 4.3.3) tzdb 0.4.0 2023-05-12 [1] CRAN (R 4.3.2) units 0.8-5 2023-11-28 [1] CRAN (R 4.3.3) UpSetR 1.4.0 2019-05-22 [1] CRAN (R 4.3.3) urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2) usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3) utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2) vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2) vegan 2.6-6 2024-05-14 [1] CRAN (R 4.3.2) viridisLite 0.4.2 2023-05-02 [1] CRAN (R 4.3.2) vroom 1.6.5 2023-12-05 [1] CRAN (R 4.3.2) withr 3.0.0 2024-01-16 [1] CRAN (R 4.3.2) xml2 1.3.6 2023-12-04 [1] CRAN (R 4.3.2) xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2) XVector 0.42.0 2023-10-24 [1] Bioconductor zlibbioc 1.48.2 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)
The code that you sent to me:
data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv")
Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240605@1636.gds.rad
Number of chrom: 1 Number of locus: 45544 Number of SNPs: 76246 Number of strata: 6 Number of individuals: 479
Number of ind/strata: PER = 47 MEX = 99 GAL = 216 COL = 32 GM = 11 ITW = 74
Number of duplicate id: 0
Computation time, overall: 112 sec
mix <- radiator::detect_mixed_genomes(data = data) (parallel.core = 1) # don't remove individuals ################################################################################ ######################## radiator::detect_mixed_genomes ######################## ################################################################################ Execution date@time: 20240605@1639 Folder created: 01_detect_mixed_genomes_20240605@1639 File written: radiator_detect_mixed_genomes_args_20240605@1639.tsv
Filters parameters file generated: filters_parameters_20240605@1639.tsv Error indplyr::mutate()
: ℹ In argument:MISSING_PROP = round(...)
. Caused by error in.DynamicClusterCall()
: ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240605@1636\radiator_20240605@1636.gds.rad'. The process cannot access the file because it is being used by another process. Runrlang::last_trace()
to see where the error occurred.
Computation time, overall: 10 sec ######################## completed detect_mixed_genomes ########################
dup <- radiator::detect_duplicate_genomes(data = data) ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240605@1639 Folder created: 02_detect_duplicate_genomes_20240605@1639 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240605@1639.tsv File written: random.seed (135945)
Filters parameters file generated: filters_parameters_20240605@1639.tsv Error indplyr::mutate()
: ℹ In argument:MISSING_PROP = round(...)
. Caused by error in.DynamicClusterCall()
: ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240605@1636\radiator_20240605@1636.gds.rad'. The process cannot access the file because it is being used by another process. Runrlang::last_trace()
to see where the error occurred.
Computation time, overall: 8 sec
data <- radiator::filter_rad( data = "SNPS1.csv", strata = "stratayft.tsv" )
Step 1. Minor Allele Statistics visualization and helper table
depth info not available, switching to count Error in .DynamicClusterCall(cl, length(cl), .fun = function(.proc_idx, : One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\filter_rad_20240513@2143\01_radiator\radiator_20240513@2143.gds.rad'. The process cannot access the file because it is being used by another process. In addition: There were 50 or more warnings (use warnings() to see the first 50) ✖ Calculating Minor Allele statistics [15.2s]
Computation time, overall: 16 sec ############################# completed filter_ma ##############################
Computation time, overall: 172 sec ############################# completed filter_rad ############################# Session info ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── setting value version R version 4.3.2 (2023-10-31 ucrt) os Windows 11 x64 (build 22631) system x86_64, mingw32 ui RStudio language (EN) collate English_United States.utf8 ctype English_United States.utf8 tz America/Chicago date 2024-05-13 rstudio 2023.12.1+402 Ocean Storm (desktop) pandoc NA
─ Packages ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── package version date (UTC) lib source backports 1.4.1 2021-12-13 [1] CRAN (R 4.3.1) BiocGenerics 0.48.1 2023-11-01 [1] Bioconductor BiocManager 1.30.22 2023-08-08 [1] CRAN (R 4.3.3) Biostrings 2.70.3 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3) bit 4.0.5 2022-11-15 [1] CRAN (R 4.3.2) bit64 4.0.5 2020-08-30 [1] CRAN (R 4.3.2) bitops 1.0-7 2021-04-24 [1] CRAN (R 4.3.1) boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3) broom 1.0.5 2023-06-09 [1] CRAN (R 4.3.2) cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2) cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2) codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3) colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2) crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2) data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2) devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3) digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2) dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2) ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2) fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2) farver 2.1.1 2022-07-06 [1] CRAN (R 4.3.2) fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2) foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2) fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2) fst 0.9.8 2022-02-08 [1] CRAN (R 4.3.3) fstcore 0.9.18 2023-12-02 [1] CRAN (R 4.3.3) gdsfmt 1.38.0 2023-10-24 [1] Bioconductor generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2) GenomeInfoDb 1.38.8 2024-03-15 [1] Bioconductor 3.18 (R 4.3.3) GenomeInfoDbData 1.2.11 2024-04-09 [1] Bioconductor GenomicRanges 1.54.1 2023-10-29 [1] Bioconductor ggplot2 3.5.0.9000 2024-04-11 [1] Github (tidyverse/ggplot2@1050f09) glmnet 4.1-8 2023-08-22 [1] CRAN (R 4.3.3) glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2) gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3) gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3) HardyWeinberg 1.7.8 2024-04-06 [1] CRAN (R 4.3.3) hms 1.1.3 2023-03-21 [1] CRAN (R 4.3.2) htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2) htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2) httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2) IRanges 2.36.0 2023-10-24 [1] Bioconductor iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2) jomo 2.7-6 2023-04-15 [1] CRAN (R 4.3.3) labeling 0.4.3 2023-08-29 [1] CRAN (R 4.3.1) later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2) lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2) lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2) lme4 1.1-35.3 2024-04-16 [1] CRAN (R 4.3.3) magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2) MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2) Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2) memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2) mice 3.16.0 2023-06-05 [1] CRAN (R 4.3.3) mime 0.12 2021-09-28 [1] CRAN (R 4.3.1) miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2) minqa 1.2.6 2023-09-11 [1] CRAN (R 4.3.2) mitml 0.4-5 2023-03-08 [1] CRAN (R 4.3.3) munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3) nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2) nloptr 2.0.3 2022-05-26 [1] CRAN (R 4.3.2) nnet 7.3-19 2023-05-03 [1] CRAN (R 4.3.2) pan 1.9 2023-08-21 [1] CRAN (R 4.3.3) pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2) pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3) pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2) pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2) plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2) profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2) promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2) purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2) R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2) radiator 1.3.0 2024-04-09 [1] Github (thierrygosselin/radiator@2eb1dfa) ragg 1.3.0 2024-03-13 [1] CRAN (R 4.3.3) Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2) RCurl 1.98-1.14 2024-01-09 [1] CRAN (R 4.3.2) readr 2.1.5 2024-01-10 [1] CRAN (R 4.3.2) remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3) rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2) rpart 4.1.23 2023-12-05 [1] CRAN (R 4.3.3) Rsolnp 1.16 2015-12-28 [1] CRAN (R 4.3.3) rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3) S4Vectors 0.40.2 2023-11-23 [1] Bioconductor scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2) SeqArray 1.42.4 2024-04-03 [1] Bioconductor 3.18 (R 4.3.3) sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2) shape 1.4.6.1 2024-02-23 [1] CRAN (R 4.3.2) shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3) SNPRelate 1.36.1 2024-02-27 [1] Bioconductor 3.18 (R 4.3.2) stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2) stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2) survival 3.5-8 2024-02-14 [1] CRAN (R 4.3.3) systemfonts 1.0.6 2024-03-07 [1] CRAN (R 4.3.3) textshaping 0.3.7 2023-10-09 [1] CRAN (R 4.3.2) tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2) tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2) tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2) truncnorm 1.0-9 2023-03-20 [1] CRAN (R 4.3.3) tzdb 0.4.0 2023-05-12 [1] CRAN (R 4.3.2) UpSetR 1.4.0 2019-05-22 [1] CRAN (R 4.3.3) urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2) usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3) utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2) vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2) vroom 1.6.5 2023-12-05 [1] CRAN (R 4.3.2) withr 3.0.0 2024-01-16 [1] CRAN (R 4.3.2) xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2) XVector 0.42.0 2023-10-24 [1] Bioconductor zlibbioc 1.48.2 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)
[1] C:/Program Files/R/R-4.3.2/library
result.csv rstrataesult.csv