thierrygosselin / radiator

RADseq Data Exploration, Manipulation and Visualization using R
https://thierrygosselin.github.io/radiator/
GNU General Public License v3.0
59 stars 23 forks source link

Error in .DynamicClusterCall(cl, length(cl), .fun = function(.proc_idx #188

Closed laiajulianamu closed 4 months ago

laiajulianamu commented 5 months ago

data <- radiator::filter_rad( data = "SNPS1.csv", strata = "stratayft.tsv" )

Step 1. Minor Allele Statistics visualization and helper table

depth info not available, switching to count Error in .DynamicClusterCall(cl, length(cl), .fun = function(.proc_idx, : One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\filter_rad_20240513@2143\01_radiator\radiator_20240513@2143.gds.rad'. The process cannot access the file because it is being used by another process. In addition: There were 50 or more warnings (use warnings() to see the first 50) ✖ Calculating Minor Allele statistics [15.2s]

Computation time, overall: 16 sec ############################# completed filter_ma ##############################

Computation time, overall: 172 sec ############################# completed filter_rad ############################# Session info ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── setting value version R version 4.3.2 (2023-10-31 ucrt) os Windows 11 x64 (build 22631) system x86_64, mingw32 ui RStudio language (EN) collate English_United States.utf8 ctype English_United States.utf8 tz America/Chicago date 2024-05-13 rstudio 2023.12.1+402 Ocean Storm (desktop) pandoc NA

─ Packages ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── package version date (UTC) lib source backports 1.4.1 2021-12-13 [1] CRAN (R 4.3.1) BiocGenerics 0.48.1 2023-11-01 [1] Bioconductor BiocManager 1.30.22 2023-08-08 [1] CRAN (R 4.3.3) Biostrings 2.70.3 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3) bit 4.0.5 2022-11-15 [1] CRAN (R 4.3.2) bit64 4.0.5 2020-08-30 [1] CRAN (R 4.3.2) bitops 1.0-7 2021-04-24 [1] CRAN (R 4.3.1) boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3) broom 1.0.5 2023-06-09 [1] CRAN (R 4.3.2) cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2) cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2) codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3) colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2) crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2) data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2) devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3) digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2) dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2) ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2) fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2) farver 2.1.1 2022-07-06 [1] CRAN (R 4.3.2) fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2) foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2) fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2) fst 0.9.8 2022-02-08 [1] CRAN (R 4.3.3) fstcore 0.9.18 2023-12-02 [1] CRAN (R 4.3.3) gdsfmt 1.38.0 2023-10-24 [1] Bioconductor generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2) GenomeInfoDb 1.38.8 2024-03-15 [1] Bioconductor 3.18 (R 4.3.3) GenomeInfoDbData 1.2.11 2024-04-09 [1] Bioconductor GenomicRanges 1.54.1 2023-10-29 [1] Bioconductor ggplot2 3.5.0.9000 2024-04-11 [1] Github (tidyverse/ggplot2@1050f09) glmnet 4.1-8 2023-08-22 [1] CRAN (R 4.3.3) glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2) gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3) gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3) HardyWeinberg 1.7.8 2024-04-06 [1] CRAN (R 4.3.3) hms 1.1.3 2023-03-21 [1] CRAN (R 4.3.2) htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2) htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2) httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2) IRanges 2.36.0 2023-10-24 [1] Bioconductor iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2) jomo 2.7-6 2023-04-15 [1] CRAN (R 4.3.3) labeling 0.4.3 2023-08-29 [1] CRAN (R 4.3.1) later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2) lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2) lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2) lme4 1.1-35.3 2024-04-16 [1] CRAN (R 4.3.3) magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2) MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2) Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2) memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2) mice 3.16.0 2023-06-05 [1] CRAN (R 4.3.3) mime 0.12 2021-09-28 [1] CRAN (R 4.3.1) miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2) minqa 1.2.6 2023-09-11 [1] CRAN (R 4.3.2) mitml 0.4-5 2023-03-08 [1] CRAN (R 4.3.3) munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3) nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2) nloptr 2.0.3 2022-05-26 [1] CRAN (R 4.3.2) nnet 7.3-19 2023-05-03 [1] CRAN (R 4.3.2) pan 1.9 2023-08-21 [1] CRAN (R 4.3.3) pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2) pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3) pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2) pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2) plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2) profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2) promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2) purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2) R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2) radiator 1.3.0 2024-04-09 [1] Github (thierrygosselin/radiator@2eb1dfa) ragg 1.3.0 2024-03-13 [1] CRAN (R 4.3.3) Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2) RCurl 1.98-1.14 2024-01-09 [1] CRAN (R 4.3.2) readr 2.1.5 2024-01-10 [1] CRAN (R 4.3.2) remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3) rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2) rpart 4.1.23 2023-12-05 [1] CRAN (R 4.3.3) Rsolnp 1.16 2015-12-28 [1] CRAN (R 4.3.3) rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3) S4Vectors 0.40.2 2023-11-23 [1] Bioconductor scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2) SeqArray 1.42.4 2024-04-03 [1] Bioconductor 3.18 (R 4.3.3) sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2) shape 1.4.6.1 2024-02-23 [1] CRAN (R 4.3.2) shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3) SNPRelate 1.36.1 2024-02-27 [1] Bioconductor 3.18 (R 4.3.2) stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2) stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2) survival 3.5-8 2024-02-14 [1] CRAN (R 4.3.3) systemfonts 1.0.6 2024-03-07 [1] CRAN (R 4.3.3) textshaping 0.3.7 2023-10-09 [1] CRAN (R 4.3.2) tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2) tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2) tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2) truncnorm 1.0-9 2023-03-20 [1] CRAN (R 4.3.3) tzdb 0.4.0 2023-05-12 [1] CRAN (R 4.3.2) UpSetR 1.4.0 2019-05-22 [1] CRAN (R 4.3.3) urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2) usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3) utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2) vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2) vroom 1.6.5 2023-12-05 [1] CRAN (R 4.3.2) withr 3.0.0 2024-01-16 [1] CRAN (R 4.3.2) xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2) XVector 0.42.0 2023-10-24 [1] Bioconductor zlibbioc 1.48.2 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)

[1] C:/Program Files/R/R-4.3.2/library

result.csv rstrataesult.csv

thierrygosselin commented 5 months ago

Dear Laia, did you check the troubleshooting section ? I think this error is documented.

I'm unable to run the data you sent (even if the strata csv is converted to tsv)

laiajulianamu commented 5 months ago

Dear Thierry, Thank you so much for your response. Yes, I did check the the troubleshooting section and I added to my code the parallel.core =1. However, After that I am having another error

Step 1. Coverage visualization and helper table

Generating statistics Error in UseMethod("mutate") : no applicable method for 'mutate' applied to an object of class "list" In addition: There were 50 or more warnings (use warnings() to see the first 50) ✖ Writing files [61ms]

I already updated tidyverse following some suggestion from google butI am still traped in this point. Sadly I was not able to upload my data base because it has more than 178 MB. Thank you so muchchecked the troubleshooting section and added the parallel to my code, Laia

thierrygosselin commented 5 months ago

Send the real data by email I'll have a look

thierrygosselin commented 5 months ago

Don't share full dataset to the public

If not possible try making a smaller dataset that generate the same error so that I can reproduce

With my dataset I don't have any...

thierrygosselin commented 5 months ago

First test with your data works

test <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
Reading DArT file...                                                
Number of blacklisted samples: 0                                      
DArT SNP format: alleles absence/presence in 2 Rows
fstcore package v0.9.18
(OpenMP detected, using 56 threads)
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240514@1226.gds.rad                        

Number of chrom: 1
Number of locus: 45544
Number of SNPs: 76246
Number of strata: 6
Number of individuals: 479

Number of ind/strata:
PER = 47
MEX = 99
GAL = 216
COL = 32
GM = 11
ITW = 74

Number of duplicate id: 0

Computation time, overall: 102 sec
thierrygosselin commented 5 months ago

Using radiator::filter_rad I do get an error, I think it's link to using DArT SNP format other than count (which I no longer use, favouring counts). I'll have a look for a fix.

This should not take long

thierrygosselin commented 5 months ago

Ok so I see the problem, it's during the coverage filter, when using DArT data without counts, the coverage information is based on average and the dataset provided by DArT.

I don't filter coverage when using 1 row and 2 rows provided by DArT, for several reasons that were mentioned during the workshop you attended. Simplify: By the time I am ready to look at the dataset coverage, 100% of the time I have already discarded samples. The statistics provided by DArT becomes obviously biased because it's using the full dataset...

I don't like that so I prefer not to filter it and advertise the idea to 1. get the FQ files for good bookkeeping 2. get the count files...

I will still fix the issue not to throw an error

laiajulianamu commented 5 months ago

Dear Thierry,

Thank you so much. Yes, I understand that I should avoid the coverage filter, however, when I tried to exclude (using filter_coverage = FALSE), the code did not run exclueded it.

Unknowned arguments identified inside "...": filter_coverage

Thank you again

thierrygosselin commented 5 months ago

In the earlier days I was trying to accommodate all format, now I'm more picky. I had plan to re-integrate the coverage statistics generated by DArT for 1row and 2rows format but I slept on it and I really thing it's almost 100% of the time biased with the dataset that I have seen, for the single reason that samples will eventually get blacklisted (removed) and the stats becomes biased. Request DArT count file the next time. Now the function will bypass the coverage for those files.

Coverage statistics information for 2rows and 1row DArT format is ignored

radiator cannot generate coverage statistics from source
potential blacklisted individuals bias the statistics generated by DArT

request DArT count format from DArT

Coverate information is not available for this dataset, returning GDS...
thierrygosselin commented 5 months ago

Latest radiator version should fix your problem

thierrygosselin commented 5 months ago

For your info... For your dataset exploration here is the filters thresholds I used:

  1. reproducibility: I'm filtering based on the outlier statistics, it blacklist 5424 SNPs
  2. filter for common markers removes 469 SNPs
  3. blacklist samples based on missingness and outlier statistics, 32 samples are removed. At this step I don't remove samples based on heterozygosity. At the end I re-run monomorphic markers removal, this discard 3428 markers.
  4. Minor allele: I use the MAC stats and use 4 alternate alleles, 17282 SNPs are removed (you could also use MAF, but I prefer the count, MAC).
  5. filter genotyping rate based on 0.2 Max missing removes 16779 SNPs
  6. the SNP position on the read, I keep all (option 1, filter is off).
  7. filter based on SNP number on the locus. Based on your data and figures generated I do want to blacklist some locus, and I use my own threshold here. We will keep only locus with a Max of 2 SNPs. This result in blacklisting 2515 locus and 8604 SNPs
  8. filter based on short distance LD, I say yes, because the locus with 2 SNPs (about half the dataset) carries the same information and it bias stats. To filter those I'm using option 1 (based on Mac). So it keeps the SNP with max Mac which makes sense because more samples carries the info). 5049 SNPs are blacklisted. I don't filter for long distance LD with this de novo dataset.
  9. filter individuals based on heterozygosity. You really want to check the samples here because I think you might have 2 species mixed together or there was some wet lab problem (uneven tissue, too much DNA, mixed DNA, etc)... and the one that have really low heterozygosity is probably another specie if it's not correlated with missing data (small bubbles) and if the missing data is important it's usually poor DNA quality... Based on the thresholds (lower and upper) on the figures, 57 samples are removed. This sounds like a lot but if you re-run filter_rad without removing the samples with weird heterozygosity you end up creating false close-kin (it's the next filter that looks for duplicated samples, but provide the figure that shows distances). Removing those 57 samples really end up with a good dataset and highlight real relationships.

Removing

manhattan plot distance

Not removing manhattan plot distance

  1. filtering individuals based on duplicated genomes: I'm using the 0.25 threshold here. If you have remove the problematic samples in the previous filtering steps (heterozygosity), the figure shows 3 pairs of samples from the same population that are potential duplicates, check your lab note or naming, etc. Because they are from the same pop, you can keep 3 based on missingness (keeping the samples with less missing data). So this step, will remove an additional 3 samples.
  2. Hardy Weinberg Disequilibrium filtering depends on the analysis you will do after, this is why in the folder it keeps all the different dataset so you can check its impact e.g. in population discovery analysis. I used 3 pops as threshold to discard the markers (3 sampling sites/pops need to be in disequilibrium to discard the SNPs) and the mid p value chosen here is 0.001. This is a light filtering threshold that blacklist 770 SNPS.

At the end of this first filtering exploration you have 18 427 SNPs and 387 samples

Hope this helps, re-open an issue if you are still having problems

thierrygosselin commented 5 months ago

radiator and dartR approaches

In case you are wondering some of the differences between radiator and dartR...

radiator

Quickly if you want to test the mixed genomes (individual heterozygosity) or duplicated genomes (duplicate samples and close kin detection) without any filters:

data <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
 #103 sec on my computer
mix <- radiator::detect_mixed_genomes(data = data) 
#54 sec

individual heterozygosity boxplot individual heterozygosity manhattan plot

dup <- radiator::detect_duplicate_genomes(data = data)
#47 sec

manhattan plot distance violin plot distance

Using dartR

tictoc::tic()
data.gl <- dartR.base::gl.read.dart(filename = "SNPS1.csv", ind.metafile = "strata1_dartR.csv")
tictoc::toc()
# 167.62 sec elapsed
# longer to process and file is not saved...
tictoc::tic()
test.dartr.het <- dartR.base::gl.report.heterozygosity(x = data.gl, save2tmp = TRUE)
tictoc::toc()
# 104.738 sec elapsed

dartR_hetRplot

Conclusion for:

Hope this helps highlight the differences in the 2 packages

laiajulianamu commented 5 months ago

Dear Thierry,

It is awesome, thank you so much; I also ran dartR and am seeing the differences. Sadly, I am struggling with a new error with the radiator righ now; it sounds like it is related with SNPrate.

Computation time, overall: 241 sec ######################## completed detect_mixed_genomes ######################## ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240515@1021 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240515@1021.tsv File written: random.seed (306779)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", : There is no SNP! In addition: There were 50 or more warnings (use warnings() to see the first 50)

thierrygosselin commented 5 months ago

Could you post the exact command that you used, I've ran the analysis 4-5 times on your data without problems first try this:

data <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals
dup <- radiator::detect_duplicate_genomes(data = data)

does it work?

laiajulianamu commented 5 months ago

I am running the command below following all your instructions. However, I was not able to finish with the two last filters.

data <- radiator::filter_rad( data = "SNPS1.csv", strata = "stratayft.tsv", output = c("genind", "stockr", "genpop", "genlight"), parallel.core = 1 )

Below are the lines that you asked me to run: ###################### completed detect_duplicate_genomes ######################

data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv") Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows Generating genotypes and calibrating REF/ALT alleles... Number of markers recalibrated based on counts of allele: 11707 Generating GDS... File written: radiator_20240515@1113.gds.rad

Number of chrom: 1 Number of locus: 45544 Number of SNPs: 76246 Number of strata: 6 Number of individuals: 479

Number of ind/strata: PER = 47 MEX = 99 GAL = 216 COL = 32 GM = 11 ITW = 74

Number of duplicate id: 0

Computation time, overall: 90 sec

mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals ################################################################################ ######################## radiator::detect_mixed_genomes ######################## ################################################################################ Execution date@time: 20240515@1114 Folder created: 09_detect_mixed_genomes_20240515@1114 File written: radiator_detect_mixed_genomes_args_20240515@1114.tsv
Filters parameters file generated: filters_parameters_20240515@1114.tsv Error in dplyr::mutate(): ℹ In argument: MISSING_PROP = round(...). Caused by error in .DynamicClusterCall(): ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1113\radiator_20240515@1113.gds.rad'. The process cannot access the file because it is being used by another process. Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 10 sec ######################## completed detect_mixed_genomes ########################

dup <- radiator::detect_duplicate_genomes(data = data) ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240515@1115 Folder created: 10_detect_duplicate_genomes_20240515@1115 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240515@1115.tsv File written: random.seed (747612)
Filters parameters file generated: filters_parameters_20240515@1115.tsv Error in dplyr::mutate(): ℹ In argument: MISSING_PROP = round(...). Caused by error in .DynamicClusterCall(): ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1113\radiator_20240515@1113.gds.rad'. The process cannot access the file because it is being used by another process. Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 10 sec ###################### completed detect_duplicate_genomes ######################

thierrygosselin commented 5 months ago
  1. Start a new session of R and R studio (if using). The problem stems from using too many times the object I think. Start fresh.
  2. install the latest radiator
  3. try re-running the functions above
laiajulianamu commented 5 months ago

I restarted everything, but I am still having the same problem

data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv") Reading DArT file... Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows fstcore package v0.9.18 (OpenMP detected, using 16 threads) Generating genotypes and calibrating REF/ALT alleles... Number of markers recalibrated based on counts of allele: 11707 Generating GDS... File written: radiator_20240515@1528.gds.rad

Number of chrom: 1 Number of locus: 45544 Number of SNPs: 76246 Number of strata: 6 Number of individuals: 479

Number of ind/strata: PER = 47 MEX = 99 GAL = 216 COL = 32 GM = 11 ITW = 74

Number of duplicate id: 0

Computation time, overall: 118 sec Warning message: package ‘fstcore’ was built under R version 4.3.3

mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals ################################################################################ ######################## radiator::detect_mixed_genomes ######################## ################################################################################ Execution date@time: 20240515@1530 Folder created: 12_detect_mixed_genomes_20240515@1530 File written: radiator_detect_mixed_genomes_args_20240515@1530.tsv
Filters parameters file generated: filters_parameters_20240515@1530.tsv Error in dplyr::mutate(): ℹ In argument: MISSING_PROP = round(...). Caused by error in .DynamicClusterCall(): ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1528\radiator_20240515@1528.gds.rad'. The process cannot access the file because it is being used by another process. Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 11 sec ######################## completed detect_mixed_genomes ########################

dup <- radiator::detect_duplicate_genomes(data = data) ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240515@1530 Folder created: 13_detect_duplicate_genomes_20240515@1530 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240515@1530.tsv File written: random.seed (622500)
Filters parameters file generated: filters_parameters_20240515@1530.tsv Error in dplyr::mutate(): ℹ In argument: MISSING_PROP = round(...). Caused by error in .DynamicClusterCall(): ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1528\radiator_20240515@1528.gds.rad'. The process cannot access the file because it is being used by another process. Run rlang::last_trace() to see where the error occurred.

laiajulianamu commented 5 months ago

And I am still with the same error :( Computation time, overall: 200 sec ######################## completed detect_mixed_genomes ######################## ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240515@1552 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240515@1552.tsv File written: random.seed (891654)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", : There is no SNP! In addition: There were 50 or more warnings (use warnings() to see the first 50)

Computation time, overall: 1 sec ###################### completed detect_duplicate_genomes ######################

thierrygosselin commented 5 months ago

Write down below the output of this command: devtools::session_info()

laiajulianamu commented 5 months ago

devtools::session_info() ─ Session info ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── setting value version R version 4.3.2 (2023-10-31 ucrt) os Windows 11 x64 (build 22631) system x86_64, mingw32 ui RStudio language (EN) collate English_United States.utf8 ctype English_United States.utf8 tz America/Chicago date 2024-05-16 rstudio 2023.12.1+402 Ocean Storm (desktop) pandoc NA

─ Packages ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── package version date (UTC) lib source ade4 1.7-22 2023-02-06 [1] CRAN (R 4.3.3) adegenet 2.1.10 2023-01-26 [1] CRAN (R 4.3.3) ape 5.8 2024-04-11 [1] CRAN (R 4.3.3) BiocManager 1.30.23 2024-05-04 [1] CRAN (R 4.3.3) boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3) cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2) class 7.3-22 2023-05-03 [1] CRAN (R 4.3.2) classInt 0.4-10 2023-09-05 [1] CRAN (R 4.3.3) cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2) cluster 2.1.6 2023-12-01 [1] CRAN (R 4.3.3) codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3) colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2) crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2) dartR.base 0.88 2024-04-10 [1] Github (green-striped-gecko/dartR.base@c5ebf2a) dartR.data 1.0.4 2024-04-10 [1] Github (green-striped-gecko/dartR.data@4f93e38) dartRverse 0.51 2023-12-01 [1] CRAN (R 4.3.3) data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2) DBI 1.2.2 2024-02-16 [1] CRAN (R 4.3.2) devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3) digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2) doParallel 1.0.17 2022-02-07 [1] CRAN (R 4.3.2) dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2) e1071 1.7-14 2023-12-06 [1] CRAN (R 4.3.3) ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2) fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2) fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2) foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2) fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2) gdistance 1.6.4 2023-06-19 [1] CRAN (R 4.3.3) gdsfmt 1.39.3 2024-05-15 [1] Github (zhengxwen/gdsfmt@c15b23e) generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2) ggplot2 3.5.1.9000 2024-05-14 [1] Github (tidyverse/ggplot2@54cbb98) glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2) gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3) gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3) hierfstat 0.5-11 2022-05-05 [1] CRAN (R 4.3.3) htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2) htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2) httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2) igraph 2.0.3 2024-03-13 [1] CRAN (R 4.3.3) iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2) KernSmooth 2.23-22 2023-07-10 [1] CRAN (R 4.3.2) later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2) lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2) LEA 3.14.0 2023-10-24 [1] Bioconductor lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2) magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2) MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2) Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2) melfuR 1.1 2024-04-06 [1] Github (pygmyperch/melfuR@5d6bafa) memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2) mgcv 1.9-1 2023-12-21 [1] CRAN (R 4.3.3) mime 0.12 2021-09-28 [1] CRAN (R 4.3.1) miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2) munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3) nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2) pak 0.7.2 2024-03-17 [1] CRAN (R 4.3.3) patchwork 1.2.0 2024-01-08 [1] CRAN (R 4.3.3) pegas 1.3 2023-12-13 [1] CRAN (R 4.3.3) permute 0.9-7 2022-01-27 [1] CRAN (R 4.3.3) pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2) pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3) pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2) pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2) plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2) polysat 1.7-7 2022-08-23 [1] CRAN (R 4.3.3) poppr 2.9.6 2024-03-15 [1] CRAN (R 4.3.3) profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2) promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2) proxy 0.4-27 2022-06-09 [1] CRAN (R 4.3.3) purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2) R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2) radiator 1.3.3 2024-05-15 [1] Github (thierrygosselin/radiator@a2d63d5) raster 3.6-26 2023-10-14 [1] CRAN (R 4.3.3) Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2) remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3) reshape2 1.4.4 2020-04-09 [1] CRAN (R 4.3.2) rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2) rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3) scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2) seqinr 4.2-36 2023-12-08 [1] CRAN (R 4.3.3) sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2) sf 1.0-16 2024-03-24 [1] CRAN (R 4.3.3) shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3) SNPRelate 1.37.5 2024-05-15 [1] Github (zhengxwen/SNPRelate@0996a58) sp 2.1-4 2024-04-30 [1] CRAN (R 4.3.3) StAMPP 1.6.3 2021-08-08 [1] CRAN (R 4.3.3) stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2) stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2) terra 1.7-71 2024-01-31 [1] CRAN (R 4.3.3) tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2) tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2) tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2) units 0.8-5 2023-11-28 [1] CRAN (R 4.3.3) urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2) usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3) utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2) vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2) vegan 2.6-6 2024-05-14 [1] CRAN (R 4.3.2) xml2 1.3.6 2023-12-04 [1] CRAN (R 4.3.2) xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2)

[1] C:/Program Files/R/R-4.3.2/library

laiajulianamu commented 5 months ago

Dear Thierry,

I hope you are doing great. I wonder if you have updates about my last bug. I am still having the same problem, and I definitely want to use Radiator for my data analyses. Thank you so much, Laia

###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240531@1825 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240531@1825.tsv File written: random.seed (118851)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", : There is no SNP! In addition: There were 50 or more warnings (use warnings() to see the first 50)

Computation time, overall: 2 sec ###################### completed detect_duplicate_genomes ######################

Computation time, overall: 412 sec ############################# completed filter_rad #############################

thierrygosselin commented 4 months ago

Try the last version, re-open if problem still persist

laiajulianamu commented 4 months ago

Dear Thierry,

I am so sorry, but the problem persists. I tried with two different computers.

Computation time, overall: 21 sec ######################## completed detect_mixed_genomes ######################## ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240605@1634 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240605@1634.tsv File written: random.seed (16040)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", : There is no SNP! In addition: There were 50 or more warnings (use warnings() to see the first 50)

Computation time, overall: 1 sec ###################### completed detect_duplicate_genomes ######################

Computation time, overall: 307 sec ############################# completed filter_rad #############################

devtools::session_info() ─ Session info ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── setting value version R version 4.3.2 (2023-10-31 ucrt) os Windows 11 x64 (build 22631) system x86_64, mingw32 ui RStudio language (EN) collate English_United States.utf8 ctype English_United States.utf8 tz America/Chicago date 2024-06-05 rstudio 2024.04.1+748 Chocolate Cosmos (desktop) pandoc NA

─ Packages ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── package version date (UTC) lib source ade4 1.7-22 2023-02-06 [1] CRAN (R 4.3.3) adegenet 2.1.10 2023-01-26 [1] CRAN (R 4.3.3) ape 5.8 2024-04-11 [1] CRAN (R 4.3.3) backports 1.5.0 2024-05-23 [1] CRAN (R 4.3.3) BiocGenerics 0.48.1 2023-11-01 [1] Bioconductor BiocManager 1.30.23 2024-05-04 [1] CRAN (R 4.3.3) Biostrings 2.70.3 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3) bit 4.0.5 2022-11-15 [1] CRAN (R 4.3.2) bit64 4.0.5 2020-08-30 [1] CRAN (R 4.3.2) bitops 1.0-7 2021-04-24 [1] CRAN (R 4.3.1) boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3) broom 1.0.6 2024-05-17 [1] CRAN (R 4.3.3) cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2) class 7.3-22 2023-05-03 [1] CRAN (R 4.3.2) classInt 0.4-10 2023-09-05 [1] CRAN (R 4.3.3) cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2) cluster 2.1.6 2023-12-01 [1] CRAN (R 4.3.3) codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3) colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2) crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2) curl 5.2.1 2024-03-01 [1] CRAN (R 4.3.3) dartR.base 0.88 2024-04-10 [1] Github (green-striped-gecko/dartR.base@c5ebf2a) dartR.data 1.0.4 2024-04-10 [1] Github (green-striped-gecko/dartR.data@4f93e38) dartRverse 0.51 2023-12-01 [1] CRAN (R 4.3.3) data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2) DBI 1.2.3 2024-06-02 [1] CRAN (R 4.3.3) devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3) digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2) doParallel 1.0.17 2022-02-07 [1] CRAN (R 4.3.2) dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2) e1071 1.7-14 2023-12-06 [1] CRAN (R 4.3.3) ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2) fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2) farver 2.1.2 2024-05-13 [1] CRAN (R 4.3.3) fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2) foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2) fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2) fst 0.9.8 2022-02-08 [1] CRAN (R 4.3.3) fstcore 0.9.18 2023-12-02 [1] CRAN (R 4.3.3) gdistance 1.6.4 2023-06-19 [1] CRAN (R 4.3.3) gdsfmt 1.39.3 2024-05-15 [1] Github (zhengxwen/gdsfmt@c15b23e) generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2) GenomeInfoDb 1.38.8 2024-03-15 [1] Bioconductor 3.18 (R 4.3.3) GenomeInfoDbData 1.2.11 2024-04-09 [1] Bioconductor GenomicRanges 1.54.1 2023-10-29 [1] Bioconductor ggplot2 3.5.1 2024-04-23 [1] CRAN (R 4.3.3) glmnet 4.1-8 2023-08-22 [1] CRAN (R 4.3.3) glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2) gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3) gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3) HardyWeinberg 1.7.8 2024-04-06 [1] CRAN (R 4.3.3) hierfstat 0.5-11 2022-05-05 [1] CRAN (R 4.3.3) hms 1.1.3 2023-03-21 [1] CRAN (R 4.3.2) htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2) htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2) httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2) igraph 2.0.3 2024-03-13 [1] CRAN (R 4.3.3) IRanges 2.36.0 2023-10-24 [1] Bioconductor iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2) jomo 2.7-6 2023-04-15 [1] CRAN (R 4.3.3) KernSmooth 2.23-22 2023-07-10 [1] CRAN (R 4.3.2) labeling 0.4.3 2023-08-29 [1] CRAN (R 4.3.1) later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2) lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2) LEA 3.14.0 2023-10-24 [1] Bioconductor lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2) lme4 1.1-35.3 2024-04-16 [1] CRAN (R 4.3.3) magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2) MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2) Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2) melfuR 1.1 2024-04-06 [1] Github (pygmyperch/melfuR@5d6bafa) memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2) mgcv 1.9-1 2023-12-21 [1] CRAN (R 4.3.3) mice 3.16.0 2023-06-05 [1] CRAN (R 4.3.3) mime 0.12 2021-09-28 [1] CRAN (R 4.3.1) miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2) minqa 1.2.7 2024-05-20 [1] CRAN (R 4.3.3) mitml 0.4-5 2023-03-08 [1] CRAN (R 4.3.3) munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3) nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2) nloptr 2.0.3 2022-05-26 [1] CRAN (R 4.3.2) nnet 7.3-19 2023-05-03 [1] CRAN (R 4.3.2) pak 0.7.2 2024-03-17 [1] CRAN (R 4.3.3) pan 1.9 2023-08-21 [1] CRAN (R 4.3.3) patchwork 1.2.0 2024-01-08 [1] CRAN (R 4.3.3) pegas 1.3 2023-12-13 [1] CRAN (R 4.3.3) permute 0.9-7 2022-01-27 [1] CRAN (R 4.3.3) pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2) pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3) pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2) pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2) plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2) polysat 1.7-7 2022-08-23 [1] CRAN (R 4.3.3) poppr 2.9.6 2024-03-15 [1] CRAN (R 4.3.3) profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2) promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2) proxy 0.4-27 2022-06-09 [1] CRAN (R 4.3.3) purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2) R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2) radiator * 1.3.4 2024-06-05 [1] Github (thierrygosselin/radiator@3a8cb4f) ragg 1.3.2 2024-05-15 [1] CRAN (R 4.3.3) raster 3.6-26 2023-10-14 [1] CRAN (R 4.3.3) Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2) RCurl 1.98-1.14 2024-01-09 [1] CRAN (R 4.3.2) readr 2.1.5 2024-01-10 [1] CRAN (R 4.3.2) remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3) reshape2 1.4.4 2020-04-09 [1] CRAN (R 4.3.2) rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2) rpart 4.1.23 2023-12-05 [1] CRAN (R 4.3.3) Rsolnp 1.16 2015-12-28 [1] CRAN (R 4.3.3) rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3) S4Vectors 0.40.2 2023-11-23 [1] Bioconductor scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2) SeqArray 1.42.4 2024-04-03 [1] Bioconductor 3.18 (R 4.3.3) seqinr 4.2-36 2023-12-08 [1] CRAN (R 4.3.3) sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2) sf 1.0-16 2024-03-24 [1] CRAN (R 4.3.3) shape 1.4.6.1 2024-02-23 [1] CRAN (R 4.3.2) shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3) SNPRelate 1.37.5 2024-05-15 [1] Github (zhengxwen/SNPRelate@0996a58) sp 2.1-4 2024-04-30 [1] CRAN (R 4.3.3) StAMPP 1.6.3 2021-08-08 [1] CRAN (R 4.3.3) stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2) stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2) survival 3.6-4 2024-04-24 [1] CRAN (R 4.3.3) systemfonts 1.1.0 2024-05-15 [1] CRAN (R 4.3.3) terra 1.7-71 2024-01-31 [1] CRAN (R 4.3.3) textshaping 0.4.0 2024-05-24 [1] CRAN (R 4.3.3) tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2) tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2) tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2) truncnorm 1.0-9 2023-03-20 [1] CRAN (R 4.3.3) tzdb 0.4.0 2023-05-12 [1] CRAN (R 4.3.2) units 0.8-5 2023-11-28 [1] CRAN (R 4.3.3) UpSetR 1.4.0 2019-05-22 [1] CRAN (R 4.3.3) urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2) usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3) utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2) vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2) vegan 2.6-6 2024-05-14 [1] CRAN (R 4.3.2) viridisLite 0.4.2 2023-05-02 [1] CRAN (R 4.3.2) vroom 1.6.5 2023-12-05 [1] CRAN (R 4.3.2) withr 3.0.0 2024-01-16 [1] CRAN (R 4.3.2) xml2 1.3.6 2023-12-04 [1] CRAN (R 4.3.2) xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2) XVector 0.42.0 2023-10-24 [1] Bioconductor zlibbioc 1.48.2 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)

The code that you sent to me: data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv") Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows Generating genotypes and calibrating REF/ALT alleles... Number of markers recalibrated based on counts of allele: 11707 Generating GDS... File written: radiator_20240605@1636.gds.rad

Number of chrom: 1 Number of locus: 45544 Number of SNPs: 76246 Number of strata: 6 Number of individuals: 479

Number of ind/strata: PER = 47 MEX = 99 GAL = 216 COL = 32 GM = 11 ITW = 74

Number of duplicate id: 0

Computation time, overall: 112 sec

mix <- radiator::detect_mixed_genomes(data = data) (parallel.core = 1) # don't remove individuals ################################################################################ ######################## radiator::detect_mixed_genomes ######################## ################################################################################ Execution date@time: 20240605@1639 Folder created: 01_detect_mixed_genomes_20240605@1639 File written: radiator_detect_mixed_genomes_args_20240605@1639.tsv
Filters parameters file generated: filters_parameters_20240605@1639.tsv Error in dplyr::mutate(): ℹ In argument: MISSING_PROP = round(...). Caused by error in .DynamicClusterCall(): ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240605@1636\radiator_20240605@1636.gds.rad'. The process cannot access the file because it is being used by another process. Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 10 sec ######################## completed detect_mixed_genomes ########################

dup <- radiator::detect_duplicate_genomes(data = data) ################################################################################ ###################### radiator::detect_duplicate_genomes ###################### ################################################################################ Execution date@time: 20240605@1639 Folder created: 02_detect_duplicate_genomes_20240605@1639 Function call and arguments stored in a file File written: radiator_detect_duplicate_genomes_args_20240605@1639.tsv File written: random.seed (135945)
Filters parameters file generated: filters_parameters_20240605@1639.tsv Error in dplyr::mutate(): ℹ In argument: MISSING_PROP = round(...). Caused by error in .DynamicClusterCall(): ! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240605@1636\radiator_20240605@1636.gds.rad'. The process cannot access the file because it is being used by another process. Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 8 sec