knausb / vcfR

Tools to work with variant call format files

s3read_using lasts forever reading a VCF file #167

Closed HediaTnani closed 4 years ago

HediaTnani commented 4 years ago

Hi,

Is there a way to speed up reading a VCF file from the 3kricegenome public S3 bucket?

Could this process be parallelized?

And how about reading many VCFs at the same time?

Thanks a lot.

load packages

library(aws.s3)
library(vcfR)

s3read_using(FUN = read.vcfR, bucket = '3kricegenome', object = "9311/B183.snp.vcf.gz")

session info for your system

sessionInfo()
R version 4.0.2 (2020-06-22)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 20.04.1 LTS
Hardware: i7, 8 GB RAM

knausb commented 4 years ago

Hi @HediaTnani ,

It sounds to me like you're trying to read in 3,000 rice genomes, which is a tremendous amount of data. I feel that one of R's strengths is how interactive it is, but R was not really designed for high performance; it's a slow language. I've improved this a lot in vcfR by using Rcpp, but it will never compete with a compiled-code solution. If you have a really large amount of data, as you appear to have, I would suggest using vcfR to prototype your analysis on a subset of the data, then using a compiled language to process the entire dataset. The software VCFtools is a great option.
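For example, here's a rough sketch of prototyping on a subset. It uses read.vcfR's nrows argument to limit how many variants are parsed, and the bucket and object names are the ones from your example; the 10,000-row cutoff is just an arbitrary choice for prototyping.

library(aws.s3)
library(vcfR)

# Parse only the first 10,000 variants of the VCF body for prototyping;
# s3read_using downloads the object to a temporary file and applies FUN to it.
vcf_sub <- s3read_using(
  FUN = function(f) read.vcfR(f, nrows = 1e4),
  bucket = "3kricegenome",
  object = "9311/B183.snp.vcf.gz"
)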

https://vcftools.github.io/documentation.html

Note that they offer both compiled modules and interpreted (Perl) modules. The compiled modules should perform much better than an interpreted language such as Perl or R.

Good luck! Brian