Open NTNguyen13 opened 4 years ago
Hi @NTNguyen13
You are looking for: https://scikit-allel.readthedocs.io/en/stable/io.html
If your VCF contains multiple chromosomes, you will need to supply the region=??
argument to select a single chromosome.
Thanks for your question, but generally, this issue tracker is for potential bugs. User queries should be addressed to https://groups.google.com/g/scikit-allel
Hi, I'm currently calculating Fst using scikit-allel module.
I tried to use a pandas dataframe with format (n_variants, n_samples), each cell is a list of genotype, like this:
but scikit allele does not accept this format, and return the error:
Please advice me on how to process the right format for this function. At this moment I extract the genotype from VCF files, then I split the genotype by '|' and converting it to
int
, I wonder if there are native methods to read vcf to genotype array exists