dosshra closed this 1 month ago
Hi,
It seems something is off, especially since the number of rows you're getting is exactly half of what you expect. Could you please share two files with me, kmers_w_strand_path and kmers_to_use.shareness, so I can investigate further and try to understand what's going on?
Best regards, Yoav
Please see the attached files. I see that the problem begins in list_kmers_found_in_multiple_samples. I took the 52 samples that were not included in the matrix and ran list_kmers_found_in_multiple_samples again on that subset, and I see that the file kmers_to_use.stats.both contains only 27 lines and 27 columns. So it seems that there is no problem with the individual k-mer files, but something splits the list by about half. It seems that the script is skipping every other line.
Thank you
kmers_to_use.shareness.txt
kmers_w_strand_path.txt
As mentioned in the manual, the kmers_w_strand_path.txt file should follow this format: each line should contain the full path to the k-mers list file, followed by a tab and the individual name. It seems that the individual names are missing in your file, so the program is interpreting every second file as the name of the previous one.
Please close the issue if it solved the problem.
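For anyone hitting the same symptom, here is a minimal sketch of a sanity check for the list file before running list_kmers_found_in_multiple_samples. It only tests the rule described above (one path, a tab, then the individual name on each line). The function name find_malformed_lines and the example paths are hypothetical and are not part of kmersGWAS.

```python
def find_malformed_lines(lines):
    """Return (line_number, reason) for every line that is not 'path<TAB>name'."""
    problems = []
    for number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\n")
        if not line.strip():
            problems.append((number, "empty line"))
            continue
        fields = line.split("\t")
        if len(fields) != 2:
            problems.append(
                (number, f"expected 2 tab-separated fields, found {len(fields)}")
            )
        elif not fields[0].strip() or not fields[1].strip():
            problems.append((number, "empty path or individual name"))
    return problems


# Hypothetical example: the second line is missing the tab and the name.
example = [
    "/path/to/sample1/kmers_with_strand\tsample1\n",
    "/path/to/sample2/kmers_with_strand\n",
]
for number, reason in find_malformed_lines(example):
    print(f"line {number}: {reason}")
```

To check a real file, pass an open file handle instead of the example list, for instance find_malformed_lines(open("kmers_w_strand_path")). An empty result means every line has exactly two tab-separated fields; it does not verify that the paths exist.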
Thank you. The issue was resolved.
Hello, I am running these commands:
~/kmergwas/bin/list_kmers_found_in_multiple_samples -l kmers_w_strand_path -k 31 --mac 5 -p 0.2 -o kmers_to_use
~/kmergwas/bin/build_kmers_table -l kmers_w_strand_path -k 31 -a kmers_to_use -o kmer4wild_dom
~/kmergwas/bin/emma_kinship_kmers -t kmer4wild_dom -k 31 --maf 0.05 > kmers_table.kinship
I get these lines at the tail of the build_kmers_table log file:
And this at the end of the emma_kinship_kmers log: