parklab / NGSCheckMate

Software program for checking sample matching for NGS data
MIT License
126 stars 48 forks source link

missing the last sample in output_corr_matrix.txt #28

Open zhujack opened 4 years ago

zhujack commented 4 years ago

Hi,

Thanks for this robust tool. I found the last sample in my output_corr_matrix.txt is always missing and the same as in the dendrogram.

It looks adding the last sample to the temp in vaf_ncm.py will fix the problem:

temp.append([keyList[j],keyList[j]]) ## around line 265

When NGSCheckMate_all.txt is written, adding the following will remove the last line:

for i in range(0,len(samples)-1 ): ## around line 334

Also, the method of "ward.D" seems to work better for clustering (around line 382).

If the above makes sense, could you update your code?

Thanks, Jack