nebiolabs / domainator

A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.

enhancements to matrix_report.py to be better for SSNs #13

Open seanrjohnson opened 3 months ago

seanrjohnson commented 3 months ago

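matrix_report.py currently reports every data point in the matrix, including the diagonal. For symmetric matrices intended for defining the edges of SSNs (sequence similarity networks), it would be nice to filter the data so that the diagonal is ignored and only the lower or upper triangle is considered. In a symmetric matrix the diagonal holds only self-comparisons, and every off-diagonal pair appears twice.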
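As a rough illustration of the filtering described here, the following is a minimal sketch, not the current behavior of matrix_report.py. The function name `upper_triangle_values` and the toy `scores` matrix are hypothetical, and the matrix is assumed to be square, symmetric, and stored as a list of lists.

```python
def upper_triangle_values(matrix):
    """Return the values strictly above the diagonal of a square matrix.

    Hypothetical helper for illustration only; it is not part of domainator.
    """
    n = len(matrix)
    if any(len(row) != n for row in matrix):
        raise ValueError("expected a square matrix")
    # j starts at i + 1, so the diagonal (i == j) and the mirrored
    # lower triangle (j < i) are both skipped.
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


# Toy symmetric score matrix (made-up values).
scores = [
    [100.0, 80.0, 10.0],
    [80.0, 100.0, 30.0],
    [10.0, 30.0, 100.0],
]

edge_scores = upper_triangle_values(scores)
print(edge_scores)  # [80.0, 10.0, 30.0]
```

For a NumPy array `m`, `m[np.triu_indices(n, k=1)]` selects the same entries, where `n` is the number of rows.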

It would also be nice to have more SSN-specific plots, such as the number of clusters that would result from various score cutoffs and the average/median connectivity at those cutoffs. Perhaps there are other metrics that would be informative.
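One way such metrics could be computed is sketched below. It rests on several assumptions: higher scores mean more similar sequences, an edge exists when the score is at or above the cutoff, a "cluster" is a connected component (unconnected sequences count as clusters of one), and "connectivity" is the node degree. The function name `ssn_stats` and the toy matrix are hypothetical and not part of domainator.

```python
from statistics import mean, median


def ssn_stats(scores, cutoff):
    """Cluster count and degree summary for edges with score >= cutoff.

    Hypothetical helper for illustration only. `scores` is assumed to be
    a square, symmetric list of lists.
    """
    n = len(scores)
    parent = list(range(n))  # union-find forest, one tree per cluster

    def find(i):
        # Follow parent links to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    degree = [0] * n
    # Only the upper triangle is visited, so each pair is counted once
    # and the diagonal is ignored.
    for i in range(n):
        for j in range(i + 1, n):
            if scores[i][j] >= cutoff:
                degree[i] += 1
                degree[j] += 1
                parent[find(i)] = find(j)  # merge the two clusters

    return {
        "cutoff": cutoff,
        "clusters": len({find(i) for i in range(n)}),
        "mean_degree": mean(degree),
        "median_degree": median(degree),
    }


# Toy symmetric score matrix (made-up values).
scores = [
    [100, 80, 10, 5],
    [80, 100, 30, 5],
    [10, 30, 100, 5],
    [5, 5, 5, 100],
]

for cutoff in (50, 20, 5):
    print(ssn_stats(scores, cutoff))
```

With this toy matrix the cluster count drops from 3 to 2 to 1 as the cutoff is lowered from 50 to 20 to 5. Plotting these values against the cutoff would give the kind of curve described above.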