statisticalbiotechnology / maracluster

Matthew The's implementation of MaRaCluster
Apache License 2.0

Interpreting pvalue tree #16

Closed (ArminGuntram closed this issue 2 years ago)

ArminGuntram commented 5 years ago

Hi,

What are the columns in the output files '.pvalue_tree.tsv' and 'overlap.pvalue_tree.tsv'?

From the help doc I assume: "File containing the clustering tree result as a list of merged scannrs with corresponding p-value"

Also, is there a way to access the clustering matrix or write it to a file?

Thanks, -Armin

MatthewThe commented 5 years ago

Each of the rows represents a link in the hierarchical clustering tree:

<file_idx_1> <scannr_1> <file_idx_2> <scannr_2> <p-value>
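For anyone who wants to load these rows in a script, here is a minimal Python sketch of a reader for that five-column layout. It assumes the file is tab-separated with no header line, and that the file indices and scan numbers are integers. The `TreeLink` and `parse_tree_links` names are hypothetical helpers, not part of MaRaCluster, and the sample rows are invented values rather than real output. The thread does not say what scale the p-value column uses, so the sketch only parses it as a float and does not interpret it.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class TreeLink(NamedTuple):
    """One link in the clustering tree (hypothetical helper type)."""
    file_idx_1: int
    scannr_1: int
    file_idx_2: int
    scannr_2: int
    p_value: float


def parse_tree_links(lines: Iterable[str]) -> Iterator[TreeLink]:
    """Yield one TreeLink per five-column, tab-separated row.

    Assumption: no header line. Rows that do not have exactly
    five fields (for example blank lines) are skipped.
    """
    for row in csv.reader(lines, delimiter="\t"):
        if len(row) != 5:
            continue
        f1, s1, f2, s2, p = row
        yield TreeLink(int(f1), int(s1), int(f2), int(s2), float(p))


# Invented sample rows, only to show the column order.
sample = "0\t1523\t1\t2047\t1e-12\n0\t88\t0\t91\t2.5e-07\n"
links = list(parse_tree_links(io.StringIO(sample)))

for link in links:
    print(link)
```

To read a real file, pass an open file handle instead of the `io.StringIO` sample, e.g. `with open(path, newline="") as fh: links = list(parse_tree_links(fh))`.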

The distance matrix cannot be written to a file, as it is consumed "on the fly".

MatthewThe commented 2 years ago

Added to the FAQ (https://github.com/statisticalbiotechnology/maracluster/wiki/FAQ)