--hpo takes a path to HPO JSON
--phenopackets takes a path to a folder with Phenopacket JSON files. The code can expect that phenopackets are the only files in the folder
--output where to write the table with disease ranks
The CLI can take other options as necessary (e.g. a table with precomputed term IC values?)
I think, we can make our life simpler if we make sumsim bench create a table with ranks of all phenopackets:
We need a CLI entry point in the
sumsim
library.Setting up the entry point is described in setuptools documentation.
We can use
The entry point should look something like this:
The CLI can take other options as necessary (e.g. a table with precomputed term IC values?)
I think, we can make our life simpler if we make
sumsim bench
create a table with ranks of all phenopackets:Diseases for a patient are sorted such that the most likely disease is at the top. The table is a stack of sub-tables with diseases per patient
I think CSV is the best format since it is has out of the box support by
pandas
.