Improvement Description
calculate number of shared features between each pair of samples in a feature table.
Output similarity matrix of number of shared features per pair? (violates hollow 2d matrix requirement for skbio.DistanceMatrix)?
Would need to define new semantic type: SimilarityMatrix
If we want to go the SimilarityMatrix route: skbio.diversity.beta_diversity accepts lambda expressions and callables as metric, so we can define our own similarity metrics without touching skbio.
Also output lists of features that are shared/not-shared between pairs of samples — so a visualizer might be appropriate. We may be looking at a pipeline here, because having a similarity matrix that can be consumed by other methods (e.g., in q2-longitudinal) would be useful.
Improvement Description calculate number of shared features between each pair of samples in a feature table.
Output similarity matrix of number of shared features per pair? (violates hollow 2d matrix requirement for
skbio.DistanceMatrix
)?Would need to define new semantic type:
SimilarityMatrix
If we want to go the
SimilarityMatrix
route:skbio.diversity.beta_diversity
accepts lambda expressions and callables asmetric
, so we can define our own similarity metrics without touchingskbio
.Also output lists of features that are shared/not-shared between pairs of samples — so a visualizer might be appropriate. We may be looking at a pipeline here, because having a similarity matrix that can be consumed by other methods (e.g., in q2-longitudinal) would be useful.