Model visualization enhancements needed

lzamparo / embedding

Learning semantic embeddings for TF binding preferences directly from sequence

Other

0 stars 0 forks source link

To interpret the learned codes it would help to have the following visualizations:

distance matrix clustering for all TFs. Do probes from like families cluster together?
Visualization in 3D for probes from 3 specific factors (do we see separation??) Maybe from distinct families...
Look at nearest k-mers to center of mass for each factor: K-mers which are within a very small radius of the centre of mass of each factor.

Maybe something like exemplar-based clustering could work in the embedding space?

Further down the line, for a given probe, can we decode along a probe to find important Kmers (that might resemble motifs??)

lzamparo / embedding