scverse / scirpy

A scanpy extension to analyse single-cell TCR and BCR data.
https://scirpy.scverse.org/en/latest/
BSD 3-Clause "New" or "Revised" License

Thoughts on adding motif results to amino acid similarity? #530

Open n-hal opened 1 month ago

n-hal commented 1 month ago

When clustering TCRs on amino-acid similarity, it might be nice to return a string that encapsulates the conserved sequence of each cluster, with a wildcard character at variable positions (maybe %D:N, for example, suggesting that substitutions between D:N in the BLOSUM matrix would be tolerated in this group).
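To make the idea concrete, here is a minimal sketch in plain Python. It is not part of scirpy, and everything in it is an assumption for illustration: the function name `motif_string` is hypothetical, the `(D:N)` wildcard notation is just one possible rendering, and it only handles clusters whose sequences all have the same length. It also only reports which residues were observed at each position and does not consult a BLOSUM matrix.

```python
def motif_string(sequences):
    """Collapse equal-length sequences into a single motif string.

    Conserved positions are emitted as the residue itself; variable
    positions are emitted as the observed residues joined by ':' inside
    parentheses, e.g. '(D:N)'. The notation is hypothetical.
    """
    if not sequences:
        raise ValueError("need at least one sequence")
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("this sketch only handles equal-length sequences")

    parts = []
    # zip(*sequences) iterates over alignment columns, one position at a time.
    for column in zip(*sequences):
        residues = sorted(set(column))
        if len(residues) == 1:
            parts.append(residues[0])  # fully conserved position
        else:
            parts.append("(" + ":".join(residues) + ")")  # variable position
    return "".join(parts)


# Made-up example cluster, not real data.
cluster = ["CASSDGYEQYF", "CASSNGYEQYF", "CASSDGYEQYF"]
print(motif_string(cluster))  # CASS(D:N)GYEQYF
```

A real implementation would probably need to deal with sequences of different lengths (e.g. via an alignment) and could use the substitution matrix to decide which variable positions count as tolerated.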

If this is of any interest I would be happy to contribute to it.

grst commented 1 month ago

Hi @n-hal,

In principle this is of interest. Could you elaborate on what that string would look like?

There's a long-standing request to add sequence logos (#12), which might go in the same direction?