steineggerlab / foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.
https://foldseek.com
GNU General Public License v3.0
695 stars 92 forks source link

How does the process to calculate alnTMscore #221

Open jov131 opened 6 months ago

jov131 commented 6 months ago

The output 'ttmscore, alntmscore' seems always the same in my dataset.

query   target  qlen    tlen    alnlen  qtmscore    ttmscore    alntmscore
test3   test3   374 374 374 1.000E+00   1.000E+00   1.000E+00
test3   test7   374 385 161 1.909E-01   1.859E-01   1.859E-01
test3   test2   374 421 307 1.509E-01   1.361E-01   1.361E-01
test3   test5   374 111 58  5.909E-02   1.856E-01   1.856E-01
test3   test6   374 555 106 1.389E-01   9.837E-02   9.837E-02
test5   test5   111 111 111 1.000E+00   1.000E+00   1.000E+00
test5   test3   111 374 58  1.856E-01   5.909E-02   5.909E-02
test5   test6   111 555 25  1.638E-01   3.892E-02   3.892E-02
test5   test1   111 186 21  8.786E-02   6.173E-02   6.173E-02
test2   test2   421 421 421 1.000E+00   1.000E+00   1.000E+00
test2   test6   421 555 94  1.697E-01   1.313E-01   1.313E-01
test2   test1   421 186 107 1.132E-01   2.453E-01   2.453E-01
test2   test4   421 385 31  7.175E-02   7.833E-02   7.833E-02
test2   test3   421 374 104 1.325E-01   1.471E-01   1.471E-01
test2   test7   421 385 62  1.105E-01   1.200E-01   1.200E-01
test2   test5   421 111 91  4.794E-02   1.266E-01   1.266E-01
test6   test6   555 555 555 1.000E+00   1.000E+00   1.000E+00
test6   test2   555 421 94  1.313E-01   1.697E-01   1.697E-01
test6   test1   555 186 54  8.732E-02   2.518E-01   2.518E-01
test6   test7   555 385 92  1.030E-01   1.431E-01   1.431E-01
test6   test4   555 385 48  7.624E-02   1.070E-01   1.070E-01
test6   test3   555 374 127 9.940E-02   1.399E-01   1.399E-01
test6   test5   555 111 19  3.221E-02   1.580E-01   1.580E-01
test7   test7   385 385 385 1.000E+00   1.000E+00   1.000E+00
test7   test3   385 374 160 1.849E-01   1.899E-01   1.899E-01
test7   test6   385 555 150 7.445E-02   5.358E-02   5.358E-02
test7   test4   385 385 203 7.652E-02   7.652E-02   7.652E-02
test7   test2   385 421 234 1.045E-01   9.876E-02   9.876E-02
test7   test1   385 186 164 6.405E-02   9.706E-02   9.706E-02
test7   test5   385 111 28  6.846E-02   2.200E-01   2.200E-01
test1   test1   186 186 186 1.000E+00   1.000E+00   1.000E+00
test1   test2   186 421 173 2.451E-01   1.132E-01   1.132E-01
test1   test6   186 555 54  2.518E-01   8.732E-02   8.732E-02
test1   test4   186 385 36  1.848E-01   9.085E-02   9.085E-02
test1   test3   186 374 48  6.865E-02   3.691E-02   3.691E-02
test1   test7   186 385 68  1.789E-01   9.483E-02   9.483E-02
test4   test4   385 385 385 1.000E+00   1.000E+00   1.000E+00
test4   test1   385 186 36  9.085E-02   1.848E-01   1.848E-01
test4   test6   385 555 39  9.821E-02   6.859E-02   6.859E-02
test4   test2   385 421 36  9.087E-02   8.324E-02   8.324E-02
test4   test7   385 385 302 1.086E-01   1.086E-01   1.086E-01
test4   test3   385 374 308 1.013E-01   1.039E-01   1.039E-01

I think the probability that TMscore is identical when normalized by alnLen or by tLen is extremely low. I wonder whether there is some bugs or my understanding to TMscore is incorrect.