Open stianale opened 3 weeks ago
Hi @stianale ,
Thank you for raising this concern, and we appreciate your feedback!
You're absolutely right to point out the apparent discrepancy between high ipTM and low pLDDT. We've observed similar behavior in our model and believe it stems from the fact that the pTM (and ipTM) predictions tend to be "over-confident." As shown in Figure 8 of our technical report, the majority of the "chain pair ipTM" values are clustered around or above 0.4, with very few falling in the 0-0.4 range.
Figure 8: N represents the number of samples.
We hypothesize that this issue arises from the difference between the training stages of the PAE head and the other confidence heads: the PAE head, which computes pTM, is trained in the final stage only. We're actively working on refining the training process to mitigate this over-confidence issue.
In light of this, we consider pTM as a more reliable "relative" measure for comparing different samples rather than an "absolute" accuracy measure. We can use pTM when selecting a sample from many generated ones, but the exact scores are not entirely accurate.
Thanks again for your thoughtful comment!
Congrats with an impressive work
I wonder a bit about my results for my protein-dsDNA complex, which are of very high ipTM, but suprisingly low pLDDT. How does this even make sense? I would think that they are at least somewhat related measures.