Closed DaniyarM closed 3 years ago
Have you checked the results of your model using multiple (2,3) heads? Does this lead to better results or does it not make sense with yours model?
We tried our model with multiple heads and performance do not improve.
Thank you!
Have you checked the results of your model using multiple (2,3) heads? Does this lead to better results or does it not make sense with yours model?