TharinduDR / TransQuest

Transformer based translation quality estimation
Apache License 2.0

Smaller Models for MonoTransQuest possible? #37

Closed jbgruenwald closed 2 years ago

jbgruenwald commented 2 years ago

Hi,

I wanted to ask whether you have tried using smaller models than XLM-R-large for MonoTransQuest? I tried XLM-R-base, which works fine and is faster than the large version, but I would also like to try even smaller distilled models.

I found that in other settings, the multilingual versions of MiniLM are a very good and fast alternative to XLM-R. But when I try to train mMiniLM6 or mMiniLM12 for TransQuest, they just produce random scores (correlation around 0), so these models don't seem to help here. Did you make similar observations, and/or do you have a hint for me?

Btw: I used the XLM-R class for MiniLM; I guess no special one is needed, since training worked without errors...
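A quick way to confirm the "random scores" symptom is to compute the Pearson correlation between the model's predictions and the gold quality scores yourself. The helper below is a minimal, stdlib-only sketch (the sample data is made up for illustration, not from an actual TransQuest run); a value near 0 means the predictions carry essentially no signal about the gold scores.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical gold QE scores and two sets of predictions:
gold = [0.1, 0.4, 0.35, 0.8, 0.95]
good_preds = [0.15, 0.38, 0.4, 0.75, 0.9]   # tracks gold: correlation near 1
print(round(pearson(gold, good_preds), 3))
```

If a trained model scores near 1 on such a check with XLM-R-base but near 0 with the MiniLM variants, that points to the training itself failing silently (e.g. a mismatched architecture class) rather than to an evaluation bug.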

stale[bot] commented 2 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.