Closed: janetzki closed this issue 1 year ago
Goal
We can evaluate dictionary creation (DC) on at least one low-resource language (LRL), not only on high- and medium-resource languages (HRL/MRL). Motivation: to evaluate DC, create one LRL dictionary with F1 > 30%.
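To make the F1 > 30% target concrete, here is a minimal sketch of how a created dictionary could be scored against a gold dictionary. It assumes micro-averaged F1 over (source word, target word) pairs, which may differ from the metric actually used in this project. The function name `dictionary_f1` and the toy entries are hypothetical and only serve as illustration.

```python
def dictionary_f1(predicted, gold):
    """Return (precision, recall, F1) over (source, target) word pairs.

    Both arguments map a source word to a set of target words.
    Assumption: micro-averaging over pairs; the project may score differently.
    """
    pred_pairs = {(src, tgt) for src, tgts in predicted.items() for tgt in tgts}
    gold_pairs = {(src, tgt) for src, tgts in gold.items() for tgt in tgts}

    true_positives = len(pred_pairs & gold_pairs)
    precision = true_positives / len(pred_pairs) if pred_pairs else 0.0
    recall = true_positives / len(gold_pairs) if gold_pairs else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data with placeholder tokens (not real dictionary entries).
toy_predicted = {"src_a": {"tgt_1", "tgt_2"}, "src_b": {"tgt_3"}}
toy_gold = {"src_a": {"tgt_1"}, "src_b": {"tgt_3", "tgt_4"}}

precision, recall, f1 = dictionary_f1(toy_predicted, toy_gold)
meets_goal = f1 > 0.30  # the threshold named in the goal above
```

With this scoring, a dictionary would meet the goal once `f1` exceeds 0.30 on the LRL gold data.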
Tasks
todos in comments
Results
Observation
The MMR decreases as the number of selected questions increases. Apparently, getting more questions right is "more difficult".