stefan-it / turkish-bert

Turkish BERT/DistilBERT, ELECTRA and ConvBERT models
482 stars 42 forks source link

Evaluation Methodology #37

Open Nuri-Tas opened 7 months ago

Nuri-Tas commented 7 months ago

F1 scores are reported for the evaluation, however I'd like to know if you used macro or weighted F1 scores for downstream tasks (such as for NER). Would it also be possible to learn hyperparameters you set for finetuning, like max sequence length?