Open malteos opened 1 year ago
#MODEL=xlm-roberta-base
#MODEL=bert-base-multilingual-cased
MODELS_DIR=${DATASETS_DIR}/huggingface_transformers/pytorch
# xnli
python src/finetune_eval_harness/main.py --tasks xnli_en --model_name_or_path ${MODELS_DIR}/bert-base-multilingual-cased \
--max_train_samples 10 --max_eval_samples 10 \
--output_dir ./output/verify --results_log_path ./output/verify/metrics --report_to none --overwrite_cache
# pawsx
python src/finetune_eval_harness/main.py --tasks pawsx_en --model_name_or_path ${MODELS_DIR}/bert-base-multilingual-cased \
--max_train_samples 10 --max_eval_samples 10 \
--output_dir ./output/verify --results_log_path ./output/verify/metrics --report_to none --overwrite_cache
Model/tokenizer support
Languages: en, de, fr, it, es
Tasks: Verify scores with reported scores in the literature
Verify with
bert-base-multilingual-cased
,xlm-roberta-base
[x] XNLI: en, de, fr, es (missing: it)
[ ]
XQUAD: en, de, es (missing: it, fr)no train splitt[x] PAWSX: en, de, es, fr (missing: it)
[x] XStance: de, fr, it (missing: en, es)
[ ] UDPOS: en, de, fr, it (missing: es)
[ ] XTREME MLQA:
[x] XTREME PANX NER: en, de, es, fr, it
[x] multi_eurlex: en, de, fr, it, es