DeNederlandscheBank / nqm

A Transformer-based Machine for answering questions on insurance companies
MIT License

Add Model Evaluation Process #14

Closed jm-glowienke closed 3 years ago

jm-glowienke commented 3 years ago

A first model has been trained, but it cannot yet be evaluated on the test set(s).

What needs to be done?

SPARQL decoder

Test sets

Evaluation metrics
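As a rough illustration of what one evaluation metric could look like once decoded SPARQL queries and a test set are available, here is a minimal sketch of exact-match accuracy. The function names `normalize_query` and `exact_match_accuracy` are hypothetical and not part of this repository, and the whitespace normalization is an assumption about what should count as "the same query".

```python
import re


def normalize_query(query: str) -> str:
    """Collapse runs of whitespace so formatting differences are not counted as errors."""
    return re.sub(r"\s+", " ", query).strip()


def exact_match_accuracy(predictions, references):
    """Fraction of predicted queries identical to their reference after normalization.

    Hypothetical helper: both arguments are lists of SPARQL strings in the same order.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    matches = sum(
        normalize_query(pred) == normalize_query(ref)
        for pred, ref in zip(predictions, references)
    )
    return matches / len(references)
```

Exact match is strict: two queries that return the same results but differ in variable names or clause order are counted as a miss, so it is probably best read alongside a softer score such as BLEU.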

jm-glowienke commented 3 years ago

fairseq-train --eval-bleu \
    --eval-bleu-args '{"beam": 5, "max_len_a": 1.2, "max_len_b": 10}' \
    --eval-bleu-detok moses \
    --eval-bleu-remove-bpe \
    --eval-bleu-print-samples \
    --best-checkpoint-metric bleu --maximize-best-checkpoint-metric
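For readers unfamiliar with the `max_len_a` and `max_len_b` values in `--eval-bleu-args`: fairseq limits the generated length to roughly `max_len_a * x + max_len_b`, where `x` is the source length in tokens. The sketch below only reproduces that arithmetic; the function name is hypothetical, and it leaves out any further cap from the model's maximum positions.

```python
def max_generation_length(src_len: int, max_len_a: float = 1.2, max_len_b: int = 10) -> int:
    """Upper bound on generated tokens for a source of `src_len` tokens.

    Hypothetical helper mirroring the a*x + b rule; the defaults are the
    values used in the command above, not fairseq's own defaults.
    """
    return int(max_len_a * src_len + max_len_b)
```

With these settings a short question still gets at least 10 tokens of headroom, which matters when the SPARQL output is longer than the natural-language input.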

jm-glowienke commented 3 years ago

Quotation marks:

EDIT 14-04: This has been fixed by stripping the quotation marks that come attached to the names.
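A minimal sketch of this kind of fix, assuming the names arrive as strings wrapped in single or double quotes. The helper name `strip_quotes` is hypothetical, and where in the pipeline the actual stripping happens is not stated in this thread.

```python
def strip_quotes(name: str) -> str:
    """Remove surrounding whitespace and leading/trailing quote characters.

    Hypothetical helper: quotes inside the name are left untouched.
    """
    return name.strip().strip("\"'")
```

Only the outer characters are removed, so an apostrophe inside a name survives.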