alexandrainst / coral

Danish ASR and TTS models associated with the CoRal project.
MIT License
9 stars 1 forks source link

Unpack abbreviations during evaluation #96

Open saattrupdan opened 2 months ago

saattrupdan commented 2 months ago

When evaluating ASR models, we should unpack all common abbreviations from both the ground truth and the model's output, to avoid unfair measurement.

This includes (at least) the following:

We should be careful not to mistakenly replace instances of these when they're not abbreviations, however - they need to be separated by word boundaries at least.