Current evaluate.py in the asr_eval subproject is reading the text content from "gold" transcript file directly, but as we've seen, the "gold" files are quite noisy and need some clean-up (https://github.com/clamsproject/clams-utils/issues/2) before being used for asr evaluation.
New Feature Summary
Current evaluate.py in the asr_eval subproject is reading the text content from "gold" transcript file directly, but as we've seen, the "gold" files are quite noisy and need some clean-up (https://github.com/clamsproject/clams-utils/issues/2) before being used for asr evaluation.
Since we have a new cleaner implementation (https://github.com/clamsproject/clams-utils/https://github.com/clamsproject/clams-utils/pull/3), it's time to update the eval.py to use the cleaned copies of the transcript files.
Related
No response
Alternatives
No response
Additional context
No response