Closed sanchit-gandhi closed 1 year ago
Gently pinging @lopez86 :)
hi @sanchit-gandhi, thanks for reporting this. In 0.4.0 to 0.5.0 there were a couple bugs that were fixed that could have an affect on scoring. It looks like this is a sort of contrived short test case, so I'm not surprised that there's a noticeable difference in scores. I think for more realistic inputs you should see results that are very similar, but not necessarily identical. If there are large differences in outputs on realistic cases, then there would be a problem. My expectation would be that in most cases, the final text is more or less the same with a small difference in scoring. See https://github.com/kensho-technologies/pyctcdecode/pull/96 https://github.com/kensho-technologies/pyctcdecode/pull/98
Hey @lopez86, thanks very much for getting back to me here. Indeed, this test case is quite contrived in order that it 1) runs quickly and 2) accentuates the numerical differences obtained by upgrading to v0.5.0. Good to know that these differences should be less of a problem for more realistic use cases. I'll observe the scores for larger models and more realistic inputs and report back here if there are large numerical differences 🙌 Thanks for highlighting the PR's that incorporated the bug fixes, much appreciated!
Going to close this as the differences are insignificant for real-world use cases. Thanks for your help here @lopez86
Hey pyctcdecode team 👋
Thanks for your awesome work on this library! Loving the easy integration with HF Transformers 🤗
Upgrading from PyPI version 0.4.0 to 0.5.0 yields quite different results with
BeamSearchDecoderCTC
. This is currently causing a failing assertion test on Transformers: https://github.com/huggingface/transformers/pull/21226Is this difference expected? We can update the test if so, but such differences between versions ring alarm bells for a silent regression!
Code snippet to repro:
With v0.4.0:
With v0.5.0:
We observe that the logit/lm scores are significantly different (outside the diff we could attribute to numerical precision).