Testing torch bindings and code doesn't work on large models.
Models are converted using a converter version against current master.
The issue is not occuring while using 1b5 and 3b models.
The self.output after running interop.forward method stays the same(nAn).
On the other hand, state is being changed. So there is some problem with output setting with CPP code.
Attached a jupyter notebook to reproduce, but with .md extension. So make sure to rename it back to .ipynb.
(GH doesn't allow uploading ipynb for some reason)
untitled_1.md
Testing torch bindings and code doesn't work on large models.
Models are converted using a converter version against current master. The issue is not occuring while using 1b5 and 3b models.
The
self.output
after runninginterop.forward
method stays the same(nAn). On the other hand,state
is being changed. So there is some problem withoutput
setting with CPP code.Attached a jupyter notebook to reproduce, but with
.md
extension. So make sure to rename it back to.ipynb
. (GH doesn't allow uploading ipynb for some reason) untitled_1.md