Closed digisomni closed 3 weeks ago
Hi @digisomni, thanks for reporting the error! Could you provide the complete error message and a script that reproduces it, so we can better identify the problem? I could not reproduce this error on my device, but I am not certain whether it is related to the environment.
🐛 Bug
I believe that when the final token of a request completes, or is about to complete, the entire app crashes with:
libc++abi: terminating due to uncaught exception of type std::length_error: vector
To Reproduce
Steps to reproduce the behavior:
Expected behavior
MLC should not crash. When it does, it takes my whole Python application down with it.
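Because `libc++abi: terminating ...` aborts the process at the native level, a Python `try`/`except` around the call cannot catch it. One possible stopgap, sketched below, is to run the work in a separate interpreter so that only the child dies and the parent can record the full stderr for the bug report. This is a minimal sketch and not an MLC feature: `CHILD_CODE` is a hypothetical stand-in that simulates the abort with `os.abort()`, and `run_isolated` is an illustrative helper name.

```python
import subprocess
import sys

# Hypothetical stand-in for the worker that would call into MLC. It only
# simulates a native abort; it does not load or run any model.
CHILD_CODE = (
    "import os, sys; "
    "sys.stderr.write('simulated native abort\\n'); "
    "sys.stderr.flush(); "
    "os.abort()"
)


def run_isolated(code: str, timeout: float = 60.0) -> subprocess.CompletedProcess:
    """Run Python code in a separate interpreter and capture stdout/stderr."""
    return subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        timeout=timeout,
    )


result = run_isolated(CHILD_CODE)
if result.returncode != 0:
    # The parent survives and can log the complete error message.
    print(f"worker exited abnormally, return code {result.returncode}")
    print(result.stderr.strip())
```

The captured `result.stderr` is also a convenient way to collect the complete error message that the maintainers asked for.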
Environment
How you installed MLC-LLM (conda, source): python -m pip install --pre -U -f https://mlc.ai/wheels mlc-llm-nightly mlc-ai-nightly
How you installed TVM-Unity (pip, source): Was already installed with the mlc-nightly pip install
TVM Unity Hash Tag (python -c "import tvm; print('\n'.join(f'{k}: {v}' for k, v in tvm.support.libinfo().items()))", applicable if you compile models): N/A
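To make the environment details easier to gather in one go, the one-liner above can be expanded into a small script. This is a sketch, assuming only that `tvm.support.libinfo()` behaves as in the command quoted in the template. The `collect_environment` helper and the key names it produces are illustrative, and the script degrades gracefully when `tvm` cannot be imported.

```python
import platform


def collect_environment() -> dict:
    """Gather basic environment details for a bug report."""
    info = {
        "python": platform.python_version(),
        "os": platform.platform(),
        "machine": platform.machine(),
    }
    try:
        # Same call as the template's one-liner; skipped if tvm is missing.
        import tvm

        for key, value in tvm.support.libinfo().items():
            info[f"tvm.{key}"] = value
    except Exception as exc:  # ImportError, or a failure while loading the library
        info["tvm"] = f"unavailable ({exc.__class__.__name__})"
    return info


env = collect_environment()
print("\n".join(f"{k}: {v}" for k, v in env.items()))
```

The output can be pasted into the Environment section as-is.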