li-plus / chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
MIT License
2.92k stars, 334 forks

Fix nan by rescheduling attention scaling #322

Closed (li-plus closed this 3 months ago)