li-plus/chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
MIT License
2.92k stars, 334 forks
Fix nan by rescheduling attention scaling #322
Closed by li-plus 3 months ago