Oneflow-Inc / one-codegeex

Apache License 2.0

add group_matmul_bias and fused linear optimization #5

Closed: BBuf closed this 1 year ago

BBuf commented 1 year ago

main: N_token_prompt: 127, total generation time: 26.495075638173148 s, # tokens: 897, 0.029537431034752672 s/token

pr: N_token_prompt: 127, total generation time: 25.905965376878157 s, # tokens: 897, 0.0288806748906111 s/token
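To put the two runs side by side, here is a minimal sketch that recomputes the per-token latency from the totals above and derives the relative gain, which works out to roughly a 2.2% reduction in generation time. The function name `compare_runs` and its return keys are hypothetical, not part of this repo, and the sketch assumes both runs generated the same 897 tokens from the same 127-token prompt, as reported.

```python
def compare_runs(main_total_s: float, pr_total_s: float, n_tokens: int) -> dict:
    """Summarize two generation runs that produced the same number of tokens.

    Hypothetical helper for illustration only; times are in seconds.
    """
    main_rate = main_total_s / n_tokens  # seconds per token on main
    pr_rate = pr_total_s / n_tokens      # seconds per token on the PR branch
    return {
        "main_s_per_token": main_rate,
        "pr_s_per_token": pr_rate,
        # > 1.0 means the PR branch is faster than main
        "speedup": main_total_s / pr_total_s,
        # share of main's generation time saved by the PR, in percent
        "time_reduction_pct": (main_total_s - pr_total_s) / main_total_s * 100,
    }


# Numbers copied from the benchmark comment above.
result = compare_runs(
    main_total_s=26.495075638173148,
    pr_total_s=25.905965376878157,
    n_tokens=897,
)

for key, value in result.items():
    print(f"{key}: {value:.6f}")
```

A single run per branch does not show run-to-run variance, so a gain of this size is best confirmed by repeating the measurement a few times.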