Open DidaDidaDidaD opened 2 years ago
为什么转换后的HDF5模型,推理时间反而比Hugging Face慢?原本0.24妙推理一个句子,转换模型后反而到了0.33
Maybe your GPU doesn't support tensorcore for fp16, you can try to build LightSeq with fp32 mode: ENABLE_FP32=1 pip3 install -e $PROJECT_DI
为什么转换后的HDF5模型,推理时间反而比Hugging Face慢?原本0.24妙推理一个句子,转换模型后反而到了0.33