QwenLM / qwen.cpp

C++ implementation of Qwen-LM

💡 [Question] - How can the QwenCPP Python binding support BLAS CPU acceleration? #64

Open micronetboy opened 6 months ago

micronetboy commented 6 months ago

Start Date

No response

Implementation PR

No response

Reference Issues

No response

Summary

How can the QwenCPP Python binding support BLAS CPU acceleration?

Basic Example

Drawbacks

Unresolved questions

No response

micronetboy commented 6 months ago

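How do I configure the Python binding to use OpenBLAS for acceleration on the CPU? Thanks.

How do I configure the Python binding to use cuBLAS for acceleration on the GPU? Thanks.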

micronetboy commented 6 months ago

llama-cpp-python has the following option. Does qwen-cpp have a similar one?

On Linux and Mac you set the CMAKE_ARGS like this:

CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS" pip install llama-cpp-python
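For anyone who wants to script this kind of install, here is a minimal Python sketch of the same pattern: it puts the CMake flags into the `CMAKE_ARGS` environment variable and then calls pip. The helper name `build_pip_install` is hypothetical, and the flags are the llama-cpp-python ones quoted above. Whether qwen-cpp honors equivalent flags is exactly the open question in this issue, so the sketch does not assume any qwen-cpp flag names.

```python
import os
import shlex
import subprocess
import sys


def build_pip_install(package, cmake_flags, base_env=None, run=False):
    """Build (and optionally run) a pip install with CMAKE_ARGS set.

    Hypothetical helper for illustration; not part of any library.
    Returns the command list and the environment dict that would be used.
    """
    # Copy the environment so the caller's os.environ is not modified.
    env = dict(os.environ if base_env is None else base_env)
    # Build backends that read CMAKE_ARGS expect a space-separated flag string.
    env["CMAKE_ARGS"] = " ".join(cmake_flags)
    command = [sys.executable, "-m", "pip", "install", package]
    if run:
        subprocess.run(command, env=env, check=True)
    return command, env


if __name__ == "__main__":
    # Flags copied from the llama-cpp-python command quoted in this thread.
    cmd, env = build_pip_install(
        "llama-cpp-python",
        ["-DLLAMA_BLAS=ON", "-DLLAMA_BLAS_VENDOR=OpenBLAS"],
    )
    # Dry run: print the shell-equivalent command instead of installing.
    print("CMAKE_ARGS=" + shlex.quote(env["CMAKE_ARGS"]), " ".join(cmd))
```

With `run=False` (the default) nothing is installed; the helper only returns what it would execute, which makes it easy to check the flags before starting a long source build.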