[Fix] Fix problem that ktransformers cannot offload whole layer in cpu - Githubissues

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Apache License 2.0

741 stars 39 forks source link

[Fix] Fix problem that ktransformers cannot offload whole layer in cpu #62

Closed Azure-Tang closed 2 months ago

Azure-Tang commented 2 months ago

Fix bug that ktransformers cannot offload whole layer in cpu.
Update DeepseekV2‘s multi gpu yaml examples to evenly allocate layers.
Update Docker file.