baichuan-inc / Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.
https://huggingface.co/baichuan-inc/baichuan-7B
Apache License 2.0
5.67k stars 506 forks source link

想问一下在A800上测试的吞吐量,换算到推理速度的话有多少tokens/s? #138

Open HJT9328 opened 9 months ago

HJT9328 commented 9 months ago

Required prerequisites

Questions

7B模型实现了A800 上单卡吞吐的情况下实现了 70tokens/s 比较怀疑,

Checklist