openppl-public / ppl.llm.serving

Apache License 2.0
122 stars 13 forks source link

[refactor][fix] use barrier to fix decoder and work thread sync, refa… #23

Closed Vincent-syr closed 10 months ago

Vincent-syr commented 10 months ago

use barrier to fix decoder and work thread sync, refactor tokenizer layout