openppl-public / ppl.llm.serving

Apache License 2.0
122 stars 13 forks source link

Syr/dev #28

Closed Vincent-syr closed 10 months ago

Vincent-syr commented 10 months ago

add offline inference and update README refactor server, decoupling server, tokenizer, model, backend, config, and some code format