PygmalionAI / aphrodite-engine

PygmalionAI's large-scale inference engine
https://pygmalion.chat
GNU Affero General Public License v3.0

feat: async tokenization #374

Closed AlpinDale closed 2 months ago

AlpinDale commented 2 months ago

This PR moves tokenization into a separate process so that tokenizing incoming requests no longer blocks the event loop.
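The general idea can be sketched as follows. This is only a minimal illustration of offloading CPU-bound tokenization from an asyncio event loop to a worker process using the standard library; it is not the code from this PR. The names `toy_tokenize`, `tokenize_async`, and `tokenize_many` are hypothetical, and `toy_tokenize` is a stand-in for a real tokenizer.

```python
import asyncio
import concurrent.futures
from typing import List


def toy_tokenize(text: str) -> List[int]:
    """Hypothetical stand-in for a real tokenizer: one integer (the word length) per word."""
    return [len(word) for word in text.split()]


async def tokenize_async(executor: concurrent.futures.Executor, text: str) -> List[int]:
    """Run the CPU-bound tokenizer in the executor and await the result.

    While the worker is busy, the event loop stays free to serve other requests.
    """
    loop = asyncio.get_running_loop()
    return await loop.run_in_executor(executor, toy_tokenize, text)


async def tokenize_many(
    executor: concurrent.futures.Executor, prompts: List[str]
) -> List[List[int]]:
    """Tokenize several prompts concurrently, preserving input order."""
    results = await asyncio.gather(*(tokenize_async(executor, p) for p in prompts))
    return list(results)


def main() -> None:
    # Assumption: a small process pool; a real engine would size this to its workload.
    with concurrent.futures.ProcessPoolExecutor(max_workers=2) as pool:
        prompts = ["hello world", "async tokenization demo"]
        print(asyncio.run(tokenize_many(pool, prompts)))


if __name__ == "__main__":
    main()
```

The function passed to a process pool must be picklable, which is why `toy_tokenize` is defined at module level. Passing the executor in as a parameter also lets a thread pool be substituted, for example in tests.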