replicate / cog-llama-template

LLaMA Cog template
Apache License 2.0
307 stars 52 forks source link

Engines #38

Closed daanelson closed 1 year ago

daanelson commented 1 year ago

WIP: Refactor of llama inference into separate engines which can serve predictions with different underlying inference code.

This has:

This does not yet have:

daanelson commented 1 year ago

one thing that's coming out of this, at least imo, is that a lot of the config we're setting really lives at the engine level - that's especially true if we have special requirements for cog.yaml per engine, etc.

technillogue commented 1 year ago

are we still going to merge this?

technillogue commented 1 year ago

replaced by #47