wm-pxel / langchain-testbench

A system for testing chains
MIT License

CTransformers LLM #65

Open — opened by acruikshank-wm 11 months ago

acruikshank-wm commented 11 months ago

Description

@vince-westmonroe has been playing around with locally hosted LLMs and LangChain. It turns out that LangChain has built-in support for the CTransformers package, which will download models from HuggingFace Hub and run them locally. This includes the quantized Llama-2 models that are state of the art among open-source models. It would be really nice to have this available in TestBench.

Acceptance

  1. We have a new CTransformers LLM option in the LLM interface. This option allows you to specify a model from HuggingFace Hub and a model type, in addition to the normal hyperparameters.
  2. We have LLMSpec models on the front end and back end to configure, transport, save, and instantiate a CTransformers LangChain LLM within chains.
  3. Documentation on how to use this option in the README (or additional documentation page). I believe the testbench server will need to be run outside the docker container for this to work.
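Acceptance criterion 2 could be sketched roughly like this — a minimal, hypothetical spec model for CTransformers (the class and field names here are illustrative assumptions, not TestBench's actual LLMSpec schema), mapped to the kwargs that LangChain's `CTransformers` wrapper accepts:

```python
from dataclasses import dataclass

# Hypothetical spec model; names are illustrative, not the actual
# TestBench LLMSpec schema.
@dataclass
class CTransformersSpec:
    llm_type: str = "ctransformers"
    model: str = "TheBloke/Llama-2-7B-GGML"  # HuggingFace Hub repo id (example)
    model_type: str = "llama"                # architecture hint for ctransformers
    temperature: float = 0.7
    max_new_tokens: int = 256

    def to_llm_kwargs(self) -> dict:
        """Build kwargs for LangChain's CTransformers wrapper,
        i.e. roughly CTransformers(**spec.to_llm_kwargs())."""
        return {
            "model": self.model,
            "model_type": self.model_type,
            "config": {
                "temperature": self.temperature,
                "max_new_tokens": self.max_new_tokens,
            },
        }

spec = CTransformersSpec()
print(spec.to_llm_kwargs()["model_type"])  # llama
```

Because the spec is a plain serializable record, the same shape can travel front end → API → back end, be saved with a chain, and only be turned into a live LangChain LLM at instantiation time.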
vince-westmonroe commented 11 months ago

Interaction.tsx's runOnce function is hardwired to make an API call rather than having the option to interact with LLMs via alternative methods (e.g., via a local ggml .bin file, or via HuggingFace Hub using CTransformers). An obvious way to interact with an LLM conditionally is to pass the function an LLM type. In this scenario, the default interaction calls the OpenAI API; if the LLM type is HuggingFace Hub, it calls a new function that uses CTransformers.
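The dispatch-by-LLM-type idea above could be sketched as follows (a minimal sketch with hypothetical handler and registry names, not TestBench's actual code — the real runOnce lives in TypeScript on the front end):

```python
# Hypothetical dispatch-by-llm-type sketch; names are illustrative.
def call_openai(prompt: str) -> str:
    # Placeholder for the existing OpenAI API call path.
    return f"openai:{prompt}"

def call_ctransformers(prompt: str) -> str:
    # Placeholder for a local CTransformers-backed call
    # (e.g. a quantized model pulled from HuggingFace Hub).
    return f"ctransformers:{prompt}"

HANDLERS = {
    "openai": call_openai,
    "huggingface_hub": call_ctransformers,
}

def run_once(prompt: str, llm_type: str = "openai") -> str:
    # Unknown types fall back to the OpenAI path, matching the
    # default-interaction behavior suggested above.
    return HANDLERS.get(llm_type, call_openai)(prompt)

print(run_once("hi"))                     # openai:hi
print(run_once("hi", "huggingface_hub"))  # ctransformers:hi
```

A registry keyed by LLM type keeps runOnce itself unchanged as new back ends (local ggml files, other Hub-hosted models) are added — each is one more entry in the table.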