-
**Description**
New cache options have been added a while ago. There are now Q6 and Q8 options which I don't think have been added here. I think it'd be useful for people who can use other options …
-
find a way to make it persistent. e.g. apply a patch after u get https://github.com/oobabooga/text-generation-webui.git in dockerfile to change the default dir for settings.yaml to eg /extensions
i…
-
```
*** Error loading script: langchainapi.py
Traceback (most recent call last):
File "/home/julien/Projects/cloned/stable-diffusion-webui/modules/scripts.py", line 469, in load_scripts
…
-
Will you add support for oobabooga's text-generation-webui? An llm initialization for post requests and a few patterns might be sufficient. I've been trying to do it, but I've had to try to figure out…
-
I'm getting this error with the latest ooba pull.
```
File "P:\ai\llms\oobabooga\text-generation-webui\extensions\sd_api_pictures_tag_injection\script.py", line 216, in create_suffix
if char…
-
Several services have started to make the OpenAI API standard their own. Not only if you (for some unknown reason) want to use Azure OpenAI Service, but most notably Oobabooga has recently migrated th…
-
Hi, I started test simulation using `base_the_ville_isabella_maria_klaus`, and for 40 seconds of simulation it spent ~2.6$. Are this expenditure rates sound right? If I'm correct, it's somewhere aroun…
-
Description
A Optimum-NVIDIA is the first Hugging Face inference library to benefit from the new float8 format supported on NVIDIA Ada Lovelace and Hopper architectures. FP8, in addition to the adv…
-
I think text completion would be a great feature. I have been messing around with with other text completion programs (open WebUI playground, Oobabooga), and I think having that feature would be a gr…
-
Hello,
With ROCm 5.5, 2.1.0.dev20230502+rocm5.4.2, Triton 2.0.0.post1, pytorch-triton-rocm 2.0.2
Running oobabooga text webui loading a quantized model.
`python: /project/lib/Dialect/TritonGP…