-
add support for llama.ccp for local Ai inferencing
-
Do you have any ideas to support CuBlas to increase inference speed by offloading some layers to GPU
-
1) I just wanted to ask if you guys are planing to add MPT GPU support as well somet ime? I see its supported for LLAMA models.
2) Real reason for the ticket, I am having issue getting it to really u…
-
it seems it do not work offline, when I put my wifi in plane mode, I receive this error
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceed…
-
### First, confirm
- [X] I have read the [instruction](https://github.com/Gourieff/sd-webui-reactor/blob/main/README.md) carefully
- [X] I have searched the existing issues
- [X] I have updated the e…
-
## Expected Behavior
I would expect a relatively low barrier-to-entry to see functionality working.
## Current Behavior
The app come with no models, which is understandable. However, it's loosely…
-
### Describe the bug
installation breaks because fastapi and elevenlabs require incompatible versions of pydantic.
### Is there an existing issue for this?
- [X] I have searched the existing issues…
-
### Describe the bug
$ ./update_linux.sh
$ ./start_linux.sh
2023-10-06 18:04:19 INFO:Loading settings from settings.yaml...
2023-10-06 18:04:19 INFO:Loading the extension "gallery"...
Starting s…
-
Hi,
We have an old model trained with ktrain (0.21.4). We have the `.h5` file and the `.preproc` file. However, we have lost a reference to the data that we used to train it(perhaps even the training…
-
Hi and thanks for your awesome backend.
I was just wondering if you can add those options to your generate method:
repeat_penalty
repeat_last_n
They are standard GPT parameters and your backend …