-
In the most recent version (Version: f2.0.1v1.10.1-previous-304-g394da019, Commit hash: 394da01959ae09acca361dc2be0e559ca26829d4),
I get the following error and also no longer see a lora-dir argument…
-
- [ ] [self-speculative-decoding/README.md at main · dilab-zju/self-speculative-decoding](https://github.com/dilab-zju/self-speculative-decoding/blob/main/README.md?plain=1)
# Self-Speculative Decod…
-
Many-Shot In-Context Learning
https://arxiv.org/abs/2404.11018
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
https://arxiv.org/abs/2404.14219
https://github.com/apple…
-
### Describe the bug
I tried to initialize an inference session with a Uint8Array representation of a very simple .ort model (see screenshot of the .ort file viewed in Netron below), but it gave me t…
-
### Describe the bug
With HTTPS turned on, a button click handler in the code returns a gr.components.File. After waiting about 5s and clicking the download link, the browser shows a download error.
### Have you searched existing issues?…
-
1. I am trying to run model inference on multiple GPUs, but I get an error about unauthorised access to the GPU. Do I need to configure the repo accordingly, or how should I use CUDA_VISIBLE_DEVICES?
2. Can I infer with imag…
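
For the `CUDA_VISIBLE_DEVICES` part of question 1, a minimal Python sketch follows. It assumes only the standard CUDA behaviour that this environment variable limits which GPUs a process can see, and that it has to be set before the framework initialises CUDA. The helper name `restrict_visible_gpus` is hypothetical and not part of the repo in question.

```python
import os


def restrict_visible_gpus(gpu_ids, env=None):
    """Set CUDA_VISIBLE_DEVICES to a comma-separated list of GPU indices.

    `env` defaults to os.environ; passing a plain dict makes the helper
    easy to test without touching the real environment.
    """
    env = os.environ if env is None else env
    env["CUDA_VISIBLE_DEVICES"] = ",".join(str(i) for i in gpu_ids)
    return env["CUDA_VISIBLE_DEVICES"]


if __name__ == "__main__":
    # Call this before importing or initialising any CUDA-using library,
    # otherwise the setting may be ignored by an already-created context.
    print(restrict_visible_gpus([0, 1]))
```

Inside the process, the selected devices are then renumbered from 0, so physical GPUs 2 and 3 selected this way appear as devices 0 and 1.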
-
**Is your feature request related to a problem? Please describe.**
This feature concerns how to deploy a model with LoRA support.
**Describe the solution you'd like**
I have a UNet model…
-
I'm trying to use DSPy with Oracle Cloud's Gen AI platform. The API is documented here: https://docs.oracle.com/en-us/iaas/api/#/en/generative-ai-inference/20231130/GenerateTextResult/GenerateText. Below is the co…
-
I have a problem running CUDA on the GPU. When I run the command:
`python inference_codeformer.py --bg_upsampler realesrgan --face_upsample -w 0.7 --input_path G:\AI\CodeFormer\results\test1.jpg`
…
-
## Description
The [SEP] token used in the input to DJL's Question Answering model "distilbert" is returned as part of the extracted answer to the question.
Shouldn't the answer be extracted f…
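
To illustrate the behaviour being asked about, here is a minimal Python sketch of a post-processing step that drops special tokens from a predicted answer span. It is not DJL's implementation (DJL is a Java library). The function name `extract_answer`, the `SPECIAL_TOKENS` set, and the assumption of BERT-style WordPiece tokens with a `##` continuation prefix are all illustrative.

```python
# Special tokens assumed for a BERT-style tokenizer (an assumption).
SPECIAL_TOKENS = {"[CLS]", "[SEP]", "[PAD]"}


def extract_answer(tokens, start, end):
    """Join tokens[start..end] into text, skipping special tokens."""
    span = tokens[start:end + 1]
    kept = [t for t in span if t not in SPECIAL_TOKENS]

    words = []
    for tok in kept:
        if tok.startswith("##") and words:
            # WordPiece continuation: glue onto the previous word.
            words[-1] += tok[2:]
        else:
            words.append(tok)
    return " ".join(words)


if __name__ == "__main__":
    tokens = ["[CLS]", "who", "?", "[SEP]", "bob", "smith", "[SEP]"]
    # The predicted span (3..5) starts on the separator token.
    print(extract_answer(tokens, 3, 5))
```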