-
Failed to load model: No LM Runtime found for format 'safetensors!
Model: Phi-3.5-vision-instruct-gguf
![image](https://github.com/user-attachments/assets/0f9b1a82-c260-45bb-a286-9118dcefb33c)
…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
`def create_piazza_index(json_file_path, index_folder, levels_back=None, collapse_length…
-
Hi, I'm having some issues running the SERAC methed. I took a closer look at the https://github.com/zjunlp/EasyEdit/issues/261, but the issue is still not resolved
I tried `JackFram/llama-160m`, but…
-
Hi all,
I have an Expo application that I created from scratch `npx create-expo-app@latest ./ --template` and added a function to touch LLama model with this.
``` javascript
import Replicate fro…
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expe…
-
### Feature Description
We can add an argument (for example, `--context-shift`, `--no-context-shift`) to enable/disable context shift.
If disabled:
- Requests bigger than context window will re…
-
**Describe the issue**
I want to get .tflite model. I have an app which can deploy tensorflow lite models, while it only accepts .tflite file. I followed the instructions [here](https://huggingface.c…
-
Hi thanks for the package! I want to play with LoRA on llama3.1 8B, but the tutorials https://docs.unsloth.ai/get-started/unsloth-notebooks seems only to discuss with qlora. Thus I wonder what to do f…
-
### Anything you want to discuss about vllm.
I am not sure if this should be a bug report, this is why I am starting by submitting this as a discussion.
We are running vllm on 4 gpus via Kubernete…
-
When I run _Meta-Llama-3-8B-Instruct_ or _Meta-Llama-3.1-8B-Instruct_ with
1. python 3.12.5
2. scalellm 0.1.9+cu118torch2.2.2
3. torch 2.2.2+cu1…