qwen-api Search Results

1000+ results
for qwen-api

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

PygmalionAI/aphrodite-engine #330

[sparsetral and Qwen2idae]: support for mixtral of lora

### The model to consider. https://huggingface.co/serpdotai/sparsetral-16x7B-v2-SPIN_iter1 https://huggingface.co/LoneStriker/sparsetral-16x7B-v2-8.0bpw-h8-exl2/tree/main https://huggingface.co/h…

sorasoras updated 6 months ago
27
frdel/agent-zero #3

[feature request] local only

having the ability to use the api to paid services is cute and all. can we have local only. nobody wants to pay for these services anymore especially as llama3.1 blew them away with costly tie…

Tom-Neverwinter updated 2 months ago
7
EleutherAI/lm-evaluation-harness #2096

Question: Realtoxicityprompts takes >10 seconds per query, i…

Hello, I've tried running realtoxicityprompts (github.com/EleutherAI/lm-evaluation-harness/blob/main/lm_eval/tasks/realtoxicityprompts/) through the Hugging Face leaderboard backend code (https://h…

meg-huggingface updated 4 months ago
1
sgl-project/sglang #1945

[Bug] tp-size=2，model launch error

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest version. - [ ] 3. Please note that if the bug-related issue y…

linqingxu updated 16 hours ago
3
ollama/ollama #7443

Reply GGGGGGGGGGGGGG running nemotron:latest

### What is the issue? Ollama using Docker mode. When execute 'sudo docker exec -it ollama ollama run nemotron:latest', or "sudo docker exec -it ollama ollama run qwen2.5:72b" it replied "GGGGGGG…

3keyallen3 updated 2 weeks ago
3
intel-analytics/ipex-llm #11709

update ollama 0.3.x support

''ipex-llm[cpp]==2.5.0b20240527 is consistent with [v0.1.34] of ollama. Our current version is consistent with [v0.1.39] of ollama.'' Is it possible to update supported ollama version to 0.3.x?

przybjul updated 2 months ago
9
NVIDIA/TensorRT-LLM #1920

Qwen2-72B-Instruct-GPTQ-Int4 Conversion Success, Run Failure

### System Info NVIDIA-SMI 535.154.05 Driver Version: 535.154.05 CUDA Version: 12.4 - GPU properties - GPU name: NVIDIA L20 - GPU memory size: 46068MiB - Libraries - Te…

linchpinlin updated 1 day ago
8
vllm-project/vllm #5298

[Bug]: After fine-tuning Qwen Lora, the inference results di…

### Your current environment ```text The output of `python collect_env.py` ``` ### 🐛 Describe the bug https://github.com/hiyouga/LLaMA-Factory/issues/4049 transformers+lora ![image](https:/…

lonngxiang updated 2 weeks ago
14
datawhalechina/DOPMC #180

self-llm

### 你是否已经阅读并同意《Datawhale开源项目指南》？ - [X] 我已阅读并同意[《Datawhale开源项目指南》](https://github.com/datawhalechina/DOPMC/blob/main/GUIDE.md) ### 你是否已经阅读并同意《Datawhale开源项目行为准则》？ - [X] 我已阅读并同意[《Datawhale开源项目行为准则》](h…

KMnO4-zx updated 10 months ago
22
vllm-project/vllm #9761

[Feature]: Qwen2.5 model : ValueError: This model does not …

### 🚀 The feature, motivation and pitch Using Qwen2.5 model : ValueError: This model does not support the 'embedding' task. Supported tasks: {'generate'} reproduction : `python -m vllm.entryp…

tarikbeijing updated 12 hours ago
3

上一页 1...37 38 39 40 41 42 43...100 下一页

1000+ results for qwen-api

1000+ results
for qwen-api