-
### Your current environment
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu Jammy Jellyfish (development branch…
-
From Twitter - adding new tokens to Qwen doesn't work?
```python
# Add special tokens to the tokenizer
num_added_tokens = tokenizer.add_special_tokens({"additional_special_tokens": special_tokens})
…
```
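A likely cause (hedged guess, since the snippet is truncated): after `tokenizer.add_special_tokens(...)` the model's embedding table must also be grown with `model.resize_token_embeddings(len(tokenizer))`, otherwise the new token IDs index past the end of the embedding matrix. The stdlib-only sketch below simulates that failure mode with plain lists; the sizes and token names are illustrative, not taken from the report.

```python
# Stdlib-only illustration of why newly added special tokens break lookups
# unless the embedding table is resized. Names/sizes are hypothetical.
vocab_size = 5                                        # pretend base vocabulary
embeddings = [[0.0] * 4 for _ in range(vocab_size)]   # 5 rows, hidden dim 4

# add_special_tokens assigns IDs past the end of the current vocab
special_tokens = ["<extra_0>", "<extra_1>"]
token_ids = {tok: vocab_size + i for i, tok in enumerate(special_tokens)}

new_id = token_ids["<extra_0>"]   # == 5, out of range for the old table
try:
    _ = embeddings[new_id]        # embedding lookup fails
    lookup_ok = True
except IndexError:
    lookup_ok = False

# The fix mirrors model.resize_token_embeddings(len(tokenizer)):
embeddings.extend([[0.0] * 4 for _ in range(len(special_tokens))])
resized_ok = embeddings[new_id] is not None
```

With real `transformers` objects the resize call is the one documented step; the list simulation above only mirrors its effect on table size.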
-
Thank you for the great work.
When running with Qwen/Qwen2.5-72B-Instruct, the following problem occurs:
![image](https://github.com/user-attachments/assets/1ecf1efd-85d7-42ee-8935-d700a7d6acf5)
-
RTX 4090 24 GB,
Qwen-7B-Chat
loads OK:
```
model_config = ModelConfig(lora_infos={
"lora_1": conf['lora_1'],
"lora_2": conf['lora_2'],
})
model = ModelFactory.from_huggingface(conf['b…
```
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is there an existing ans…
-
### Model Series
Qwen2.5
### What are the models used?
Qwen/Qwen2.5-1.5B-Instruct
### What is the scenario where the problem happened?
[inference with] [vllm]
### Is this a known issue?
- …
-
Hello, the config.json at https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4/blob/c34a4a91629f09f73a285f32dbd26106b033c654/config.json#L29 mentions that the group size is 128 for 4-bit or 8-bit. So could y…
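For context, GPTQ checkpoints expose this value in the `quantization_config` block of `config.json`. A minimal sketch of inspecting it, with the JSON excerpt assumed for illustration (field names follow the common GPTQ config layout; only `group_size: 128` is stated in the linked config):

```python
import json

# Assumed excerpt of a GPTQ quantization_config; values are illustrative
# except group_size, which the linked config reports as 128.
config_json = """
{
  "quantization_config": {
    "bits": 4,
    "group_size": 128,
    "quant_method": "gptq"
  }
}
"""
qcfg = json.loads(config_json)["quantization_config"]
print(qcfg["quant_method"], qcfg["bits"], qcfg["group_size"])
```

In practice the same dict is available after `AutoConfig.from_pretrained(...)` as the `quantization_config` attribute.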
-
This project is very easy to use; thanks to the development team.
However, while using it I found that the project mainly supports LLMs. Could you provide an example that supports Qwen-VL (via API calls)? Many thanks!
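As a rough sketch of what such an example might look like: many serving stacks expose an OpenAI-compatible chat endpoint, where vision-language requests put an `image_url` part alongside text in the message content. The endpoint URL and model name below are assumptions for illustration, not this project's confirmed API; only the request body is constructed (and could be sent with `urllib.request` or any HTTP client).

```python
import json

# Hypothetical OpenAI-compatible chat request for a Qwen-VL model.
# Model id and URL are placeholders, not confirmed values.
payload = {
    "model": "Qwen-VL-Chat",
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "image_url",
                 "image_url": {"url": "https://example.com/cat.png"}},
                {"type": "text", "text": "Describe this image."},
            ],
        }
    ],
}
body = json.dumps(payload)
# POST body to e.g. http://localhost:8000/v1/chat/completions (assumed endpoint)
```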
-
Please add Qwen/Qwen2-7B model to the neuron cache
-
After extending the vocabulary of Qwen-2.5, the loss spiked to 2000. Could you help take a look at the problem?
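One common cause of such spikes (a hedged guess, since no code is shown) is that the embedding rows for the new tokens are left randomly or zero initialized, far from the pretrained distribution. Initializing them to the mean of the existing rows often stabilizes early loss. Stdlib-only sketch below; with `transformers` you would apply the same idea to `model.get_input_embeddings().weight` after `resize_token_embeddings`. All numbers here are illustrative.

```python
# Mean-initialization of new embedding rows, simulated with plain lists.
old_embeddings = [[1.0, 3.0], [3.0, 1.0]]   # pretend pretrained rows
num_new = 3                                  # pretend new vocab entries

# Column-wise mean over the pretrained rows
mean_row = [sum(col) / len(old_embeddings) for col in zip(*old_embeddings)]

# New rows start at the pretrained mean instead of random/zero values
new_embeddings = old_embeddings + [list(mean_row) for _ in range(num_new)]
```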