-
Looking forward to support for Qwen1.5 being added, including Qwen1.5-7B-Chat, Qwen1.5-7B-Chat-GPTQ-Int8, and so on.
Qwen1.5 is more powerful than Qwen.
Thank you.
-
Thank you for your contributions. I have a question regarding why the pretraining_length is 32384, while in https://huggingface.co/Qwen/Qwen1.5-14B-Chat/blob/main/config.json, the "max_position_embedd…
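For reference, a minimal sketch of reading that field straight from the published config, assuming the standard transformers `AutoConfig` API (not part of the original question):
```python
# Minimal sketch: print the context-length field shipped in the model's config.json.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("Qwen/Qwen1.5-14B-Chat")
print(config.max_position_embeddings)
```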
-
Training code:
```python
from datasets import Dataset
import pandas as pd
from transformers import AutoTokenizer, AutoModelForCausalLM, \
    DataCollatorForSeq2Seq, TrainingArguments, Trainer
import to…
```
-
For `XX` in [A2.7B-Chat, A2.7B]:
Check upon issue creation:
* [x] The model has not been evaluated yet and doesn't show up on the [CoT Leaderboard](https://huggingface.co/spaces/logikon/open_cot…
-
How to reproduce the Qwen1.5-7B-Chat results as reported here:
![image](https://github.com/QwenLM/Qwen1.5/assets/17668109/3c634665-4907-4d0b-b60c-b097a9a70981)
I got TOEFL = 30.198 using https://github.c…
-
I am currently verifying all the tasks under the `lm-evaluation-harness`. I will raise the issues I encounter one by one in this thread. Thank you for checking and responding! @haileysch…
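For context, a minimal sketch of running a single task through the harness's Python entry point (`lm_eval.simple_evaluate`, available in v0.4+ of lm-evaluation-harness); the model and task below are placeholders, not the exact setup used in this thread:
```python
# Minimal sketch: evaluate one illustrative task with the lm-evaluation-harness Python API.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=Qwen/Qwen1.5-7B-Chat,dtype=bfloat16",
    tasks=["gsm8k"],
    batch_size=8,
)
print(results["results"])
```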
-
I saved the Qwen1.5-4B and 7B Int4 models on my computer; when loading these models, there are some errors:
Some weights of the model checkpoint at ./models/qwen1.5-4b were not used when initializing Q…
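For comparison, a minimal sketch of how such a local Int4 (GPTQ) checkpoint is typically loaded with transformers (it needs a GPTQ backend such as `auto-gptq`/`optimum`, plus `accelerate`); the arguments here are assumptions about a common setup, not a confirmed fix for the warning above:
```python
# Minimal sketch: load the local Int4 checkpoint mentioned in the report above.
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "./models/qwen1.5-4b",
    device_map="auto",   # quantized kernels expect GPU placement; requires accelerate
    torch_dtype="auto",
)
tokenizer = AutoTokenizer.from_pretrained("./models/qwen1.5-4b")
```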
-
As the title says: starting from Qwen1.5-14B, I did continued pretraining on 95B tokens and then SFT, and found that beyond 32k the output degenerates into repetition, with decoding repeatedly generating certain strings.
Concretely: for 100k-length training I kept the RoPE base unchanged at 1M and set max_position_embeddings and the sequence length to 100k.
Is there anything wrong with doing it this way? At first glance it looks like the positional encoding wasn't learned well? Or is it a configuration problem?
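For reference, the setup described above corresponds roughly to these config fields (the field names assume the Qwen2 config class in transformers; the values are the ones stated in this report):
```python
# Minimal sketch: the long-context settings described above, expressed as config fields.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("Qwen/Qwen1.5-14B")
config.rope_theta = 1_000_000             # RoPE base left unchanged at 1M
config.max_position_embeddings = 102_400  # raised to ~100k for long-context training
print(config.rope_theta, config.max_position_embeddings)
```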
-
For `{XX}` in [0.5B, 1.8B, 4B, 7B, 14B, 32B and 72B]:
Check upon issue creation:
* [x] The model has not been evaluated yet and doesn't show up on the [CoT Leaderboard](https://huggingface.co/sp…
-
Following the steps on the English page, I modified the model, using qwen1.5-7b-chat, and ran it with `python simple_pipeline.py`; it errors out as follows:
OSError: Not enough disk space. Needed: Unknown size (download: Unknown size, generated: Unknown size, post-processed: Unknown…