internlm2 Search Results

666 results
for internlm2

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

RLHFlow/RLHF-Reward-Modeling #26

Regarding the Gemma2 Reward Model Structure

I tried to reproduce your gemma2B reward model training again and found that the reward model architecture fine tuned with internlm2 had an output header of 1. I downloaded your GRM-Gemma-2B-Sftrug re…

Loong435 updated 3 weeks ago
2
unslothai/unsloth #767

unsloth-internLM 2.5

can we please get official support for internLM-2.5? I have seen a closed issue regarding that #734. however, the model mentioned there might be broken as it fails to load for instance. It woul…

rezzie-rich updated 3 weeks ago
17
InternLM/lmdeploy #1849

[Bug] Segmentation fault: address not mapped to object at ad…

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [x] 2. The bug has not been fixed in the latest version. ### Describe the bug Segmentation fault whe…

austingg updated 2 months ago
4
InternLM/xtuner #898

如何实现多张卡共同存放单个模型

背景：模型internlm2.5-7b-chat 单机4卡A10，单卡24GB 问题： 1. xtuner能实现模型切分么？即多个卡共用一个模型，而不是在每个卡上都单独加上一个模型再去微调训练，这样的话很容易会显存不足；能做到说模型切分到了某张卡运行，其余的卡加载数据训练吗？ 2. 按官方要求的多轮对话格式训练时，其中的单个message json就达到了几百个，且暂时不拆分数…

RyanOvO updated 9 hours ago
1
InternLM/lmdeploy #1735

[Bug] 量化模型时无输出

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [X] 2. The bug has not been fixed in the latest version. ### Describe the bug ``` shell D:\AI_model>lmdepl…

NB-Group updated 2 months ago
4
ollama/ollama #5670

The usage of VRAM has significantly increased

### What is the issue? In previous versions, I set the context length of each of my models to the maximum value that could be fully loaded onto the GPU memory. However, after the update, I found that…

lingyezhixing updated 1 month ago
5
InternLM/lmdeploy #1719

[Bug] Why does prefix caching change the generated content

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest version. ### Describe the bug Model： internlm2-chat-7b GPU…

DayDayupupupup updated 2 months ago
16
modelscope/ms-swift #1019

可以支持一下InternLM2-Math-Plus-Mixtral8x22B的微调吗

这个模型效果非常不错,数学上面接近gpt4o了

zhangfan-algo updated 2 months ago
1
InternLM/InternEvo #262

[Bug] 好像没有把internevo的MoE权重转换成huggingface版本的脚本？

### Describe the bug 我好像没有找到用internevo训练然后转换成对应的hf的脚本？请问有提供嘛？ ### Environment 官方代码 ### Other information _No response_

Cerberous updated 1 month ago
8
SmartFlowAI/EmoLLM #266

请问爹系男友心理咨询师模型的数据集是哪个？

如题，是开放出来了？对应是datasets文件夹下的哪个数据？同理：老母亲心理咨询师模型对应的训练数据是哪几个？艾薇御姐模型对应的训练数据是哪几个？感谢。

RyanOvO updated 1 month ago
5

上一页 1...5 6 7 8 9 10 11...67 下一页

666 results for internlm2

666 results
for internlm2