-
Is streaming output supported? And are there any benchmark results for time to first token (TTFT) and time per output token (TPOT)? Thanks.
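For reference, the two metrics can be measured against any streaming endpoint that yields tokens incrementally. A minimal sketch in plain Python (the `fake_stream` generator below is a stand-in for a real streaming API, not part of any actual client):

```python
import time

def measure_streaming_latency(token_stream):
    """Measure time to first token (TTFT) and mean time per output
    token (TPOT) from any iterable that yields tokens as they arrive."""
    start = time.perf_counter()
    ttft = None
    tokens = []
    for token in token_stream:
        now = time.perf_counter()
        if ttft is None:
            ttft = now - start  # latency until the first token arrives
        tokens.append(token)
    total = time.perf_counter() - start
    # TPOT is usually reported over the tokens after the first one.
    tpot = (total - ttft) / max(len(tokens) - 1, 1) if ttft is not None else None
    return ttft, tpot, tokens

def fake_stream():
    # Stand-in for a real streaming API: ~50 ms "prefill", then ~10 ms/token.
    time.sleep(0.05)
    for t in ["Hello", ",", " world"]:
        yield t
        time.sleep(0.01)

ttft, tpot, tokens = measure_streaming_latency(fake_stream())
print(f"TTFT={ttft*1000:.1f} ms, TPOT={tpot*1000:.1f} ms, tokens={tokens}")
```

Swapping `fake_stream()` for a real streaming response gives comparable numbers across serving backends.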
-
-
Reproducing fine-tuning by following the official tutorial, I get grad_norm: nan during training.
The configuration is as follows:
# Model
pretrained_model_name_or_path = 'internlm/internlm2-chat-7b'
use_varlen_attn = False
# Data
data_path = 'data'
prompt_template = PROMPT_TEMPLAT…
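A `grad_norm` of NaN usually means the gradients overflowed (common with fp16) or the loss itself became NaN. As a plain-Python illustration of the check behind the logged value (a sketch, not xtuner's actual trainer code), note how a single inf/NaN gradient entry poisons the global norm:

```python
import math

def global_grad_norm(grads):
    """Global L2 norm over a list of flat gradient lists, mirroring the
    grad_norm value a trainer logs each step."""
    sq = 0.0
    for g in grads:
        for v in g:
            sq += v * v
    return math.sqrt(sq)

def check_step(grads):
    norm = global_grad_norm(grads)
    healthy = not (math.isnan(norm) or math.isinf(norm))
    # Typical mitigations when unhealthy: lower the learning rate,
    # enable/tighten gradient clipping, or switch fp16 -> bf16.
    return norm, healthy

# One overflowing value anywhere makes the whole logged norm non-finite:
print(check_step([[0.1, -0.2], [1e200, 1e200]]))
```

So the NaN you see is a global symptom; the first step is finding which loss term or layer produces the first non-finite value.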
-
### Describe the feature
An earnest plea for a 4B model. Reasons:
1. InternLM2-1.8B-Chat's capability still needs improvement; after quantization the quality is not good enough to handle Chinese-to-Japanese translation.
2. Qwen-4B-Chat-Int4 performs well at translating Chinese into Japanese for the Holo translation task. InternLM2-Chat-7B can also do it, but its GPU memory consumption is too high.
3. InternLM2 lacks a mid-weight model, whose memory footprint would be relatively…
-
### Checklist
- [x] 1. I have searched related issues but cannot get the expected help.
- [x] 2. The bug has not been fixed in the latest version.
### Describe the bug
lmdeploy serve api_server --s…
-
For officially released LLMs, it is suggested that AWQ and GPTQ quantized versions be attached in the future. Doing so costs almost nothing, yet would help many potential users who lack GPUs. It would also make the models more convenient to use, since officially released quantized versions are generally regarded as more authoritative.
-
Two cards: the first with 12G of 32G in use, the second with 1G of 32G in use. The model is internlm2-chat-7b.
- Loading on the second card alone uses about 30G of memory and starts normally;
- Since that card is nearly full, I wanted to also use the first card, so I set gpu-index to 0,1 on the registration page. On startup it fails with the error "Remote server unixsocket".
-
I'm working on an attention backend based on `xformers` to improve performance on V100; is there anything I need to be aware of when doing so or should it be straightforward?
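One thing worth checking regardless of backend is numerical agreement with a reference implementation. A naive single-head attention in plain Python (a ground-truth sketch for validating a custom backend's output, not xformers code) looks like:

```python
import math

def reference_attention(q, k, v):
    """Naive softmax(Q K^T / sqrt(d)) V for one head, with Q/K/V as
    nested lists. Useful as a ground truth when validating a custom
    attention backend against small inputs."""
    d = len(q[0])
    out = []
    for qi in q:
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d)
                  for kj in k]
        m = max(scores)                      # subtract max for stability
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        weights = [e / z for e in exps]
        out.append([sum(w * vj[c] for w, vj in zip(weights, v))
                    for c in range(len(v[0]))])
    return out

q = [[1.0, 0.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
print(reference_attention(q, k, v))
```

Comparing a backend's output to this on small random tensors (within fp16-appropriate tolerances) catches scaling, masking, and layout bugs early.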
-
### Motivation
When doing w8a8 quantization in the PyTorch engine, I found that the InternLM2 modeling code names its submodules differently from LLaMA-style models: it uses self.attention, self.feed_forward, and so on:
```python
class InternLM2DecoderLayer(nn.Module)…
-
Does Llama 3 support inference and fine-tuning on multi-GPU machines? Could you please add some sample code for a single machine with multiple GPUs?