-
Some weights of the model checkpoint at ./BAAI/bge-reranker-v2-minicpm-layerwise were not used when initializing LayerWiseMiniCPMForCausalLM: ['lm_head.0.linear_head.weight', 'lm_head.1.linear_head.we…
-
Hi guys, I have some questions about fine-tuning.
1. In `finetune_lora.sh`, q and k are selected as the `lora_target_modules`. My understanding is that, considering the efficiency of LoRA, q and v should b…
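For context on this question, here is a minimal, hedged sketch of a peft `LoraConfig` that targets the q and v projections instead. The module names `q_proj`/`v_proj` are an assumption and vary by architecture:

```python
from peft import LoraConfig

# Minimal sketch: target the query and value projections, as the original
# LoRA paper recommends. The module names below are an assumption; check
# the model's actual layer names (e.g. via model.named_modules()).
config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # q and v instead of q and k
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
```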
-
### Description
https://github.com/OpenBMB/LLMFarm-MiniCPM
After selecting the model per this document, in step 2 the template cannot be found in the iOS app LLM Farm, so the model cannot be used.
![image](https://github.com/OpenBMB/MiniCPM/assets/41416092/64e98d0b-0ea5-4588-…
-
- Make the training process faster
- All training samples should be padded to the same `max_length` (see the sketch below)
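A minimal sketch of the requested behavior, assuming a Hugging Face tokenizer (the checkpoint name is illustrative): padding every sample to a fixed `max_length` gives all batches one static shape.

```python
from transformers import AutoTokenizer

# Illustrative checkpoint; substitute the tokenizer you actually train with.
tokenizer = AutoTokenizer.from_pretrained("openbmb/MiniCPM-2B-sft-bf16", trust_remote_code=True)

batch = tokenizer(
    ["a short sample", "a somewhat longer training sample"],
    padding="max_length",  # pad everything to the same fixed length
    max_length=512,
    truncation=True,
    return_tensors="pt",
)
print(batch["input_ids"].shape)  # torch.Size([2, 512])
```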
-
As titled.
I see that swift has fine-tuning code for V1. Can it be used directly on V2.0, or does it need to be re-developed?
-
The online demo keeps failing on every request, and API calls report that the HTTP connection cannot be established. Could you check whether online use is no longer supported?
-
Hello,
This is great work! I have several questions:
1. In the technical report you mentioned
> We find that LoRA empirically leads to better performance than fully tuning across all c…
-
I use the following code for inference, and GPU memory usage exceeds 10 GB. Isn't that too much for a 2B model?
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
from peft import PeftModel
import json
torch.manual_seed(0)
lora_pat…
```
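Since the snippet is truncated, this is only an assumption, but ~10 GB for a 2B model suggests the weights are loaded in fp32. A minimal sketch of loading in bfloat16, which roughly halves the weight memory (the checkpoint path is illustrative):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

# Illustrative checkpoint path; substitute your own.
model_path = "openbmb/MiniCPM-2B-sft-bf16"
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# bfloat16 weights take ~2 bytes/parameter (~4 GB for 2B params) vs. ~8 GB in fp32.
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
).cuda()
```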
-
**Describe the bug**
RuntimeError: output with shape [1, 210, 446] doesn't match the broadcast shape [3, 210, 446]
**Your hardware and system info**
CUDA 11.8, PyTorch 2.2, A100-40G
**Addition…
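Since the report is truncated, the cause is only a guess, but this exact error typically comes from normalizing a 1-channel (grayscale) image with 3-channel mean/std: in-place ops cannot broadcast their output to a larger shape. A minimal repro:

```python
import torch

# A grayscale image (C=1) normalized with 3-channel statistics.
img = torch.rand(1, 210, 446)
mean = torch.tensor([0.485, 0.456, 0.406]).view(3, 1, 1)
std = torch.tensor([0.229, 0.224, 0.225]).view(3, 1, 1)

# In-place subtraction cannot broadcast the output from 1 to 3 channels:
# RuntimeError: output with shape [1, 210, 446] doesn't match the broadcast shape [3, 210, 446]
img.sub_(mean).div_(std)

# A likely fix: convert the image to RGB first, e.g. image.convert("RGB")
# in PIL, or img = img.expand(3, -1, -1) on the tensor.
```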
-
### Custom dataset (dataset.jsonl) format
{"query": "question text", "response": "answer text", "history": [["past question 1", "past answer 1"], ["past question 2", "past answer 2"]], "images": ["image path"]}
Notes:
Each line of dataset.jsonl is one JSON object; the file has multiple lines, with one JSON line per image.
Since the dataset is temporar…
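A minimal sketch of producing one such line with Python's standard `json` module (all field values are placeholders):

```python
import json

# Placeholder values; the keys follow the format described above.
record = {
    "query": "What is shown in this picture?",
    "response": "A cat sitting on a sofa.",
    "history": [["past question 1", "past answer 1"]],
    "images": ["path/to/image.jpg"],
}

# One JSON object per line; ensure_ascii=False keeps non-ASCII text readable.
with open("dataset.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(record, ensure_ascii=False) + "\n")
```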