-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this question answered in the FAQ?
-
When I try to reproduce the original model results of mistral-7b-v0.2 without `flash-attn`, I get the following error:
```
Traceback (most recent call last):
File "/home/yuanye/long_llm/InfLLM/benchmark/pr…
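If the goal is simply to run without `flash-attn`, and assuming the benchmark ultimately loads the model via transformers' `from_pretrained` (an assumption about InfLLM's code path), one common workaround is to request a non-flash attention backend explicitly. A minimal sketch:

```python
# Sketch, not InfLLM's actual loader: force a non-flash attention backend
# so the flash_attn package is never imported. The `attn_implementation`
# argument exists in transformers >= 4.36; "sdpa" is another non-flash option.
def load_model_without_flash_attn(checkpoint: str):
    from transformers import AutoModelForCausalLM  # lazy import on purpose
    return AutoModelForCausalLM.from_pretrained(
        checkpoint,
        attn_implementation="eager",  # plain PyTorch attention, no flash-attn
    )
```

Whether this helps depends on where InfLLM constructs the model; the function name and call site above are assumptions, not InfLLM's API.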
-
I want to LoRA-finetune the Qwen2.5-32B-Instruct-AWQ model (already 4-bit quantized) through llama-factory, but an error occurred.
```
[INFO|configuration_utils.py:677] 2024-11-21 19:44:25,957 >> loading confi…
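For context, llama-factory training runs are typically driven by a YAML config. A minimal LoRA-SFT sketch in that style (field names follow llama-factory's published examples, but treat the exact values as assumptions, not a verified working setup for an AWQ checkpoint):

```yaml
### hedged sketch of a LoRA SFT config for an AWQ checkpoint
model_name_or_path: Qwen/Qwen2.5-32B-Instruct-AWQ  # assumed hub id
stage: sft
do_train: true
finetuning_type: lora
lora_target: all
template: qwen
dataset: identity              # placeholder dataset name
output_dir: saves/qwen2.5-32b-awq-lora
per_device_train_batch_size: 1
learning_rate: 1.0e-4
num_train_epochs: 1.0
```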
-
I'm trying to run a DeepSeek-V2.5 model.
Command used: `python -m ktransformers.local_chat --model_path ./DeepSeek-V2.5/ --gguf_path ../`
```
Chat: hi
Traceback (most recent call last):
Fi…
-
## Export Error Summary Dashboard
- This report is generated from the branch of https://github.com/huggingface/optimum/pull/1712
- Produced by `RUN_SLOW=1 pytest tests/exporters/onnx -k "test_export…
-
### System Info
- `transformers` version: 4.47.0.dev0
- Platform: Linux-5.15.0-1052-oracle-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.25.2
- Safetensors version:…
-
I'm now trying to train llama3.1 with the GRIT pipeline.
At first, I directly changed `--model_name_or_path` and ran the training code (the training script I used is as follows):
```
#!/bin/bash
#SB…
-
I'm currently working with the code and I'm having some trouble overriding `forward` where the `self.module.image_newline` attribute is set or initialized in the model.
I've traced the model through the …
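One stdlib-only way to pin down where `image_newline` gets initialized is to log assignments from `__setattr__`. A pure-Python sketch of the idea (the class names below are hypothetical; with the real model you would patch `torch.nn.Module.__setattr__` in the same way):

```python
# Hypothetical sketch: record which class assigns `image_newline` by
# intercepting attribute assignment on a shared base class.
trace_log = []  # class names seen assigning the attribute we are hunting

class TracedBase:
    def __setattr__(self, name, value):
        if name == "image_newline":
            trace_log.append(type(self).__name__)
        object.__setattr__(self, name, value)

class VisionModel(TracedBase):
    def __init__(self):
        # Stand-in for the real parameter initialization.
        self.image_newline = [0.0] * 4

m = VisionModel()
print(trace_log)  # → ['VisionModel']
```

Extending the log entry with `traceback.extract_stack()` would also give the exact file and line of each assignment.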
-
Hey, while running the 4-bit quantized model from https://huggingface.co/ThetaCursed/Ovis1.6-Gemma2-9B-bnb-4bit, I am getting the following error:
```
{
"name": "RuntimeError",
"message": "self an…
-
### Is there an existing issue for this problem?
- [X] I have searched the existing issues
### Operating system
Linux
### GPU vendor
AMD (ROCm)
### GPU model
AMD Radeon RX 7800 XT…