-
I am trying to use the HF model "mlfoundations/dclm-7b-it" for inference, simply using the code below:
```
model = AutoModelForCausalLM.from_pretrained("mlfoundations/dclm-7b-it")
gen_kwargs = {"…
-
raise NotImplementedError(msg)
NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs:
query : shape=(2, 2688, 1, 512) (torch.float16)
key …
-
ApplyPulid
No operator found for `memory_efficient_attention_forward` with inputs: query : shape=(1, 577, 16, 64) (torch.float16) key : shape=(1, 577, 16, 64) (torch.float16) value : shape=(1, 577, 1…
-
你好作者!
非常有意义的工作,请问 我在将蒸馏模型改进为旋转框时,在第一轮训练是正常的但是第二轮损失等均为nan请问我应该如何改进
这是log:
2024/11/09 14:25:17 - mmengine - INFO -
------------------------------------------------------------
System environment:
…
-
使用DAT_light_x2.pth 和 DAT_2_x2.pth模型
Traceback (most recent call last):
File "basicsr/test.py", line 44, in
test_pipeline(root_path)
File "basicsr/test.py", line 34, in test_pipeline
…
-
Traceback (most recent call last):
File "/home/notebook/code/group/benny/modflows/./train_encoder_v2.py", line 124, in
trained_model.load_state_dict(trained_param)
File "/opt/conda/lib/pyt…
-
Hi @X-Lai, @tianzhuotao, @yukang2017, @yanwei-li , @xbkaishui
I am evaluating your method with your provided model (xinlai/LISA-13B-llama2-v1) as a part of my research studies. I see the significant …
-
Tasks:
Replace the current representation with a sparse matrix format using scipy.sparse.csr_matrix:
Rows: Quadrants (e.g., 4 rows for a 4-quadrant image).
Columns: One-hot encoding for each co…
-
Hello, I had the following problem when trying to run demo:
RuntimeError: Error(s) in loading state_dict for ModuleList:
size mismatch for 0.proj.weight: copying a param with shape torch.S…
-
size mismatch for cls.predictions.decoder.bias: copying a param with shape torch.Size([31090]) from checkpoint, the shape in current model is torch.Size([31092]).
Hello, when I run the training com…