是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?
[X] 我已经搜索过已有的issues和讨论 | I have searched the existing issues / discussions
该问题是否在FAQ中有解答? | Is there an existing answer for this in FAQ?
[X] 我已经搜索过FAQ | I have searched FAQ
当前行为 | Current Behavior
Traceback (most recent call last):
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in
0%| | 0/1 [00:00<?, ?it/s]
0%| | 0/1 [00:00<?, ?it/s]
Traceback (most recent call last):
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in
Traceback (most recent call last):
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(args, kwargs)
return forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
ret_val = func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
ret_val = func(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
ret_val = func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
ret_val = func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
ret_val = func(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
ret_val = func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
ret_val = func(*args, *kwargs)
ret_val = func(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward
loss = self.module(*inputs, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
loss = self.module(*inputs, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
loss = self.module(inputs, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
loss = self.module(*inputs, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
loss = self.module(*inputs, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
loss = self.module(inputs, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
loss = self.module(*inputs, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
loss = self.module(*inputs, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
result = forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
result = forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
result = forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
result = forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
result = forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
result = forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
result = forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
result = forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward
return self.base_model(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
return self.base_model(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
return self.base_model(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
return self.base_model(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
return self.base_model(
return self.base_model(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
return self.base_model(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
return self.base_model(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
result = forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
result = forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
result = forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
result = forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
result = forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
result = forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
result = forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
result = forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward
return self.model.forward(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return self.model.forward(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return self.model.forward(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return self.model.forward(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return self.model.forward(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return self.model.forward(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
transformer_outputs = self.transformer(
return self.model.forward(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return self.model.forward(*args, **kwargs)
transformer_outputs = self.transformer( File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
result = forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
result = forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
result = forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
result = forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
result = forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
result = forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
result = forward_call(*args, **kwargs)
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
result = forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_listrotary_pos_emb_list = [
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
result = forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
result = forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
result = forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
result = forward_call(*args, *kwargs)result = forward_call(args, **kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
result = forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha)
result = forward_call(args, **kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha)
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
emb = rearrange(emb, "n d -> 1 n 1 d")
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
emb = rearrange(emb, "n d -> 1 n 1 d")
result = forward_call(*args, **kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
emb = rearrange(emb, "n d -> 1 n 1 d") File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
emb = rearrange(emb, "n d -> 1 n 1 d")emb = rearrange(emb, "n d -> 1 n 1 d")
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
emb = rearrange(emb, "n d -> 1 n 1 d")
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
emb = rearrange(emb, "n d -> 1 n 1 d")
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
emb = rearrange(emb, "n d -> 1 n 1 d")
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
return reduce(tensor, pattern, reduction="rearrange", axes_lengths)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
return reduce(tensor, pattern, reduction="rearrange", axes_lengths)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
return reduce(tensor, pattern, reduction="rearrange", axes_lengths)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
return reduce(tensor, pattern, reduction="rearrange", axes_lengths)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
return reduce(tensor, pattern, reduction="rearrange", axes_lengths)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
return reduce(tensor, pattern, reduction="rearrange", axes_lengths)
return reduce(tensor, pattern, reduction="rearrange", **axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
return reduce(tensor, pattern, reduction="rearrange", **axes_lengths)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = get_backend(tensor)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend
backend = BackendSubclass()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
backend = BackendSubclass()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
backend = BackendSubclass()
backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
backend = BackendSubclass()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
backend = BackendSubclass()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
backend = BackendSubclass()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
backend = BackendSubclass()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in <module>
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return forward_call(args, kwargs)
transformer_outputs = self.transformer( File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, *kwargs)return forward_call(args, **kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
transformer_outputs = self.transformer(
return forward_call(*args, **kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
transformer_outputs = self.transformer(
transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
transformer_outputs = self.transformer(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
inputs_embeds = self.wte(input_ids)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
return forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
return forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
inputs_embeds = self.wte(input_ids)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
inputs_embeds = self.wte(input_ids)
return forward_call(args, kwargs)inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
return forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
inputs_embeds = self.wte(input_ids)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
inputs_embeds = self.wte(input_ids)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
inputs_embeds = self.wte(input_ids)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward
inputs_embeds = self.wte(input_ids)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return forward_call(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return forward_call(args, **kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeErrorreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse):
'weight' must be 2-Dreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: RuntimeError'weight' must be 2-D:
'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return F.embedding(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeErrorreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse):
'weight' must be 2-Dreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: RuntimeError'weight' must be 2-D:
'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?
该问题是否在FAQ中有解答? | Is there an existing answer for this in FAQ?
当前行为 | Current Behavior
Traceback (most recent call last): File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in
0%| | 0/1 [00:00<?, ?it/s]
0%| | 0/1 [00:00<?, ?it/s]
Traceback (most recent call last):
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in
Traceback (most recent call last):
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
train()
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
trainer.train()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
return inner_training_loop(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
tr_loss_step = self.training_step(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
loss = self.compute_loss(model, inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = model(inputs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
return forward_call(args, kwargs)
return forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn ret_val = func(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, *kwargs)
ret_val = func(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward loss = self.module(*inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward return self.model.forward(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer(
return self.model.forward(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, **kwargs) transformer_outputs = self.transformer( File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl result = forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward rotary_pos_emb_list = [ File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
result = forward_call( args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
result = forward_call(*args, **kwargs)
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
result = forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_listrotary_pos_emb_list = [
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
rotary_pos_emb_list = [
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
result = forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
result = forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
result = forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
result = forward_call(*args, *kwargs)result = forward_call(args, **kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache result = forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha)
result = forward_call(args, **kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d") File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d")
result = forward_call(*args, **kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange emb = rearrange(emb, "n d -> 1 n 1 d") File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange emb = rearrange(emb, "n d -> 1 n 1 d")emb = rearrange(emb, "n d -> 1 n 1 d")
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d") File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange emb = rearrange(emb, "n d -> 1 n 1 d")
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d") File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) return reduce(tensor, pattern, reduction="rearrange", **axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", **axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
from . import _torch_specific # noqa
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
allow_ops_in_compiled_graph()
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from torch._dynamo import allow_in_graph
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from . import allowed_functions, convert_frame, eval_frame, resume_execution
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
from .utils import HAS_NUMPY, is_safe_constant, np
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import cProfile
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
import profile as _pyprofile
File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
pred = model.generate(inputs, generation_config=config)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return super().generate(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return func(*args, *kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
return self.sample(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
outputs = self(
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return forward_call(*args, kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return forward_call(*args, *kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
return forward_call(args, kwargs)
transformer_outputs = self.transformer( File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, *kwargs)return forward_call(args, **kwargs)
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer(
return forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl transformer_outputs = self.transformer(
transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward return forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward return forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl inputs_embeds = self.wte(input_ids)
return forward_call(args, kwargs)inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward return forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return forward_call(args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeErrorreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse): 'weight' must be 2-Dreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: RuntimeError'weight' must be 2-D: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D
期望行为 | Expected Behavior
No response
复现方法 | Steps To Reproduce
No response
运行环境 | Environment
备注 | Anything else?
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeErrorreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse): 'weight' must be 2-Dreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: RuntimeError'weight' must be 2-D: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D