QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Apache License 2.0
13.59k stars 1.11k forks source link

[BUG] RuntimeError: 'weight' must be 2-D #928

Closed lyc202001 closed 8 months ago

lyc202001 commented 9 months ago

是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?

该问题是否在FAQ中有解答? | Is there an existing answer for this in FAQ?

当前行为 | Current Behavior

Traceback (most recent call last): File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in 0%| | 0/1 [00:00<?, ?it/s] 0%| | 0/1 [00:00<?, ?it/s] Traceback (most recent call last): File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in Traceback (most recent call last): File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 360, in train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train train() File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/finetune.py", line 353, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train trainer.train() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1555, in train return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop return inner_training_loop( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 1837, in _inner_training_loop tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step tr_loss_step = self.training_step(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2682, in training_step loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss loss = self.compute_loss(model, inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/trainer.py", line 2707, in compute_loss outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = model(inputs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(args, kwargs) return forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn ret_val = func(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward ret_val = func(*args, *kwargs)
ret_val = func(
args,
kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1769, in forward loss = self.module(*inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl loss = self.module(*inputs, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/peft_model.py", line 1073, in forward return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl return self.base_model( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward result = forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/peft/tuners/tuners_utils.py", line 103, in forward return self.model.forward(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer(
return self.model.forward(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return self.model.forward(*args, **kwargs) transformer_outputs = self.transformer( File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl result = forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward result = forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward rotary_pos_emb_list = [ File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in rotary_pos_emb_list = [ File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in result = forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward rotary_pos_emb_list = [ File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in rotary_pos_emb_list = [ File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in result = forward_call(*args, **kwargs)
self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward

result = forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl rotary_pos_emb_list = [

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 851, in forward self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_listrotary_pos_emb_list = [

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl rotary_pos_emb_list = [ File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in rotary_pos_emb_list = [ File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 852, in self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl self.rotary_emb(kv_seq_len, ntk_alpha=ntk_alpha) for ntk_alpha in ntk_alpha_list File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl result = forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward result = forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward result = forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward result = forward_call(*args, *kwargs)result = forward_call(args, **kwargs)

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache result = forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha)
result = forward_call(
args, **kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache

self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d") File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d")
result = forward_call(*args, **kwargs)
File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange emb = rearrange(emb, "n d -> 1 n 1 d") File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1313, in forward

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange emb = rearrange(emb, "n d -> 1 n 1 d")emb = rearrange(emb, "n d -> 1 n 1 d")

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d") File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange emb = rearrange(emb, "n d -> 1 n 1 d")
self.update_rotary_pos_emb_cache(max_seq_len, ntk_alpha) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1307, in update_rotary_pos_emb_cache emb = rearrange(emb, "n d -> 1 n 1 d") File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 591, in rearrange return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", axes_lengths) return reduce(tensor, pattern, reduction="rearrange", **axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce return reduce(tensor, pattern, reduction="rearrange", **axes_lengths) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/einops.py", line 518, in reduce backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = get_backend(tensor) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 53, in get_backend backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init backend = BackendSubclass() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_backends.py", line 221, in init from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in from . import _torch_specific # noqa from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in

  File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in <module>

from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in from . import _torch_specific # noqa File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 127, in allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph allow_ops_in_compiled_graph() File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/einops/_torch_specific.py", line 106, in allow_ops_in_compiled_graph from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from torch._dynamo import allow_in_graph File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/init.py", line 1, in from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in from . import allowed_functions, convert_frame, eval_frame, resume_execution
from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in from . import allowed_functions, convert_frame, eval_frame, resume_execution File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/allowed_functions.py", line 18, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in from .utils import HAS_NUMPY, is_safe_constant, np File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 4, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import cProfile File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/cProfile.py", line 11, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in import profile as _pyprofile File "/home/bmm-system/data/lyc/Qwenbase/Qwen-main/profile.py", line 66, in pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context pred = model.generate(inputs, generation_config=config) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1261, in generate return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return super().generate( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return func(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return func(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return func(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return func(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return func(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return func(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return func(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return func(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 1642, in generate return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample return self.sample( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/transformers/generation/utils.py", line 2724, in sample outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl outputs = self( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward return forward_call(args, kwargs)
transformer_outputs = self.transformer( File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, *kwargs)return forward_call(args, **kwargs)

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer(
return forward_call(*args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl transformer_outputs = self.transformer(
transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 1045, in forward transformer_outputs = self.transformer( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward return forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward return forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl inputs_embeds = self.wte(input_ids)
return forward_call(
args,
kwargs)inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl

File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward return forward_call(*args, *kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(args, kwargs) File "/root/.cache/huggingface/modules/transformers_modules/Qwen-72B-Chat/modeling_qwen.py", line 824, in forward inputs_embeds = self.wte(input_ids) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(args, kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return forward_call(*args, *kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return forward_call(args, **kwargs) File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/modules/sparse.py", line 162, in forward return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeErrorreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse): 'weight' must be 2-Dreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)

RuntimeError: RuntimeError'weight' must be 2-D: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D

期望行为 | Expected Behavior

No response

复现方法 | Steps To Reproduce

No response

运行环境 | Environment

- OS:
- Python:3.10
- Transformers:
- PyTorch:
- CUDA (`python -c 'import torch; print(torch.version.cuda)'`):

备注 | Anything else?

File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return F.embedding( File "/home/bmm-system/data/lyc/envs/130b2/lib/python3.10/site-packages/torch/nn/functional.py", line 2210, in embedding return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeErrorreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse): 'weight' must be 2-Dreturn torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)

RuntimeError: RuntimeError'weight' must be 2-D: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse) RuntimeError: 'weight' must be 2-D

jklj077 commented 8 months ago

Hi, please delete the profile.py file from the root (or rename it) and see what will happen.