Closed ainilian closed 1 year ago
Looking forward to your reply! @pacman100 @younesbelkada @sayakpaul
Looking forward to your reply! @pacman100 @younesbelkada @sayakpaul
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Was wondering if this has been addressed.
System Info
GPU: 2*V100(64GB) CPU: 16vCPUs 128GB
envs:
Who can help?
@pacman100 @younesbelkada @sayakpaul
Information
Tasks
examples
folderReproduction
error
add print details for error
File "/***/lib/python3.7/site-packages/transformers/models/gpt_bigcode/modeling_gpt_bigcode.py", line 247, in forward key_value = torch.cat((layer_past, key_value), dim=-2) RuntimeError: Tensors must have same number of dimensions: got 5 and 3
I add some print-codes arround the "key_value = torch.cat((layer_past, key_value), dim=-2)"
the print details"
codes
huggingface trainer arguments:
use deepspeed zero3-offload
Expected behavior
Can prefix tuning be used for multi-query model like bigcode/starcoder? if prefix tuning support bigcode/starcoder, how to set the prefix tuning config?