bojone / rerope

Rectified Rotary Position Embeddings
330 stars 27 forks source link

Generating same token #21

Open Madhu000 opened 3 months ago

Madhu000 commented 3 months ago

I was running the code using your Rerope implementation with vicuna-7b for Code completion task but each time it is producing the same sequence of tokens like nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody nobody'. I am using transformer 4.41 can you please inform me what could be the issue behind the generation. Dataset that I am using here is https://huggingface.co/datasets/microsoft/LCC_python. I was actually implementing this to reproduce the baseline results of Hirope..