Closed rahimentezari closed 11 months ago
This issue is caused by an anomaly with the rope position encoding when the beam size is greater than 1. We are currently addressing this problem. In the meantime, you can temporarily set the beam size to 1, and adjust the topk and topp values to make the output more randomized.
This problem has been solved. You can update to latest SAT via:
git clone https://github.com/THUDM/SwissArmyTransformer
cd SwissArmyTransformer
pip install . --no-deps
I am trying to generate multiple caption using chat function and using BeamSearchStrategy instead of BaseStrategy. Is this the right way?
Because I am getting this error