Open Rickylht opened 3 months ago
Replace `sep=""` with `sep="<s>"`. Otherwise, the model mishandles the separation token: it treats the blank space as a stop token and outputs only one token at a time during inference. After the change, the model outputs correctly.
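To illustrate why the separator value matters, here is a minimal sketch of one plausible mechanism, assuming the generation loop stops as soon as the decoded text contains the template's `sep` string. The `Conversation` class, the `generate` function, and the token list are hypothetical stand-ins for illustration, not the project's actual code.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Conversation:
    """Hypothetical stand-in for a conversation template; only `sep` matters here."""
    sep: str


def generate(token_stream: Iterable[str], conv: Conversation) -> List[str]:
    """Collect tokens until the decoded text contains the stop string `conv.sep`."""
    emitted: List[str] = []
    for token in token_stream:
        emitted.append(token)
        # In Python, "" is a substring of every string, so an empty `sep`
        # makes this check succeed right after the first token.
        if conv.sep in "".join(emitted):
            break
    return emitted


# Toy token stream (assumed for illustration only).
tokens = ["The", " cat", " sat", ".", "<s>", " ignored"]

print(generate(tokens, Conversation(sep="")))     # stops after a single token
print(generate(tokens, Conversation(sep="<s>")))  # runs until the separator appears
```

Under this assumption, an empty separator ends generation after the first token, which is consistent with the behavior described above, while `sep="<s>"` lets generation continue until the separator is actually produced.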