sunnweiwei / user-satisfaction-simulation

"Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21

some questions about code and paper #3

Closed 652994331 closed 2 years ago

652994331 commented 2 years ago

Hi, thank you for this good work. We were trying to reproduce the model performance reported in the paper, but some details are confusing. The paper mentions that a vocab size of 30566 was used in all models; as we know, BERT-base English uncased has a vocab size of 30566, while BERT Chinese has a vocab size of 21128. According to your dataset, JDDC is a Chinese dataset and the others are English, so could you please clarify which pretrained models were used on each dataset? And could you share the evaluation accuracy on these datasets after fine-tuning (the satisfaction prediction task)? Thank you.

sunnweiwei commented 2 years ago

Thanks for your attention!

  1. Pre-trained models we used: JDDC = bert-base-chinese; others = bert-base-uncased.
  2. The accuracy of BERT is 0.7414 (on JDDC), 0.8191 (on MultiWOZ), 0.7584 (on SGD), 0.7551 (on ReDial), and 0.8090 (on CCPE); we chose the checkpoints based on the UAR (Unweighted Average Recall) metric.
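For reference, UAR is simply the macro-averaged (per-class) recall, so rare satisfaction classes count as much as frequent ones. A minimal sketch (this is an illustration, not code from this repository; the label values are made up):

```python
# UAR (Unweighted Average Recall): the mean of per-class recall,
# weighting every class equally regardless of its frequency.
from collections import defaultdict

def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recall over the classes present in y_true."""
    hits = defaultdict(int)    # correct predictions per class
    totals = defaultdict(int)  # true instances per class
    for t, p in zip(y_true, y_pred):
        totals[t] += 1
        if t == p:
            hits[t] += 1
    recalls = [hits[c] / totals[c] for c in totals]
    return sum(recalls) / len(recalls)

# Toy example with 3 satisfaction levels (0, 1, 2):
y_true = [0, 0, 1, 1, 1, 2]
y_pred = [0, 1, 1, 1, 1, 2]
print(unweighted_average_recall(y_true, y_pred))  # per-class recalls 0.5, 1.0, 1.0 -> 0.8333...
```

This is equivalent to `sklearn.metrics.recall_score(..., average='macro')`.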