a question about the backbone models

thu-coai / Emotional-Support-Conversation

Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

a question about the backbone models #16

Closed. BaiMeiyingxue closed this issue 1 year ago.

BaiMeiyingxue commented 1 year ago

Hi,

Great work! I was wondering whether it is possible to adapt the English task to Chinese. Do you think the strategy-based approach would still work with Chinese data and a Chinese model?

Thanks for your reply!

chujiezheng commented 1 year ago

I think the strategies still work but you may need to manually annotate Chinese dialogue utterances using our strategy schema.
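A minimal sketch of what such manual annotation could look like for Chinese dialogues, with a small check that every supporter turn carries a known strategy. This is not code from the repository: the `Utterance` record and `validate` helper are hypothetical, and the strategy names are the eight labels as recalled from the ESConv paper, so please verify them against the released data before relying on them.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumption: the eight support strategies as recalled from the ESConv paper.
# Check the exact spelling against the released dataset before use.
STRATEGIES = (
    "Question",
    "Restatement or Paraphrasing",
    "Reflection of feelings",
    "Self-disclosure",
    "Affirmation and Reassurance",
    "Providing Suggestions",
    "Information",
    "Others",
)


@dataclass
class Utterance:
    """Hypothetical record for one annotated turn."""
    speaker: str                    # "seeker" or "supporter"
    text: str                       # the Chinese utterance
    strategy: Optional[str] = None  # only supporter turns are labeled


def validate(dialog: List[Utterance]) -> List[str]:
    """Return a list of human-readable annotation problems (empty if none)."""
    errors = []
    for i, utt in enumerate(dialog):
        if utt.speaker == "supporter":
            if utt.strategy not in STRATEGIES:
                errors.append(f"turn {i}: missing or unknown strategy {utt.strategy!r}")
        elif utt.strategy is not None:
            errors.append(f"turn {i}: seeker turn should not carry a strategy")
    return errors


good_dialog = [
    Utterance("seeker", "我最近压力很大。"),
    Utterance("supporter", "发生了什么事？", "Question"),
]
bad_dialog = [
    Utterance("seeker", "我最近压力很大。", "Question"),
    Utterance("supporter", "发生了什么事？"),
]

good_errors = validate(good_dialog)
bad_errors = validate(bad_dialog)
```

Running a check like this over the annotated files before training helps catch misspelled or missing labels early.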

BaiMeiyingxue commented 1 year ago

got it, thank you!

chujiezheng commented 1 year ago

BTW, you can refer to our other paper PsyQA. It proposes a Chinese long-text counseling dataset and adopts a similar strategy schema. Hope this helps.

BaiMeiyingxue commented 1 year ago

Sorry, I just saw the message. I'm glad to find the paper!

BaiMeiyingxue commented 1 year ago

May I ask about some technical details? If I want to build an ESC language model for Chinese, what parts of the code do I need to change besides the training data? Could I just use the original pre-trained model and parameters? Looking forward to your reply!

chujiezheng commented 1 year ago

You should instead use a Chinese pre-trained dialogue model. Unfortunately, I have not seen a direct Chinese alternative to BlenderBot (e.g., a Hugging Face checkpoint), so you would need to build your own pipeline for data processing and model training.
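As a rough illustration of the data-processing half of such a pipeline, the sketch below turns an annotated dialogue into (context, target) training pairs, with the strategy prepended to the supporter's response as a bracketed token. Everything here is an assumption for illustration: the dictionary layout, the `[SEP]` separator, the `[Strategy]` token format, and the `build_examples` helper are hypothetical and are not the repository's actual preprocessing. With a real model, the strategy tokens would also need to be registered in its tokenizer.

```python
from typing import Dict, List, Tuple


def strategy_token(name: str) -> str:
    """Hypothetical token format: wrap the strategy name in brackets."""
    return "[" + name + "]"


def build_examples(
    dialog: List[Dict[str, str]],
    sep: str = " [SEP] ",        # assumed turn separator, model-dependent
    max_context_turns: int = 5,  # keep only the most recent turns as context
) -> List[Tuple[str, str]]:
    """Create one (context, target) pair per supporter turn that has history."""
    examples = []
    history: List[str] = []
    for turn in dialog:
        if turn["speaker"] == "supporter" and history:
            context = sep.join(history[-max_context_turns:])
            target = strategy_token(turn["strategy"]) + " " + turn["text"]
            examples.append((context, target))
        # The plain text (without the strategy token) becomes future context.
        history.append(turn["text"])
    return examples


dialog = [
    {"speaker": "seeker", "text": "我最近压力很大。"},
    {"speaker": "supporter", "text": "发生了什么事？", "strategy": "Question"},
    {"speaker": "seeker", "text": "工作太多了。"},
    {"speaker": "supporter", "text": "听起来你很累。", "strategy": "Reflection of feelings"},
]

examples = build_examples(dialog)
for context, target in examples:
    print(context, "=>", target)
```

Keeping the strategy token out of the context and only in the target is one possible design choice; the model then has to predict the strategy before generating the response.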

BaiMeiyingxue commented 1 year ago

I got it~ About the PsyQA paper, could I get the source code?

chujiezheng commented 1 year ago

Please follow its repo's README: https://github.com/thu-coai/PsyQA

BaiMeiyingxue commented 1 year ago

ok, thank you for your answer!

BaiMeiyingxue commented 1 year ago

BTW, if we want to pick a pre-trained model, what criteria should we consider? We found the Chinese pre-trained model CDial-GPT (https://github.com/thu-coai/CDial-GPT). Can it be used?

chujiezheng commented 1 year ago

CDial-GPT is OK but may not be satisfactory. You can consider our released EVA 2.0. It is one of the most powerful Chinese dialogue models open-sourced so far (although there are only a few open-sourced Chinese dialogue models :) ).

BaiMeiyingxue commented 1 year ago

thanks for your effort~