morioka / reading

0 stars 0 forks source link

CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering #28

Open morioka opened 3 years ago

morioka commented 3 years ago

https://twitter.com/arxiv_cscl/status/1325025521709707267

CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering Xiang Yue, Xinliang Frederick Zhang, Ziyu Yao, Simon Lin, Huan Sun The Ohio State University Clinical question answering (QA) aims to automatically answer questions from medical professionals based on clinical texts. Studies show that neural QA models trained on one corpus may not generalize well to new clinical texts from a different institute or a different patient group, where large-scale QA pairs are not readily available for retraining. To address this challenge, we propose a simple yet effective framework, CliniQG4QA, which leverages question generation (QG) to synthesize QA pairs on new clinical contexts and boosts QA models without requiring manual annotations. In order to generate diverse types of questions that are essential for training QA models, we further introduce a seq2seq-based question phrase prediction (QPP) module that can be used together with most existing QG models to diversify their generation. Our comprehensive experiment results show that the QA corpus generated by our framework is helpful to improve QA models on the new contexts (up to 8% absolute gain in terms of Exact Match), and that the QPP module plays a crucial role in achieving the gain. ![image](https://user-images.githubusercontent.com/1615546/98459285-d8dfcf00-21dc-11eb-9a35-64662712f907.png)