Reading: HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis

0. Paper

The authors pose two research questions:

Is it valid to use pre-trained BERT to model historical semantic change?
Can additional training on (balanced) historical data help to improve the precision of BERT in quantifying historical semantic change?

HistBERT training starts from the last checkpoint of the pre-trained BERT model and continues on historical text.

Data: Diachronic Usage Pair Similarity (DUPS) dataset
Task: Given pairs of usages of a target word from different decades, participants judge whether the word's meaning has changed
Evaluation: Spearman's rank correlation coefficient
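
To make the evaluation metric concrete, here is a minimal sketch of Spearman's rank correlation in plain Python. It assumes the model produces one score per usage pair and that each pair also has one human rating. Those assumptions, the function names, and the toy numbers are all illustrative and not taken from the paper.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical scores for five usage pairs (made up for illustration).
model_scores = [0.91, 0.45, 0.78, 0.30, 0.66]
human_ratings = [3.8, 2.1, 3.5, 1.2, 2.0]

rho = spearman(model_scores, human_ratings)
print(f"{rho:.3f}")  # 0.900
```

Only the ordering of the scores matters here, so the model scores and human ratings do not need to share a scale.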

HistBERT models outperform the original BERT model.