Open bmschmidt opened 8 years ago
I'm looking at your intriguing blog post of 23 Dec, 2016 and examining this plot, below, and it seems like you somehow solved this problem of tracing the movement of a word over time, while holding the context words in a constant location.
Would it be possible to know roughly how you accomplished this for that figure? Did you replace empire
with, for example, empire_1979-2010
in the text, and then generate a single embedding for the entire corpus including documents from all periods? Or did you generate embeddings for each period and somehow align them, etc.? Thank you.
The simplest approach I've found is the 'Temporal Referencing' method described in:
Haim Dubossarsky, Simon Hengchen, Nina Tahmasebi and Dominik Schlechtweg. 2019. Time-Out: Temporal Referencing for Robust Modeling of Lexical Semantic Change. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Florence, Italy. Association for Computational Linguistics.
That's giving me quite good results, so I'd recommend it.
This Stanford paper describes the most promising method I've seen so far for aligning multiple different models; it would be a useful addition here.