a1da4 / paper-survey

Summary of machine learning papers

Reading: The Dependence on Frequency of Word Embedding Similarity Measures #242

Open a1da4 opened 1 year ago

a1da4 commented 1 year ago

0. Paper

1. What is it?

They analyze the relationship between word frequency and word similarity.

2. What is amazing compared to previous works?

Previous works tried to find relationships between frequency and:

This work has a different view.

3. What is the key to the technologies and techniques?

They use three embedding models:

Two experiments are conducted:

4. How did they evaluate it?

4.1 Internal evaluation

(Screenshot: Figure 2)

Figure 2 shows that similarity is higher between words in high-frequency bins.
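The binned comparison behind this kind of figure can be sketched as follows. This is only a minimal illustration, not the paper's code: it assumes cosine similarity as the measure and decimal log-scale frequency bins, and the helper names (`cosine`, `frequency_bin`, `mean_similarity_by_bin`) and the toy vectors and counts are all hypothetical.

```python
import math
from collections import defaultdict
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def frequency_bin(count):
    """Log-scale bin of a positive integer count (0: 1-9, 1: 10-99, ...)."""
    return len(str(int(count))) - 1


def mean_similarity_by_bin(vectors, counts):
    """Average cosine similarity for every unordered pair of frequency bins."""
    sums = defaultdict(float)
    n_pairs = defaultdict(int)
    for w1, w2 in combinations(sorted(vectors), 2):
        # Sort the two bins so (0, 3) and (3, 0) land in the same cell.
        key = tuple(sorted((frequency_bin(counts[w1]), frequency_bin(counts[w2]))))
        sums[key] += cosine(vectors[w1], vectors[w2])
        n_pairs[key] += 1
    return {key: sums[key] / n_pairs[key] for key in sums}


# Toy data (hypothetical): two frequent words and two rare words.
vectors = {
    "the": [1.0, 0.0],
    "of": [1.0, 0.0],
    "zebra": [0.0, 1.0],
    "quark": [1.0, 0.0],
}
counts = {"the": 5000, "of": 3000, "zebra": 5, "quark": 7}

by_bin = mean_similarity_by_bin(vectors, counts)
for bins, sim in sorted(by_bin.items()):
    print(bins, round(sim, 3))
```

With real embeddings, each cell of the resulting table would correspond to one frequency-bin combination, so a high value in the cell for two high-frequency bins would match the pattern described above.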

(Screenshot: Figure 5)

Figure 5 shows that word vector models encode frequency.
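One simple way to probe whether vectors carry frequency information is to correlate some property of each vector with the word's log frequency. The sketch below uses the vector norm as that property. This is an assumption made for illustration and is not necessarily the analysis behind Figure 5; the function names and the toy data are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def norm_frequency_correlation(vectors, counts):
    """Correlation between vector norm and log10 frequency over all words."""
    words = sorted(vectors)
    norms = [math.sqrt(sum(a * a for a in vectors[w])) for w in words]
    log_freqs = [math.log10(counts[w]) for w in words]
    return pearson(norms, log_freqs)


# Toy data (hypothetical): the norm grows with log frequency by construction.
toy_vectors = {"rare": [1.0, 0.0], "mid": [0.0, 2.0], "common": [3.0, 0.0]}
toy_counts = {"rare": 10, "mid": 100, "common": 1000}

print(round(norm_frequency_correlation(toy_vectors, toy_counts), 3))
```

A correlation far from zero on real embeddings would suggest that frequency can be read off the vectors, at least through this one property.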

4.2 External evaluation

(Screenshot: Figure 6)

Figure 6 shows that the measured bias changes drastically with word frequency, especially the frequency of B (he).
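For reference, an embedding-based bias score of this kind is often computed as the difference between a word's mean similarity to group A and its mean similarity to group B. The sketch below assumes that formulation with cosine similarity. The exact metric used in the paper may differ, and the function names and toy vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def bias_score(word_vec, group_a, group_b):
    """Mean similarity to group A minus mean similarity to group B.

    Positive values mean the word sits closer to A, negative values closer to B.
    """
    sim_a = sum(cosine(word_vec, a) for a in group_a) / len(group_a)
    sim_b = sum(cosine(word_vec, b) for b in group_b) / len(group_b)
    return sim_a - sim_b


# Toy vectors (hypothetical): A = {she}, B = {he}.
she = [1.0, 0.0]
he = [0.0, 1.0]

print(bias_score([1.0, 0.0], [she], [he]))  # word aligned with A
print(bias_score([0.0, 1.0], [she], [he]))  # word aligned with B
```

Because the score depends directly on similarities to the A and B vectors, any frequency effect on those similarities carries over to the measured bias.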

5. Is there a discussion?

6. Which paper should be read next?