MingyuKim-2933 closed this issue 12 months ago
Hi,
Yes, your understanding is correct. We only use embeddings of single, complete words. Note that learning these embeddings can improve robustness on sentence-level tasks, while for token-level tasks it brings only marginal improvements. You could try distilled embeddings for the latter.
Thank you for your answer!
Hello! I am currently trying to reproduce the LOVE model, and I have a question about this sentence in your paper: "For ease of implementation, we learn only from the words that are not separated into pieces."
As I understand it, the vocab.txt file you provided does not include special tokens (e.g. "[PAD]", "[UNK]") or separated word pieces (e.g. ##ir, ##di).
Is my understanding correct?
Thank you!
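The filtering asked about above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes a BERT-style vocab.txt with one token per line, where special tokens are bracketed (e.g. "[PAD]") and subword pieces carry the "##" prefix.

```python
def keep_whole_words(tokens):
    """Keep only single, complete words: drop special tokens and word pieces."""
    kept = []
    for tok in tokens:
        # Bracketed entries are special tokens, e.g. [PAD], [UNK].
        if tok.startswith("[") and tok.endswith("]"):
            continue
        # "##" marks a WordPiece continuation piece, e.g. ##ir, ##di.
        if tok.startswith("##"):
            continue
        kept.append(tok)
    return kept

# Tiny made-up vocab for illustration:
vocab = ["[PAD]", "[UNK]", "the", "bird", "##ir", "##di", "word"]
print(keep_whole_words(vocab))  # → ['the', 'bird', 'word']
```

Applied to a real vocab.txt, this keeps exactly the words that are not separated into pieces, which matches the sentence quoted from the paper.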