ymcui / MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
https://www.aclweb.org/anthology/2020.findings-emnlp.58/
Apache License 2.0
639 stars, 56 forks

What if the computed similar word differs in length from the original word? Thanks #12

Closed TriLoo closed 2 years ago

TriLoo commented 2 years ago

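Thanks to the authors. The idea is simple and effective, but I still have two questions:

  1. As I understand it, N-gram masking uses whole words as the unit, right? For example, would a 4-gram mask 4 consecutive whole words?
  2. For the Mac part: a word is given as input, but what happens when the synonym that comes back has a different length from the original word?

Thanks.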
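To make the two questions above concrete, here is a minimal Python sketch of one possible way to handle them. It is not confirmed by this thread or taken from the MacBERT code. It assumes that an N-gram covers `n` consecutive whole words, that each word is replaced individually, and that candidates whose character length differs from the original are simply discarded. `TOY_SYNONYMS`, `TOY_VOCAB`, `same_length_replacement`, and `mask_ngram` are hypothetical names invented for illustration; a real pipeline would query a synonym toolkit instead of a hard-coded table.

```python
import random

# Hypothetical toy synonym table (assumption); stands in for a real synonym lookup.
TOY_SYNONYMS = {
    "喜欢": ["喜爱", "爱"],      # "爱" is shorter than the original word
    "电影": ["影片", "电影片"],  # "电影片" is longer than the original word
    "看": [],                    # no candidates at all
}
# Hypothetical toy vocabulary (assumption) used only for the fallback step.
TOY_VOCAB = ["我", "你", "看", "书", "电影", "影片", "喜欢", "喜爱"]


def same_length_replacement(word, rng):
    """Return a replacement with the same character length as `word`.

    Assumed policy, not the authors' confirmed method: prefer a same-length
    synonym, else a random same-length vocabulary word, else keep the word.
    """
    candidates = [s for s in TOY_SYNONYMS.get(word, []) if len(s) == len(word)]
    if candidates:
        return rng.choice(candidates)
    fallback = [v for v in TOY_VOCAB if len(v) == len(word) and v != word]
    return rng.choice(fallback) if fallback else word


def mask_ngram(words, start, n, rng):
    """Replace `n` consecutive whole words starting at `start`, one word at a time."""
    out = list(words)
    for i in range(start, min(start + n, len(words))):
        out[i] = same_length_replacement(words[i], rng)
    return out


words = ["我", "喜欢", "看", "电影"]  # already segmented into whole words
masked = mask_ngram(words, start=1, n=3, rng=random.Random(0))
print(masked)
# Character lengths are preserved position by position, so labels stay aligned.
assert [len(w) for w in masked] == [len(w) for w in words]
```

Filtering by length keeps the corrupted sequence aligned character by character with the original, which is what makes a per-position prediction target well defined. Whether the released pre-training pipeline does this is left unanswered in the thread.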

stale[bot] commented 2 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] commented 2 years ago

Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.