Closed MrZilinXiao closed 2 years ago
Thank you for the pull request!
This is my fault, caused by a recent change in the library.
We intend that the commands work without the --language
option in monolingual settings so we have fixed the code accordingly https://github.com/studio-ousia/luke/pull/165.
Please pull the code from the latest master
branch and run the current command.
Thanks again for your report.
If not setting language explicitly, all
lang
fields of entities inentity_vocab.jsonl
will be left null, leading to https://github.com/studio-ousia/luke/blob/a40c580c5f1ad2f189dd02d195002921f6a4c994/luke/pretraining/dataset.py#L366 triggered exceptions since https://github.com/studio-ousia/luke/blob/a40c580c5f1ad2f189dd02d195002921f6a4c994/luke/pretraining/dataset.py#L317 getsentity_id
byDumpDB.lang
.