Relation extraction on Chinese?

plkmo / BERT-Relation-Extraction

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

Apache License 2.0

Relation extraction on Chinese? #15

Closed: Jim0530 closed this issue 4 years ago

Jim0530 commented 4 years ago

What is the purpose of adding the config setting fine_tuning_task=semeval? If I use this on Chinese, what should I be aware of? Thanks.

plkmo commented 4 years ago

The --task config option allows other tasks, such as FewRel, to be included in the future. For Chinese, you would need a Chinese pre-trained BERT with an appropriate tokenizer (such as jieba). You will also need an entity recognizer for Chinese, which is not yet supported in spaCy, in order to generate annotated data for pretraining/auto-inference. Otherwise the code would be largely unchanged.

wangguanhua commented 3 years ago

What is the purpose of adding the config setting fine_tuning_task=semeval? If I use this on Chinese, what should I be aware of? Thanks.

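I think what the author said doesn't seem quite right: Chinese is split character by character, so jieba isn't needed. I think we just need to swap in a different pre-trained model and replace the WordPiece step with Chinese-appropriate processing.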
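
To make the character-level splitting mentioned in this comment concrete, here is a minimal sketch in plain Python. It is not code from this repository: the function names are hypothetical, and the entity marker strings ([E1], [/E1], [E2], [/E2]) are assumed purely for illustration. It treats each Chinese character as its own token, keeps marker strings and non-Chinese runs intact, and only checks the main CJK Unified Ideographs block, so a real Chinese BERT tokenizer will differ in the details.

```python
import re
from typing import List

# Assumed marker strings, for illustration only; adjust to your own data format.
MARKERS = ("[E1]", "[/E1]", "[E2]", "[/E2]")


def is_cjk(ch: str) -> bool:
    """Return True for characters in the main CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def char_level_tokens(text: str) -> List[str]:
    """Split text so that each Chinese character becomes its own token.

    Entity markers are kept whole, and runs of non-Chinese characters
    (Latin words, digits, punctuation) are kept together until a space,
    a marker, or a Chinese character ends the run.
    """
    marker_pattern = "|".join(re.escape(m) for m in MARKERS)
    tokens: List[str] = []
    # The capturing group makes re.split return the markers as separate pieces.
    for piece in re.split(f"({marker_pattern})", text):
        if piece in MARKERS:
            tokens.append(piece)
            continue
        buffer = ""
        for ch in piece:
            if is_cjk(ch) or ch.isspace():
                # A Chinese character or whitespace ends any pending run.
                if buffer:
                    tokens.append(buffer)
                    buffer = ""
                if is_cjk(ch):
                    tokens.append(ch)
            else:
                buffer += ch
        if buffer:
            tokens.append(buffer)
    return tokens


if __name__ == "__main__":
    print(char_level_tokens("[E1]马云[/E1]创立了[E2]阿里巴巴[/E2]"))
```

No word segmenter is involved here, which is the point being made above: with character-level input, the remaining work is mostly choosing a Chinese pre-trained model and making sure the marker strings survive tokenization.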