izhx / NER-unlabeled-data-retrieval

[COLING 22] Domain-Specific NER via Retrieving Correlated Samples.
https://arxiv.org/abs/2208.12995
Apache License 2.0
23 stars, 0 forks

Question about "Unlabeled Data Retrieval" #1

Open Aureole-1210 opened 1 year ago

Aureole-1210 commented 1 year ago

Is there concrete code, or a set of steps, for implementing "Unlabeled Data Retrieval"?

izhx commented 1 year ago

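Hello, I have uploaded the Elasticsearch-related code to the repository as es.py.

The main steps are:

  1. Data cleaning: deduplication, normalization, and so on.
  2. Elasticsearch index construction; see add_raw_copus() in es.py.
  3. Retrieval of related text; see retrieval() in es.py.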
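
For readers who want a rough picture of the three steps before opening es.py, here is a minimal sketch. It is not the code from the repository. The index name `raw_corpus`, the field name `text`, and the helper names `clean_corpus`, `bulk_actions`, and `match_query` are all assumptions made for illustration, and the cleaning rules (NFKC normalization, whitespace collapsing, exact-match deduplication) are one plausible reading of step 1, not the authors' actual procedure. The functions only build the request payloads, so they run without an Elasticsearch server.

```python
import re
import unicodedata
from typing import Dict, Iterable, Iterator, List

INDEX_NAME = "raw_corpus"  # hypothetical index name, not taken from es.py
TEXT_FIELD = "text"        # hypothetical field name, not taken from es.py


def normalize(text: str) -> str:
    """Apply NFKC normalization and collapse runs of whitespace."""
    text = unicodedata.normalize("NFKC", text)
    return re.sub(r"\s+", " ", text).strip()


def clean_corpus(lines: Iterable[str]) -> List[str]:
    """Step 1: normalize each line, drop empty lines and exact duplicates.

    The first occurrence of each sentence is kept and input order is preserved.
    """
    seen = set()
    cleaned = []
    for line in lines:
        norm = normalize(line)
        if norm and norm not in seen:
            seen.add(norm)
            cleaned.append(norm)
    return cleaned


def bulk_actions(texts: Iterable[str], index: str = INDEX_NAME) -> Iterator[Dict]:
    """Step 2: yield one bulk-index action per cleaned sentence."""
    for text in texts:
        yield {"_index": index, "_source": {TEXT_FIELD: text}}


def match_query(sentence: str, size: int = 10) -> Dict:
    """Step 3: build a full-text match query for texts related to `sentence`."""
    return {"query": {"match": {TEXT_FIELD: sentence}}, "size": size}


corpus = clean_corpus(["Ｈello   world", "Hello world", "", "second  line"])
actions = list(bulk_actions(corpus))
query = match_query("hello", size=5)

# With a running cluster and the official Python client installed, the payloads
# could be sent roughly like this (keyword arguments vary by client version):
#
#   from elasticsearch import Elasticsearch, helpers
#   es = Elasticsearch("http://localhost:9200")
#   helpers.bulk(es, actions)
#   hits = es.search(index=INDEX_NAME, body=query)["hits"]["hits"]
```

Keeping payload construction separate from the client calls makes the cleaning and query logic easy to check offline. A `match` query ranks by the index's default similarity, which may or may not match what retrieval() in es.py does.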