-
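First, I have a few small questions about generating the training data:
1. What is the schema file for? It does not appear to be passed to the script during fine-tuning, so my understanding is that only the train and test data matter for the final fine-tuning. However, the reference data folder for each task contains several files, and I don't quite understand what each of them is for.
2. The examples in the data folder look like JSONL files (the data is not wrapped in an outer [ ]), but the downloaded files are JSON files. So when passing a file to the script, what should it be…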
-
When I run 'run_ner.py', it shows an error like this:
run_ner.py: error: the following arguments are required: --task_type, --task_save_name, --data_dir, --data_name, --model_name, --model_name_or_pat…
-
Hi,
Thank you for your amazing work on this paper. I found it truly insightful. I wanted to inquire about the release of the Universal NER Benchmark Data mentioned in the paper and outlined in Appe…
-
Using the preliminary output gleaned by scraping abstracts with the NER, create R code that helps sort, visualize, and prioritize papers to follow up on.
-
**Use case**
```
┌─sum(cityHash64(article, byline, dates, newspaper_metadata, antitrust, civil_rights, crime, govt_regulation, labor_movement, politics, protests, ca_topic, ner_words, ner_label…
-
Hi. I want to fine-tune a model on data in which some examples contain no entities (to reduce false positives). I tried doing this with examples like the following in the dataset:
{'tokenized_text': ['In', 'this', 'ye…
-
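Hi everyone, I'm still a beginner. How do you all prepare your own datasets?
Right now I can't even get the program to run. I followed the instructions in `README.md` and downloaded some files (see the figure below), but I don't know where to put them or how to rename them.
I see many places in the code that use absolute paths, which presumably need to be changed to my own paths. How exactly should I change them, and where do I find the corresponding files?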
```txt
/data2/wangshuhe/gpt3_ner/gpt3-data…
-
`pii_model = bolt.NER.load("./models/pretrained_multilingual.model")`
failed with
```
---------------------------------------------------------------------------
ValueError …
-
IOC extraction can be done via regexes. Add those and label the text accordingly.
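A minimal sketch of what regex-based IOC labeling could look like, in case it helps the discussion. The pattern set, the label names (`IPV4`, `URL`, `MD5`, `SHA256`), and the `extract_iocs` helper are all assumptions made for illustration; they are not taken from this project, and real IOC patterns would need to be broader and better tested.

```python
import re

# Hypothetical, deliberately small pattern set; labels are illustrative only.
IOC_PATTERNS = {
    "IPV4": re.compile(
        r"\b(?:(?:25[0-5]|2[0-4]\d|1?\d?\d)\.){3}(?:25[0-5]|2[0-4]\d|1?\d?\d)\b"
    ),
    "URL": re.compile(r"\bhttps?://[^\s<>]+"),
    "MD5": re.compile(r"\b[a-fA-F0-9]{32}\b"),
    "SHA256": re.compile(r"\b[a-fA-F0-9]{64}\b"),
}


def extract_iocs(text):
    """Return non-overlapping (start, end, label, match) spans in text order."""
    candidates = []
    for label, pattern in IOC_PATTERNS.items():
        for m in pattern.finditer(text):
            candidates.append((m.start(), m.end(), label, m.group()))

    # Sort by start offset, preferring the longest match at each position,
    # then drop any span that overlaps one already kept (e.g. an IP in a URL).
    candidates.sort(key=lambda s: (s[0], -(s[1] - s[0])))
    spans, last_end = [], 0
    for span in candidates:
        if span[0] >= last_end:
            spans.append(span)
            last_end = span[1]
    return spans


if __name__ == "__main__":
    sample = (
        "Beacon to 192.168.1.10 and http://evil.example/payload "
        "with hash d41d8cd98f00b204e9800998ecf8427e"
    )
    for start, end, label, value in extract_iocs(sample):
        print(f"{label:7} [{start}:{end}] {value}")
```

The character offsets make it straightforward to convert the spans into whatever annotation format the NER side expects. Overlaps are resolved by keeping the earliest, longest match, which is a simple choice rather than the only reasonable one.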
-
Hello, is NER supported, and does it list entity categories the way spaCy does?