Open AnnaKholkina opened 3 months ago
You cannot train the model without any entity types. The model needs entity types to compute de matching scores.
you can pre-define the list of labels under the key "label", if the list of named entities is empty:
{'tokenized_text': ['In', 'this', 'year', '.'], 'ner': [], 'label': ["person", "org"]}
Hi. I want to finetune a model on data where some of them do not contain entities (so that there is less fp). I tried to do it with such examples in the dataset: {'tokenized_text': ['In', 'this', 'year', '.'], 'ner': []}, And I have an error:
Or this format: {'tokenized_text': ['In', 'this', 'year', '.'], 'ner': [[]]}, And error:
Is there any way to fix this?