Open Xiaomin-HUANG opened 5 days ago
@Xiaomin-HUANG , I think this artifacts of dataset on which this models were trained, the best way to fix it - fine-tune your model. I would like you recommend this notebook. It contains Gradio interfaces to help you label your data. Considering your tasks the amount of required examples should be small.
Model version : "knowledgator/gliner-multitask-large-v0.5", "urchade/gliner_multi-v2.1",
Issue : I used those 2 models to detect ["name_surname", "email","organization", "phone_number"], but some returned entities didn't bring any useful information.
Examples :
PS : Those unwanted entities, which are similar to label names, have a high confident score ( like 0.95). So if there are any method to filter those undesired entities ? Thank you so much.