Model I am using (LayoutLM ...):
I would like to build a custom resume parser that accurately predicts the EDUCATION, SKILLS, and EXPERIENCE sections of a resume. I have fine-tuned LayoutLMv3 on a custom dataset that is similar to the FUNSD dataset.
The fine-tuned model does predict education keywords, but only at the word level. For instance, if the resume states "My education is in computer engineering from LD College Ahmedabad," the model labels "computer" and "engineering" as EDUCATION individually. What I want instead is for all words classified under a label to be grouped into a single section, not returned as separate word-level entries.
Here are some screenshots of the current model output:
![Screenshot from 2023-03-13 18-36-47](https://user-images.githubusercontent.com/105478351/225011201-5e94ac74-3628-4d5b-9281-7e5063eb9807.png)
And this is the output I would like: box coordinates for the whole EDUCATION section and the whole SKILLS section, each identified by its keywords.
![Screenshot from 2023-03-13 18-32-05](https://user-images.githubusercontent.com/105478351/225011228-5610487c-e15f-4894-b03d-9de0e8846dce.png)
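To make the desired output concrete, here is a minimal sketch of the kind of post-processing I have in mind: taking the word-level predictions and merging the boxes of each label into one section-level box. This is only an illustration, not part of LayoutLMv3 or any library. The function name `merge_word_boxes`, the BIO-style label names, and the coordinates are all hypothetical, and it assumes boxes are given as `[x0, y0, x1, y1]`.

```python
from collections import defaultdict

def merge_word_boxes(words, boxes, labels, ignore=("O",)):
    """Union the word-level boxes of each label into one section-level box.

    Hypothetical helper: assumes each box is [x0, y0, x1, y1] and that
    labels may carry a BIO prefix such as "B-" or "I-".
    """
    sections = defaultdict(lambda: {"words": [], "box": None})
    for word, (x0, y0, x1, y1), label in zip(words, boxes, labels):
        if label in ignore:
            continue
        # Drop the BIO prefix so "B-EDUCATION" and "I-EDUCATION" share a key.
        if label[:2] in ("B-", "I-"):
            label = label[2:]
        section = sections[label]
        section["words"].append(word)
        if section["box"] is None:
            section["box"] = [x0, y0, x1, y1]
        else:
            bx = section["box"]
            # Grow the section box so it also encloses this word.
            section["box"] = [min(bx[0], x0), min(bx[1], y0),
                              max(bx[2], x1), max(bx[3], y1)]
    return dict(sections)

# Made-up word-level predictions, for illustration only.
words = ["computer", "engineering", "Python", "SQL"]
boxes = [[100, 200, 180, 215], [185, 200, 270, 215],
         [100, 400, 150, 415], [160, 400, 190, 415]]
labels = ["B-EDUCATION", "I-EDUCATION", "B-SKILLS", "I-SKILLS"]

sections = merge_word_boxes(words, boxes, labels)
print(sections["EDUCATION"]["box"])  # [100, 200, 270, 215]
print(sections["SKILLS"]["box"])     # [100, 400, 190, 415]
```

A plain union like this only works when every word of a label sits in one contiguous region of the page. If the same label appears in several places, or the model only tags a few keywords and misses the rest of the section, the resulting box will be too large or too small, which is why I am asking whether a different model is a better fit.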
Note: I have also tried the Layout Parser model with the PubLayNet dataset. However, it was unable to accurately predict and classify the sections for EDUCATION, SKILLS, EXPERIENCE, etc.
If any other models would be suitable for this use case, please suggest them. Thank you all for your help.