7matic / SPZZ

2 stars 0 forks source link

Implementing Custom NER Model for CV Parsing with Dockerization #12

Closed 7matic closed 11 months ago

7matic commented 11 months ago

Key Features:

Known Issues:

Fixes #6

7matic commented 11 months ago

Things to improve:

* Skills part of response is sometimes too long. It should include only keywords and not whole sentences.

* Example4 and 5 have mostly the same structure, yet it misses most of data from example 5.

* extract_names function may not work for people that have multiple names ex. van, von ...

Things to fix:

* Date (clock) in response headers is wrong.

* validate_file returns HTTP 400 - File does not exist, if the file is NOT a pdf. According
  to the docs. it should return 400 - File is not a PDF.

Notes:

* Maybe force the user to use a specific CV structure, like example 4, as it seems to work the best, if the model cannot be improved.

Thanks a lot for taking the time to review pull request and providing your feedback.