Open salekinsirajus opened 2 months ago
Did some brain storming:
1)Learn about structure if PDF file 2) Exporting the PDF onto the back end as a whole file. 3) Using Python libraries to parse the content. 4) Write down an algorithm to do the parsing. This will be modified to increase accuracy and parse content more universally for the resumes out there.
Problem Statement The goal of this task is to come up with an algorithm and implementation to analyze the entirety of the content. Note that the deliverable is not a perfectly annotated resume, as that is an almost impossible task. Rather we want to focus on annotating 80% or more of the content with reasonable accuracy.
Files Changed Implement this in the backend - similar to how issue #4 is done
Approaches There are multiple ways to attack this problem; please consider the pros and cons of all
Note This issue needs a substantial research/whiteboarding session before implementation.