Open Swarn10 opened 4 years ago
Hi @Swarn10, nice work, but parsing means converting the text into some structured format, like a json or python dictionary. So it would be great if you build up on that. Also instead of taking the file in a hardcoded fashion, take the input of the file from the command line with some flag, say -f. You might want to remove the ocr.py and pdf_to_image.py as they are redundant.
Hey ! I have added programs to obtain text from the resumes in a neat and clean manner so that resume parsing can be done smoothly. I have also added the text that was obtained from the sample pdf(resume.txt) for better understanding.