OmkarPathak / pyresparser

A simple resume parser used for extracting information from resumes
GNU General Public License v3.0
774 stars 395 forks source link

no parsing done for tables in the resume pdf/doc #5

Open annapurnarelan20 opened 4 years ago

annapurnarelan20 commented 4 years ago

Hi, Have been trying to run the parser with the resumes containing data in tabular format like skills or experience in the resume is listed in a table , that information is skipped and is not parsed by the parser.

can you help in correcting the issue.

OmkarPathak commented 4 years ago

Textract and pdfminer find it hard to read tables. You can try something like: https://blog.chezo.uno/tabula-py-extract-table-from-pdf-into-python-dataframe-6c7acfa5f302

OmkarPathak commented 4 years ago

@annapurnarelan20 can you provide a sample resume so that I can use it for testing purposes

annapurnarelan20 commented 4 years ago

Hi, PFA a pdf resume with tabular format data. Sorry for late reply!

Thanks! Annapurna Relan

On Thu, Oct 3, 2019 at 7:40 PM Omkar Pathak notifications@github.com wrote:

@annapurnarelan20 https://github.com/annapurnarelan20 can you provide a sample resume so that I can use it for testing purposes

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/OmkarPathak/pyresparser/issues/5?email_source=notifications&email_token=ALH6MNLP77JSL2SGZKMUOMLQMX4ORA5CNFSM4IOFIQP2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAIKTFA#issuecomment-537962900, or mute the thread https://github.com/notifications/unsubscribe-auth/ALH6MNKD2B35GFRSP5IUAMTQMX4ORANCNFSM4IOFIQPQ .

annapurnarelan20 commented 4 years ago

Hey,

I just signed the petition "Sushant Singh Rajput: Boycott Karan Johar, YRF films, Salman Khan" and wanted to see if you could help by adding your name.

Our goal is to reach 3,000,000 signatures and we need more support. You can read more and sign the petition here:

http://chng.it/42Kn9G6mLt

Thanks! annapurna