Open rsaim opened 4 years ago
Please let me know if it can be closed.
I see that a few more files were parsed successfully in https://github.com/rsaim/supplementary/pull/21/files. However, the exception was updated for the others.
We should add more parsing logic so that all PDFs for which the exception was raised are parsed successfully.
EARLIER
We expected the date to be strictly present in the line: "THE RESULT OF THE CANDIDATES WHO APPEARED IN THE FOLLOWING EXAMINATIONS HELD IN DEC-2014 IS DECLARED AS UNDER : -"
NOW
If a line contains month{any character}year, we parse it and store it as the examination date.
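For illustration, here is a minimal sketch of the relaxed month{any character}year check. The names `EXAM_DATE_RE` and `parse_examination_date`, the three-letter month list, and the normalised `MON-YYYY` return format are assumptions made for this example, not the code from the PR.

```python
import re
from typing import Optional

# Assumption: months appear as three-letter abbreviations, as in "DEC-2014".
MONTHS = "JAN|FEB|MAR|APR|MAY|JUN|JUL|AUG|SEP|OCT|NOV|DEC"

# month, then exactly one arbitrary separator character, then a 4-digit year
EXAM_DATE_RE = re.compile(rf"\b({MONTHS}).(\d{{4}})\b", re.IGNORECASE)


def parse_examination_date(line: str) -> Optional[str]:
    """Return the date as 'MON-YYYY' if the line contains one, else None."""
    match = EXAM_DATE_RE.search(line)
    if match is None:
        return None
    month, year = match.groups()
    return f"{month.upper()}-{year}"


header = ("THE RESULT OF THE CANDIDATES WHO APPEARED IN THE FOLLOWING "
          "EXAMINATIONS HELD IN DEC-2014 IS DECLARED AS UNDER : -")
print(parse_examination_date(header))
```

Because the separator is any single character, lines such as "HELD IN MAY/2015" or "held in dec 2014" would also match. A line with no month and year pair returns None, so the caller can decide whether to raise.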
For some PDFs, pdfplumber is not able to extract the text. For now I have had to put a None check there. I don't think that is the correct fix, but this way we will be able to narrow the issue down to one point.
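One way to keep the None check in a single place is sketched below. It assumes the per-page text has already been collected, for example by calling `page.extract_text()` on each page in pdfplumber's `pdf.pages`. The helper name `collect_lines` is hypothetical, and pdfplumber itself is not imported so that the snippet runs on its own.

```python
from typing import Iterable, List, Optional, Tuple


def collect_lines(
    page_texts: Iterable[Optional[str]],
) -> Tuple[List[str], List[int]]:
    """Split page texts into lines and record pages that yielded no text."""
    lines: List[str] = []
    empty_pages: List[int] = []
    for page_number, text in enumerate(page_texts, start=1):
        # Treat both None and "" as "no text extracted" for this page.
        if not text:
            empty_pages.append(page_number)
            continue
        lines.extend(text.splitlines())
    return lines, empty_pages


lines, empty_pages = collect_lines(["HELD IN DEC-2014\nROLL NO", None, "", "NAME"])
print(lines)
print(empty_pages)
```

Returning the 1-based numbers of the empty pages, rather than skipping them silently, makes it easier to report exactly which pages of which PDF failed text extraction.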