Open rajeshkumargp opened 6 years ago
Hello @rajeshkumargp !
Thanks for reporting the problem, could you add the PDF (or send it to me by email if it is not public) so that we can reproduce the error?
Please refer the PDF in the below link. http://www.jpma.org.pk/PdfDownload/8618.pdf
Reading order issue, the title comes at the end of the page in the PDF stream and for some obscure reasons it vanishes in the limbos.
Hi, I tried to convert PDF to Full Text Document.
The title of the document is missing in the extracted xml document. Here are my trails.
Trail 1:
In Webapp , the title statement is missing.
In Webapp, with Consolidate header option enabled,
Trail 2: From CURL Command,
curl -v --form input=@./ASample.pdf http://172.16.28.52:8900/api/processFulltextDocument
curl -v --form input=@./ASample.pdf consolidateHeader=1 http://localhost:8900/api/processFulltextDocument
curl -v --form input=@./ASample.pdf --form consolidateHeader=1 http://localhost:8900/api/processFulltextDocument
For all three, I got
Title is missing.
Please guide/suggest me steps to improve/get title fileld in results. Batch mode is also fine.