Okay thanks for your response. Actually my intention is to extract some data from pdf files to display as title and description in the search page just like we show for normal html pages where I am getting empty field when I try to index "title" in crawl pattern.
Search result should be like below:
[PDF] Automotive Tote Labeling
... Printers & Media Application Brief Automotive Manufacturing
Labeling Industry Need Public Safety and 24/7 production ...
Please let me know any alternate solution to index and fetch any particular data in pdf files which we are able to do in our current search application. As of now I am just able to index only URL and body fields for pdf's in ES where almost body content is in binary format.
Hi,
Can anyone help me on indexing text between particular
This is to index some content in pdf files as per my requirement.
Thanks In Advance, Srinivas
Is div tag in PDF file??
Yes, this div tag is in pdf file. I need to index all such kind of pdf data for my requirement.
Hmm, extracting contents with CSS query supports HTML only. So, it's difficult to do that..
Okay thanks for your response. Actually my intention is to extract some data from pdf files to display as title and description in the search page just like we show for normal html pages where I am getting empty field when I try to index "title" in crawl pattern.
Search result should be like below:
[PDF] Automotive Tote Labeling ... Printers & Media Application Brief Automotive Manufacturing Labeling Industry Need Public Safety and 24/7 production ...
Please let me know any alternate solution to index and fetch any particular data in pdf files which we are able to do in our current search application. As of now I am just able to index only URL and body fields for pdf's in ES where almost body content is in binary format.
Thanks, Srinivas
An attachment type might work... Please see Use attachment type.