4teamwork / ftw.tika

This product integrates Apache Tika for full text indexing with Plone.
4 stars 1 forks source link

PDF indexed but not office documents #17

Closed wunderlins closed 10 years ago

wunderlins commented 10 years ago

Hi

environment:

I have now a seemingly woking setup. When uploading PDF files they can be searched nearly instantly. This, however, doesn't work for docx nor xlsx.

I have run the installed tika version 1.5 from the commandline with the argument --text on a docx file and it produces text output to stdout.

However, when I add the same file as "File" to my plone site the quicksearch will not find it.

wunderlins commented 10 years ago

Bugreport itself is a bug, can be closed.

jone commented 10 years ago

Have you installed ftw.tika in the addons control panel of the Plone settings? This registers the tika transforms and needs to be done for converting docx.

wunderlins commented 10 years ago

This was actually my stupid error. Intereating however is, that pdfs got indexed.

Sent from my brain. On 7 Aug 2014 13:39, "Jonas Baumann" notifications@github.com wrote:

Have you installed ftw.tika in the addons control panel of the Plone settings? This registers the tika transforms and needs to be done for converting docx.

— Reply to this email directly or view it on GitHub https://github.com/4teamwork/ftw.tika/issues/17#issuecomment-51460706.

jone commented 10 years ago

ok then :wink: