Pdf text extract does not work properly

SenseNet / sensenet

Open Source Content Services Platform written in .NET

https://sensenet.com

GNU General Public License v2.0

173 stars 112 forks source link

Pdf text extract does not work properly #74

Closed borsi closed 7 years ago

borsi commented 7 years ago

There should be a text extract functionality in Services, but it looks like it does not extract much text from pdf's. @kavics knows more about the issue, please share any relevant info here.

kavics commented 7 years ago

The issue is triggered by the missing PDF IFilter on the benchmark webservers. After installing the appropriate component, the test extracting works properly. Existing an appropriate PDF IFilter on every web node is a prerequisite of the installing SenseNet.Services.