vaites / php-apache-tika

Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats
MIT License
116 stars 22 forks source link

Support for --text-main? #5

Closed NinoSkopac closed 7 years ago

NinoSkopac commented 7 years ago

Hey, I know there's --text-main in Tika-app, how about Tika-server?

vaites commented 7 years ago

This function is not documented on server mode. I looked at source code and seems unimplemented, so will ask in Tika community.

NinoSkopac commented 7 years ago

Hey man, I already did: https://issues.apache.org/jira/browse/TIKA-2343

vaites commented 7 years ago

Great, keep me informed then...

NinoSkopac commented 7 years ago

This is now possible. Tika 1.15 has been released today. Please see this: https://issues.apache.org/jira/browse/TIKA-2343

vaites commented 7 years ago

Thanks Nico, I expected to have more time until the new version. I'm working on this feature and secuencial read, will release both soon.

Meanwhile, can you provide an example of document or file that needs text-main?. I need a new sample for this feature, so if you have one will save me some time...

NinoSkopac commented 7 years ago

Meanwhile, can you provide an example of document or file that needs text-main?. I need a new sample for this feature, so if you have one will save me some time...

Absolutely: https://drive.google.com/open?id=0B7AKQvQJ6zw2M3ZWaWNOaUI1T00

NinoSkopac commented 7 years ago

Awesome

vaites commented 7 years ago

The 0.3.7 version is released with this feature. Thanks for your suggestion.

NinoSkopac commented 7 years ago

Thank you for the implementation!

NinoSkopac commented 7 years ago

I just implemented this on my end, works as should. Just wanted to let you know.