For resources of kind "Document", it would be useful to extract and store text from them. E.g. for PDF resources, text layer should be similar to what is emitted by Linux utility pdftotext. The text layer can be used later for filtering resources by specified text in content, or for various text analytics (e.g. counting words).
For resources of kind "Document", it would be useful to extract and store text from them. E.g. for PDF resources, text layer should be similar to what is emitted by Linux utility
pdftotext
. The text layer can be used later for filtering resources by specified text in content, or for various text analytics (e.g. counting words).