Closed angelo337 closed 7 years ago
The simplest is probably to use the SubstringTransformer as a post-parse handler.
<transformer class="com.norconex.importer.handler.transformer.impl.SubstringTransformer"
end="10000"/>
The above example will truncate after the 10,000 character (exclusive).
Does that work for you?
thanks a lot for your fast answer, I will try it and let you know
hi there I have a question regarding the importer, Is it possible to limit the content size of a File, I am having issues with a some large files in MS-Excel, and I would like to just index a couple first MB instead of the 45MB os the full file.
could you please point me out some resource or give me some clues how to deal with that large files? thanks a lot best regards angelo