4teamwork / ftw.tika

This product integrates Apache Tika for full text indexing with Plone.
4 stars 1 forks source link

Use transform caching if possible #4

Closed lukasgraf closed 11 years ago

lukasgraf commented 11 years ago

Currently, when the portal transform is used by an indexer, it gets called multiple times and therefore calls Tika to convert the exact same document multiple times.

Evaluate the caching options offered by portal_transforms, and if possible, cache the transformation based on the input document, input MIME type and output MIME type.

lukasgraf commented 11 years ago

Caching in portal_transforms apparently works by passing the object keyword argument to TransformTool.convertTo(). This object then will be used by PortalTransforms.cache.Cache.setCache() to store the cached data in the _v_transform_cache attribute.