Alfresco / SearchServices

Alfresco Search Services
GNU Lesser General Public License v3.0

mechanism to temporarily prevent text retrieval #396

Open hi-ko opened 2 years ago

hi-ko commented 2 years ago

The content tracker still needs to run synchronously. We need a mechanism to temporarily prevent text retrieval in order to avoid scalability issues and timeouts (caused by async transactions), especially when we already know that the text transformations are long-running, as with OCR.

In the old, synchronous transformer framework it was possible to fake such a feature by setting cm:isContentIndexed=false, which prevented the node's content from being picked up from the repository before it had been transformed, and then removing that aspect once the text transformation was available.
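To make the workaround concrete, here is a minimal sketch of the flow as I understand it. It is not Alfresco or Search Services code: `Node`, `defer_text_retrieval`, `release_text_retrieval` and `should_fetch_text` are hypothetical names, and the node is just an in-memory object. Only the property name cm:isContentIndexed comes from the description above. The sketch also removes the property directly, whereas in the repository it would be the aspect that gets removed.

```python
from dataclasses import dataclass, field
from typing import Optional

# Property name taken from the issue text; everything else here is hypothetical.
IS_CONTENT_INDEXED = "cm:isContentIndexed"


@dataclass
class Node:
    """Hypothetical in-memory stand-in for a repository node."""
    node_id: str
    properties: dict = field(default_factory=dict)
    text: Optional[str] = None  # result of the (slow) text transformation


def defer_text_retrieval(node: Node) -> None:
    """Mark the node so that a tracker should skip its text for now."""
    node.properties[IS_CONTENT_INDEXED] = False


def release_text_retrieval(node: Node, text: str) -> None:
    """Store the transformed text and remove the marker again."""
    node.text = text
    node.properties.pop(IS_CONTENT_INDEXED, None)


def should_fetch_text(node: Node) -> bool:
    """Assumed rule: fetch text unless the property is explicitly false."""
    return node.properties.get(IS_CONTENT_INDEXED, True) is not False


# Usage: an OCR document is held back until its text is ready.
scan = Node("ocr-scan")
defer_text_retrieval(scan)
print(should_fetch_text(scan))   # tracker would skip the text at this point
release_text_retrieval(scan, "recognized text")
print(should_fetch_text(scan))   # tracker may fetch the text now
```

The point of the sketch is only the ordering: the marker is set before the long-running transformation starts and cleared after it finishes, so the tracker never waits on a text that does not exist yet.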

395 / SEARCH-2974 breaks this old "feature". So either we get a new feature to postpone text retrieval, or the isContentIndexed mechanism works again as expected, e.g.