I've reindexed the searchable text of 2718 files (different sizes and types) in 53 minutes and 8 seconds, which gives an average of 1.1 second per file.
I'm not sure how fast the other implementations were, but it feels quite slow.
We should try to optimize this.
Maybe running tika in server mode (which could easily be set up as a supervisor program), although it only seems to support HTML..
The performance isn't that good at the moment.
I've reindexed the searchable text of 2718 files (different sizes and types) in 53 minutes and 8 seconds, which gives an average of
1.1 second per file
. I'm not sure how fast the other implementations were, but it feels quite slow.We should try to optimize this. Maybe running tika in server mode (which could easily be set up as a supervisor program), although it only seems to support HTML..