Closed yarondbb closed 11 years ago
one of the requirements was omitting certain kind of files from indexing. is it still relevant?
Don't know, maybe we should have some "blacklist" of files which should not be indexed, @vshiray what do you think?
I found that the usage today is very time consuming: Each file which is indexex is taking ~55 mS (actual SHAI takes ~1.5 mS) More over.. when conetn data is getting bigger, each index is taking more and more time: (100mS after one minute) (200mS after one minute) ... Look at the log after few minutes: [INFO] [2013-06-02 13:49:12] [index_agent.rb:76:Yaron:start index 170] [INFO] [2013-06-02 13:49:14] [index_agent.rb:76:Yaron:start index 719] [INFO] [2013-06-02 13:49:14] [index_agent.rb:76:Yaron:start index 871] [INFO] [2013-06-02 13:49:15] [index_agent.rb:76:Yaron:start index 20]
Only 2 files where indexed in 1S
This is due to the Pattern\Indexer mutual work for each file
Also treat: hostname
is taking ~30mS (doing it for each indexing)
Using of Patterns is pretty basic. Concider remove use of pattern object