dachcom-digital / pimcore-lucene-search

Pimcore Website Indexer (powered by Zend Search Lucene)

[LuceneSearch1] multiple start urls for crawling #42

Closed dpfaffenbauer closed 6 years ago

dpfaffenbauer commented 6 years ago

When multiple start URLs are configured for crawling, only the last one ends up in the index. For example:

b2b.domain.com
www.domain.com
test.domain.com

LuceneSearch crawls these domains, but only indexes the last one.

dpfaffenbauer commented 6 years ago

This doesn't seem to happen in v2, as it tries to open an existing index before doing anything else:

https://github.com/dachcom-digital/pimcore-lucene-search/blob/master/src/LuceneSearchBundle/Task/Parser/ParserTask.php#L755

In comparison, v1 always creates a new index, which would explain why only the last start URL's pages remain:

https://github.com/dachcom-digital/pimcore-lucene-search/blob/pimcore4/lib/LuceneSearch/Model/Parser.php#L1125
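
To illustrate the difference between "always create" and "open first, create only if missing", here is a minimal toy model in Python. It is not the bundle's code and does not use Zend Search Lucene: the JSON file standing in for the index, the function names, and the assumption that the index is obtained once per start URL are all hypothetical, chosen only to reproduce the symptom described above.

```python
import json
import tempfile
from pathlib import Path


def create_index(path: Path) -> dict:
    """Start a fresh, empty index, discarding whatever was stored at `path`."""
    path.write_text(json.dumps({}))
    return {}


def open_or_create_index(path: Path) -> dict:
    """Reuse the stored index if one exists; otherwise start a fresh one."""
    if path.exists():
        return json.loads(path.read_text())
    return create_index(path)


def crawl(start_urls, path: Path, get_index) -> list:
    """Index one placeholder page per start URL and return the indexed hosts."""
    for url in start_urls:
        index = get_index(path)           # assumption: obtained once per start URL
        index[url] = f"content of {url}"  # stand-in for real crawled documents
        path.write_text(json.dumps(index))
    return sorted(json.loads(path.read_text()))


START_URLS = ["b2b.domain.com", "www.domain.com", "test.domain.com"]

with tempfile.TemporaryDirectory() as tmp:
    always_create = crawl(START_URLS, Path(tmp) / "v1.json", create_index)
    open_first = crawl(START_URLS, Path(tmp) / "v2.json", open_or_create_index)

print(always_create)  # ['test.domain.com']
print(open_first)     # ['b2b.domain.com', 'test.domain.com', 'www.domain.com']
```

Under these assumptions, the "always create" variant keeps only the last start URL, while the "open first" variant accumulates all three.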

solverat commented 6 years ago

Fixed with #44.