Closed cbleek closed 6 years ago
All HTML Parsing should be done by solr. I'm sure solr knows how to index HTML. But since we are not knowing, how solr have to be configured to do it right, we have to parse in PHP
Done in https://github.com/yawik/SimpleImport/commit/9d9d8db3d2ac3d40cd0a29ff08a78a4440be887c
Currently the following string gets stored in solr:
Encoding is wrong: example: Sprüngli.