opensemanticsearch / open-semantic-search

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
https://opensemanticsearch.org
GNU General Public License v3.0
957 stars 167 forks source link

Documentation for "advanced configuration"? #116

Open JPMoresmau opened 6 years ago

JPMoresmau commented 6 years ago

Hello, I've tried to install the deb file. I seem to get solr and tika running, but the web server and the extraction fail because they cannot connect to RabbitMQ. I suppose it's not part of the deb file, and I have a running instance, that requires authentication. Where can I set the properties for the RabbitMQ connection (url, username, password)? https://www.opensemanticsearch.org/doc/admin/config/queue does not have any information on that (or how to disable RabbitMQ usage altogether) Also I have a running SOLR server I'd like to reuse. What core do I need to create with what configuration? It seems there is some documentation missing for all of this. I also add to install a lot of python packages manually or the extractor would fail. Unfortunately using the VM was not an option for me for various reason, so a step by step guide to the installation and configuration of dependencies would be welcome.

YoannMR commented 6 years ago

Hi,

I transferred an existing SOLR core to OSS into /var/solr/data/opensemanticsearch/ (following https://lucene.apache.org/solr/guide/6_6/making-and-restoring-backups.html).

Note that this allowed me to search through my documents but does not enable using the visualization graph (I think it needs to re-index running the NER part of the ETL to fill out the second SOLR core /var/solr/data/opensemanticsearch-entities/, I opened an issue for that).

On what type of machine are you trying to install OSS? I used a debian EC2 instance with the latest deb package (open-semantic-search_18.05.21.deb) following instructions at https://www.opensemanticsearch.org/doc/admin/install/search_server and it worked. Note that I was installing it over an existing OSS version so I might have dealt with the missing packages in the past.

Hope that helps! Yoann

opensemanticsearch commented 5 years ago

After finishing implementations for some projects, the documentation will be extended next weeks with more infos about architecture, older features and new features like taxonomies.