opensemanticsearch / open-semantic-search

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
https://opensemanticsearch.org
GNU General Public License v3.0

Does opensemanticsearch support Chinese grammar? #423

Open zhangyilibecnu opened 2 years ago

zhangyilibecnu commented 2 years ago

I'm from China. Open Semantic Search is especially suitable for my project, but I can't find Chinese grammar support and can't OCR Chinese content.

ingenika commented 2 years ago

You need to download the Chinese trained data for Tesseract (a file such as chi_sim.traineddata) and add it to your tessdata folder.
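
If it helps, here is a minimal Python sketch of the "add it to your tessdata folder" step. It assumes you have already downloaded chi_sim.traineddata and that you know where your tessdata folder is. That location varies by system (it is often whatever the TESSDATA_PREFIX environment variable points to), so it is passed in as a parameter instead of being guessed. The function names install_traineddata and installed_languages are hypothetical helpers for illustration, not part of Open Semantic Search or Tesseract.

```python
import shutil
from pathlib import Path


def install_traineddata(src_file, tessdata_dir):
    """Copy a downloaded .traineddata file into the given tessdata folder.

    Both arguments are supplied by the caller; the tessdata location
    differs between installations and is an assumption of this sketch.
    """
    src = Path(src_file)
    if src.suffix != ".traineddata":
        raise ValueError(f"not a traineddata file: {src.name}")
    dest_dir = Path(tessdata_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / src.name
    shutil.copy2(src, dest)  # keep the original file name, e.g. chi_sim.traineddata
    return dest


def installed_languages(tessdata_dir):
    """List language codes that have a .traineddata file in the folder."""
    return sorted(p.stem for p in Path(tessdata_dir).glob("*.traineddata"))
```

After copying, installed_languages(your_tessdata_dir) should include "chi_sim". Writing to a system-wide tessdata folder may need administrator rights. This sketch does not cover how Open Semantic Search is told which OCR languages to use.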