Library for document analysis (segmentation, tokenization, normalization, aggregation). Its goal is to produce a set of items that can be inserted into a strus storage. It also provides functions for analyzing tokens or phrases of a strus query.
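The four analysis stages named above can be illustrated with a small, self-contained sketch. It does not use the strus API: `Term`, `AnalyzedDocument`, `segmentTsv`, `tokenize`, `normalize`, and `analyze` are hypothetical names invented for this example, and the tab-separated input, whitespace tokenizer, and ASCII lowercasing are assumptions chosen only to keep the example short.

```cpp
#include <algorithm>
#include <cctype>
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical item type for illustration; not part of the strus API.
struct Term {
    std::string type;   // name of the segment (here: TSV column) the token came from
    std::string value;  // normalized token
    std::size_t pos;    // 1-based ordinal position of the token in the record
};

// Hypothetical analysis result: the terms plus one aggregated value.
struct AnalyzedDocument {
    std::vector<Term> terms;
    std::size_t tokenCount = 0;  // aggregation: total number of tokens
};

// Segmentation: split one TSV record into its tab-separated fields.
std::vector<std::string> segmentTsv(const std::string& line) {
    std::vector<std::string> fields;
    std::string::size_type start = 0;
    for (;;) {
        const std::string::size_type end = line.find('\t', start);
        if (end == std::string::npos) {
            fields.push_back(line.substr(start));
            break;
        }
        fields.push_back(line.substr(start, end - start));
        start = end + 1;
    }
    return fields;
}

// Tokenization: split a segment into whitespace-delimited tokens.
std::vector<std::string> tokenize(const std::string& segment) {
    std::istringstream in(segment);
    std::vector<std::string> tokens;
    std::string token;
    while (in >> token) {
        tokens.push_back(token);
    }
    return tokens;
}

// Normalization: lowercase ASCII letters.
std::string normalize(std::string token) {
    std::transform(token.begin(), token.end(), token.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    return token;
}

// Run the stages in order and collect items that could be handed to a storage.
AnalyzedDocument analyze(const std::string& line) {
    AnalyzedDocument doc;
    const std::vector<std::string> fields = segmentTsv(line);
    std::size_t pos = 0;
    for (std::size_t i = 0; i < fields.size(); ++i) {
        const std::string type = "f" + std::to_string(i);  // column index as type name
        for (const std::string& token : tokenize(fields[i])) {
            doc.terms.push_back(Term{type, normalize(token), ++pos});
        }
    }
    doc.tokenCount = pos;
    return doc;
}
```

Calling `analyze("Hello World\tStrus Storage")` in this sketch yields four terms, two of type `f0` and two of type `f1`, with lowercased values and consecutive positions.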
Adding a TSV segmenter requires extending src/analyzer/libstrus_analyzer_objbuild.cpp in strusAnalyzer and src/analyzerObjectBuilder.cpp in strusModule.