Library for document analysis (segmentation, tokenization, normalization, aggregation) with the goal to get a set of items that can be inserted into a strus storage. Also some functions for analysing tokens or phrases of the strus query are provided.
This leads to the question how unknown or illegal dates should be handled: Either have an unknown value in the metadata or map it to a default value.