weaviate / contextionary

Weaviate's own language vectorizer, which allows for semantic context-based searches in Weaviate
https://weaviate.io/developers/weaviate/modules/retriever-vectorizer-modules/text2vec-contextionary
BSD 3-Clause "New" or "Revised" License
14 stars 2 forks source link

Feature/compound splitting #36

Closed fefi42 closed 4 years ago

fefi42 commented 4 years ago

Added compound splitting check to the contextionary. Added extra preprocessing step to the pipeline to create a dict file used by the splitter.