weaviate / contextionary

Weaviate's own language vectorizer, which allows for semantic context-based searches in Weaviate
https://weaviate.io/developers/weaviate/modules/retriever-vectorizer-modules/text2vec-contextionary
BSD 3-Clause "New" or "Revised" License

Introduce max word length #38

Closed fefi42 closed 4 years ago

fefi42 commented 4 years ago

Limit the maximum word length to 100 characters to prevent long-running splits.

closes #37
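
For illustration, the guard described above could look roughly like the following. This is a minimal Python sketch and not the code from this PR: the names `MAX_WORD_LENGTH`, `split_compound`, and `_split` are hypothetical, and the naive recursive splitter only stands in for whatever splitting routine the project actually uses. Only the 100-character limit comes from the description.

```python
MAX_WORD_LENGTH = 100  # limit stated in the PR description


def split_compound(word, dictionary):
    """Return one way to split `word` into dictionary words, or None.

    Hypothetical helper: words longer than MAX_WORD_LENGTH are rejected
    up front so the potentially expensive search never starts.
    """
    if len(word) > MAX_WORD_LENGTH:
        return None
    return _split(word, dictionary)


def _split(word, dictionary):
    # Naive backtracking search (an assumption, used only as a stand-in):
    # try every prefix that is a known word, then split the remainder.
    if word == "":
        return []
    for i in range(1, len(word) + 1):
        head = word[:i]
        if head in dictionary:
            rest = _split(word[i:], dictionary)
            if rest is not None:
                return [head] + rest
    return None
```

Checking the length before the search keeps the cost of rejecting an oversized word constant, regardless of how the splitter behaves on long input. A word of exactly 100 characters is still processed, while one of 101 characters is skipped, which is the boundary that tests would want to cover.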

etiennedi commented 4 years ago

Thanks, great idea adding the tests below and above the boundary and even measuring the time. I'll merge.