infiniflow / infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
https://infiniflow.org
Apache License 2.0
2.68k stars 275 forks source link

Fix NLTK tokenizer within RAG analyzer #2228

Closed yingfeng closed 1 week ago

yingfeng commented 1 week ago

What problem does this PR solve?

Issue link:#2226

Type of change