meilisearch/charabia
Library used by Meilisearch to tokenize queries and documents
MIT License
246 stars, 86 forks
issues
Add Turkish normalizer
#305
tkhshtsh0917
closed
1 week ago
1
Persian language support for normalization and segmentation
#304
Ja7ad
opened
3 weeks ago
6
feat: Adds German compound words decomposition with new segmenter
#303
luflow
opened
4 weeks ago
7
Update version for the next release (v0.9.0) in Cargo.toml files
#302
meili-bot
closed
1 month ago
0
Add math symbols to default separators
#301
phillitrOSU
closed
1 month ago
1
Add Math symbols in the default separator list
#300
ManyTheFish
closed
1 month ago
0
Simplify lang detection
#299
ManyTheFish
closed
1 month ago
1
update internal dependencies for release
#298
irevoire
closed
2 months ago
1
Update dependencies
#297
irevoire
closed
2 months ago
1
Normalizer for russian
#296
aignatovich
opened
2 months ago
8
Add null byte as hard context separator
#295
LukasKalbertodt
closed
2 months ago
2
Normalization Issue for Turkish Characters in Charabia
#294
niyazialpay
closed
1 week ago
3
Update version for the next release (v0.8.11) in Cargo.toml files
#293
meili-bot
closed
3 months ago
2
Upgrade Lindera to 0.31.0
#292
mosuka
closed
3 months ago
3
fix: fixed `chinese-normalization-pinyin` feature test failed
#291
tkhshtsh0917
closed
3 months ago
3
The `chinese-normalization-pinyin` feature flag doesn't compile
#290
ManyTheFish
closed
3 months ago
6
latin-camelcase feature make wrong segmentation
#289
hamano
opened
4 months ago
11
Update version for the next release (v0.8.10) in Cargo.toml files
#288
meili-bot
closed
4 months ago
0
Add swedish recomposition normalizer and link it to a feature
#287
ManyTheFish
closed
4 months ago
1
Update bors.toml with missing tests
#286
curquiza
closed
4 months ago
1
Rework Chinese Pinyin normalizer
#285
ManyTheFish
opened
4 months ago
0
Update README.md
#284
ManyTheFish
closed
4 months ago
1
Update version for the next release (v0.8.9) in Cargo.toml files
#283
meili-bot
closed
4 months ago
1
Make the pinyin-normalization optional
#282
ManyTheFish
closed
4 months ago
1
Fix char boundary panic
#281
ManyTheFish
closed
4 months ago
2
Add `\t` as recognized separator
#280
Gusted
closed
4 months ago
1
Update Lindera to 0.30.0
#279
mosuka
closed
4 months ago
1
Adds a new normalizer to normalize œ to oe and æ to ae
#278
Soham1803
closed
3 months ago
10
Update version for the next release (v0.8.8) in Cargo.toml files
#277
meili-bot
closed
5 months ago
4
Tag and release new version?
#276
6543
closed
5 months ago
1
Support markdown formatted codeblocks
#275
6543
closed
5 months ago
3
Bump release-drafter/release-drafter from 5 to 6
#274
dependabot[bot]
closed
6 months ago
1
Update Lindera to 0.28.0
#273
mosuka
closed
6 months ago
1
[Maintenance] Review and amend documentation in files
#272
ManyTheFish
opened
6 months ago
0
Numbers are not segmented the same way depending on the Script/Language
#271
ManyTheFish
opened
6 months ago
11
Vietnamese: Add lacking tests and fix bug
#270
ManyTheFish
closed
6 months ago
2
Update README.md
#269
ManyTheFish
closed
6 months ago
1
Normalize "œ" / "æ" into "oe" / "ae"
#268
ManyTheFish
closed
3 months ago
0
Add vietnamese benchmarks
#267
ManyTheFish
closed
6 months ago
1
Update version for the next release (v0.8.7) in Cargo.toml files
#266
meili-bot
closed
6 months ago
1
Cross-compiling charabia for arm
#265
chiru091096
opened
7 months ago
5
Bump peter-evans/create-pull-request from 5 to 6
#264
dependabot[bot]
closed
7 months ago
1
Bump Swatinem/rust-cache from 2.7.1 to 2.7.3
#263
dependabot[bot]
closed
7 months ago
1
Update dependencies
#262
agourlay
closed
7 months ago
5
Fix unused FstSegmenter warning when not using khmer compiler features
#261
timvisee
closed
7 months ago
3
Compilation warnings when not using default features
#260
timvisee
closed
7 months ago
0
Fix compilation when vietnamese feature is disabled
#259
timvisee
closed
7 months ago
2
Compiler failure without vietnamese feature
#258
timvisee
closed
7 months ago
0
normalize Ð and Đ into d
#257
ngdbao
closed
7 months ago
5
Fix `update-kvariants` CI
#256
choznerol
closed
7 months ago
2