Closed by utuhiro78 3 years ago
These expressions are decoded when a dictionary is built. https://github.com/WorksApplications/Sudachi/blob/c4a363ad1a092892d79e43475aefcb4105d18d64/src/main/java/com/worksap/nlp/sudachi/dictionary/DictionaryBuilder.java#L403
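For illustration, here is a minimal sketch of what build-time decoding of such escapes can look like. It is not a copy of the linked `DictionaryBuilder` code: the class name `UnicodeEscapeSketch` and the method `decode` are hypothetical, and it assumes only the four-hex-digit form of the escape.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Sudachi: replaces backslash-u escapes
// (four hex digits) in a CSV field with the characters they denote.
public class UnicodeEscapeSketch {
    // Matches a literal backslash, the letter u, then exactly four hex digits.
    private static final Pattern ESCAPE = Pattern.compile("\\\\u([0-9a-fA-F]{4})");

    static String decode(String text) {
        Matcher m = ESCAPE.matcher(text);
        StringBuilder out = new StringBuilder();
        int last = 0;
        while (m.find()) {
            // Copy the unescaped text before this match, then the decoded char.
            out.append(text, last, m.start());
            out.append((char) Integer.parseInt(m.group(1), 16));
            last = m.end();
        }
        // Copy whatever follows the final match (or the whole input if none).
        out.append(text, last, text.length());
        return out.toString();
    }

    public static void main(String[] args) {
        // The input holds a literal backslash followed by u002C.
        System.out.println(decode("a\\u002Cb")); // prints a,b
    }
}
```

Under this assumption the CSV files can keep the escaped form on disk, and only the built binary dictionary contains the decoded characters.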
Thanks! I didn't know the dictionary needed to be built.
Hello,
core_lex.csv and notcore_lex.csv contain \u**** escape sequences. I found them with ripgrep on Arch Linux.
Examples.
Are they OK?
Thank you for providing a big dictionary.