Closed: jzohrab closed this issue 1 year ago
This is not possible due to a long-standing issue in MeCab that causes the UniDic config file to take precedence over format options passed as arguments. Your command-line version only works because your config (presumably IPAdic) doesn't specify a default format. I made a PR to fix it six years ago but never received any response.
https://github.com/taku910/mecab/pull/38
However, instead of using MeCab's rather arcane format syntax, I suggest you use fugashi's structured Node objects to build the formatted output yourself. It should be much easier.
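As a rough illustration of that suggestion, here is a minimal sketch that builds tab-separated output from node objects. The helper names `format_nodes` and `tag_and_format` are hypothetical, and the default field names (`lemma`, `pos1`) assume a UniDic-style dictionary such as unidic-lite; other dictionaries may expose different fields.

```python
def format_nodes(nodes, fields=("lemma", "pos1"), sep="\t"):
    """Return one line per node: the surface form, then the chosen feature fields.

    `nodes` is any iterable of objects with `.surface` and `.feature`
    attributes, which is how fugashi nodes are shaped.
    """
    lines = []
    for node in nodes:
        # A missing or None field (e.g. on an unknown word) becomes an empty string.
        values = [getattr(node.feature, name, None) or "" for name in fields]
        lines.append(sep.join([node.surface, *values]))
    return "\n".join(lines)


def tag_and_format(text, fields=("lemma", "pos1")):
    """Tokenize `text` with fugashi and format the result (hypothetical helper)."""
    # Imported lazily so format_nodes stays usable without fugashi installed.
    # Assumes fugashi plus a UniDic dictionary (unidic or unidic-lite) is available.
    from fugashi import Tagger

    tagger = Tagger()
    return format_nodes(tagger(text), fields)
```

Calling `print(tag_and_format("some Japanese text"))` would then print one line per token. Changing the `fields` tuple or `sep` takes the place of a custom node format in dicrc, so nothing in the dictionary directory needs to be edited.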
Hello, thank you very much for your work on this project. I'm using MeCab for a language-learning program, and would like to use this library if possible.
The mecab binary accepts command-line arguments that change its output format. For example:
Is there a way to get the same result with this Python library? I tried a few obvious approaches, e.g.
but this still outputs the same as the default Tagger output:
I edited unidic_lite/dicdir/dicrc:
With that, the output was more or less what I expected (the third column is different, but that doesn't matter):
I also tried with unidic instead of unidic_lite,
and got the default unidic output:
Thank you again!