Open chenjindong opened 4 years ago
@kpu what is the use of this flag https://github.com/kpu/kenlm/blob/bdf3c71a34a874de11ab02f23ebe0a0b877c27ef/lm/build_binary_main.cc#L28
It means you can convert an ARPA from another toolkit without these symbols to binary format. However, there is currently no support to train a model without sentence boundary symbols.
@kpu can you give any leads how can I add support currently I am not familiar with code base.
This is currently not supported though if you want to send a pull request...