kpu / kenlm

KenLM: Faster and Smaller Language Model Queries
http://kheafield.com/code/kenlm/

Input format (should each sentence be newline-delimited?) #172

Closed SachithSriRam closed 6 years ago

SachithSriRam commented 6 years ago

Should the input to lmplz have each sentence on its own line, or can sentences just be delimited by a period ('.')?

kpu commented 6 years ago

One sentence per line. Not all sentences end with a period!
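
As an illustration of the answer above, here is a minimal Python sketch that writes already-segmented sentences out one per line, which is the layout lmplz expects. The function name `to_one_sentence_per_line` and the sample sentences are hypothetical and not part of KenLM. The sketch assumes sentence segmentation has already been done elsewhere; it does not split on periods, since abbreviations contain periods and questions end without one.

```python
def to_one_sentence_per_line(sentences):
    """Return newline-delimited text with exactly one sentence per line.

    Hypothetical helper, not part of KenLM. Assumes `sentences` is an
    iterable of strings that are already segmented into sentences.
    """
    lines = []
    for sentence in sentences:
        # Collapse runs of whitespace, including stray line breaks,
        # so that a sentence never spans more than one line.
        cleaned = " ".join(sentence.split())
        if cleaned:  # skip empty or whitespace-only entries
            lines.append(cleaned)
    return ("\n".join(lines) + "\n") if lines else ""


# Sample input: note that not every sentence ends with a period,
# and periods also appear inside abbreviations.
sentences = [
    "Is this a question?",
    "Dr. Smith arrived at 3 p.m. on Tuesday",
    "A sentence\nwith a stray line break.",
]
corpus = to_one_sentence_per_line(sentences)
print(corpus, end="")
```

The resulting text can then be saved to a file or piped to lmplz on standard input.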