katspaugh / kuromoji-gloss

Automated glossing of Japanese texts based on the Kuromoji tokenizer
https://kuromoji.fluentcards.com

When generating TSV, garbage row is created. #2

Closed by benedictchen 7 years ago

benedictchen commented 7 years ago

I expected 3 rows, but a third row of garbage containing all the words is loaded for some odd reason. Please see the screenshot.

[Screenshot: screen shot 2017-03-06 at 8 48 39 pm]

katspaugh commented 7 years ago

It's not garbage, it's context! 😆

The last column is the sentence in which a word was encountered: https://github.com/katspaugh/kuromoji-gloss/blob/master/src/services/csv.js#L7

If you're an Anki user, you can use this column for cloze tests, for instance. The idea is to produce the same format as here.
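
To illustrate the cloze idea, here is a minimal sketch rather than code from the project. The `toCloze` helper is hypothetical, and it assumes a row layout of word, reading, gloss, sentence, with the word appearing verbatim in the sentence. The actual column order in `csv.js` may differ; only "the context sentence is the last column" comes from the description above. The `{{c1::…}}` markup is Anki's cloze-deletion syntax.

```javascript
// Hypothetical helper, not part of kuromoji-gloss.
// Assumes the first tab-separated field is the word and the last field
// is the sentence in which that word was encountered.
function toCloze(tsvRow) {
  const fields = tsvRow.split("\t");
  const word = fields[0];
  const sentence = fields[fields.length - 1];
  // Wrap the first occurrence of the word in Anki cloze markup.
  // A replacer function avoids special handling of "$" in the replacement.
  return sentence.replace(word, () => `{{c1::${word}}}`);
}

// Example row with made-up data: word, reading, gloss, context sentence.
const row = ["猫", "ねこ", "cat", "猫が好きです。"].join("\t");
console.log(toCloze(row)); // prints "{{c1::猫}}が好きです。"
```

If the word is stored in dictionary form while the sentence contains an inflected form, the plain string match finds nothing and the sentence is returned unchanged.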

benedictchen commented 7 years ago

Oh, haha I see :-).