himselfv / jptools

A set of tools for parsing and studying Japanese

Tags from EDICT entries are merged together #1

Closed himselfv closed 11 years ago

himselfv commented 11 years ago

Original report by me.

Originally reported on Google Code with ID 1

This is a problem with the Wakan dictionary format, which merges together all grammar tags
from a list of entries.

But even when this is fixed in Wakan (by converting to EDICT2-style dictionaries),
we'll have no way of knowing, for the same kana-kanji pair, which translation entry
in EDICT corresponds to which translation entry in Warodai.

So we'll have to skip the cases where grammar flags do not apply to the whole article.

At most we can try to find the common flags, the ones that repeat in every entry, and apply
those to the translation as a whole.
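
A minimal sketch of that "common flags" idea, in Python for illustration only. The function name `common_flags` and the list-of-lists input shape are assumptions made for this example and are not taken from jptools or Wakan; the tags shown are ordinary EDICT-style markers.

```python
def common_flags(entries):
    """Return the flags that appear in every entry.

    `entries` is a hypothetical structure: one list of grammar flags per
    EDICT translation entry for the same kana-kanji pair.
    The order of the first entry is kept, with duplicates dropped.
    """
    if not entries:
        return []
    # Intersect the first entry's flags with all the remaining entries.
    shared = set(entries[0]).intersection(*entries[1:])
    # dict.fromkeys removes duplicates while preserving order.
    return [flag for flag in dict.fromkeys(entries[0]) if flag in shared]


# Three entries for one kana-kanji pair; only "n" and "vs" are in all of them.
entries = [["n", "vs", "uk"], ["n", "vs"], ["n", "adj-no", "vs"]]
print(common_flags(entries))
```

Flags that are missing from any single entry are dropped, so when the entries share nothing the result is empty and the article would get no grammar flags at all.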

Reported by himselfv on 2013-01-04 14:19:48

himselfv commented 11 years ago
Fixed long ago by moving Wakan to EDICT2.

Reported by himselfv on 2013-02-03 11:27:18