loge-gh / jp-tools

Automatically exported from code.google.com/p/jp-tools
1 stars 0 forks source link

Tags from EDICT entries are merged together #1

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
This is a problem of Wakan dictionary format, where it merges together all 
grammar tags from a list of entries.

But even when this is fixed in Wakan (by converting to EDICT2 style 
dictionaries), we'll have no way of knowing, for the same kana-kanji pair which 
translation entry on EDICT relates to which translation entry on Warodai.

So we'll have to skip the cases where grammar flags are not for the whole 
article.

At most we can try to find common flags, the ones repeating in every entry, and 
apply those as common for translation.

Original issue reported on code.google.com by himse...@gmail.com on 4 Jan 2013 at 2:19

GoogleCodeExporter commented 9 years ago
Fixed long ago by moving Wakan to EDICT2

Original comment by himse...@gmail.com on 3 Feb 2013 at 11:27