jmccrae / gwn-scala-api

API for working with GWN formats
Apache License 2.0
10 stars 0 forks source link

Produces WNDB files with flawed offset #7

Open 1313ou opened 5 years ago

1313ou commented 5 years ago

Conversion tool still produces WNDB files with flawed offset as described in issue #162 here : https://github.com/globalwordnet/english-wordnet/issues/162

1313ou commented 5 years ago

My guess is that it somehow mishandles the apostrophe in "Hawaiʻi" at line 46747 Hawaiʻi_Volcanoes_National_Park 0 Hawaii_Volcanoes_National_Park 0 But this is by no means certain as other lemmas do have an apostrophe like"philosopher's stone"

1313ou commented 5 years ago

The merge.py script may be responsible for incomplete/inconsistent data. see issue