Open goodmami opened 21 hours ago
A third and fourth issue for PWN 1.5:
.zip
and an old Mac .bin
file. For the Windows file, the filenames are all-caps and slightly different (e.g., DICT/NOUN.DAT
instead of dict/data.noun
), so we'd need to do some special file-loading. Luckily, there doesn't seem to be any strange encoding or line-ending problems.The 3rd column of the older cili mappings is a confidence score. We should probably parse that and use it for thresholding out low-quality mappings.
For context: https://github.com/goodmami/wn/issues/199
There are two main issues:
older-wn-mappings/
has 3 columns instead of 2verb.Framestext
, so we'd need to hard-code the frames into the script