own-pt / glosstag

Semantically Tagged PWN glosses

Original annotations of adjective satellites use the wrong sense key #18

Closed arademaker closed 4 years ago

arademaker commented 4 years ago
  1. Fix the sense keys in the XML files.
  2. Adjust the XML-to-PLIST/JSON script so that it no longer fixes them during the transformation.
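For context, a minimal sketch of how the affected keys could be detected. It assumes the standard WordNet sense-key layout `lemma%ss_type:lex_filenum:lex_id:head_word:head_id`, where `ss_type` 3 is an adjective and 5 is an adjective satellite, and where only satellites carry a head word. The names `SenseKey`, `parse_sense_key` and `looks_mistyped`, and the sample keys, are hypothetical and not part of this repository.

```python
from typing import NamedTuple


class SenseKey(NamedTuple):
    lemma: str
    ss_type: int      # 3 = adjective, 5 = adjective satellite
    lex_filenum: str
    lex_id: str
    head_word: str    # empty unless the sense is a satellite
    head_id: str


def parse_sense_key(key: str) -> SenseKey:
    """Split a sense key into its fields (hypothetical helper)."""
    lemma, _, rest = key.rpartition("%")
    ss_type, lex_filenum, lex_id, head_word, head_id = rest.split(":")
    return SenseKey(lemma, int(ss_type), lex_filenum, lex_id, head_word, head_id)


def looks_mistyped(key: str) -> bool:
    """True when a key has a head word (satellite) but is typed as a plain adjective."""
    parsed = parse_sense_key(key)
    return parsed.ss_type == 3 and parsed.head_word != ""


# Made-up keys for illustration only.
print(looks_mistyped("foo%3:00:00:bar:00"))  # satellite typed as 3
print(looks_mistyped("foo%3:00:01::"))       # plain adjective, no head word
```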
hmuniz commented 4 years ago

I started a Lisp script to fix this in the ar-new+ukn branch, but then I realized that a simple sed command solves the problem. In the princeton branch, run:

sed -i 's/%\(3\)\(:[0-9][0-9]:[0-9][0-9]:[[:print:]]\+:[0-9][0-9]\)/%5\2/' merged/* 
sed -i 's/%\(3\)\(:[0-9][0-9]:[0-9][0-9]:[[:print:]]\+:[0-9][0-9]\)/%5\2/' standoff/index.*
sed -i 's/%\(3\)\(:[0-9][0-9]:[0-9][0-9]:[[:print:]]\+:[0-9][0-9]\)/%5\2/' standoff/*/*/*

Do you see any problem with that solution? @arademaker
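One way to check the substitution before editing files in place is to try it on a few strings first. The sketch below is a Python approximation of the sed commands above, not the script that was actually run. It differs in two ways, both of them assumptions: the head word may not contain colons, whitespace, quotes or angle brackets (the sed pattern accepts any printable characters, so a match may run across two keys on one line), and every match on a line is replaced rather than only the first. The function name and sample keys are hypothetical.

```python
import re

# %3 followed by lex_filenum, lex_id, a non-empty head word and head_id.
# Plain adjectives end in "::" (empty head word) and therefore do not match.
SATELLITE_TYPED_AS_ADJ = re.compile(
    r'%3(:[0-9]{2}:[0-9]{2}:[^:\s"<>]+:[0-9]{2})'
)


def fix_satellite_keys(text: str) -> str:
    """Rewrite ss_type 3 to 5 for keys that carry a head word."""
    return SATELLITE_TYPED_AS_ADJ.sub(r"%5\g<1>", text)


# Made-up keys for illustration only.
samples = [
    "foo%3:00:00:bar:00",                  # satellite typed as adjective
    "foo%3:00:01::",                       # plain adjective, left alone
    "a%3:00:01:: b%3:00:00:c:00",          # two keys on one line
]
for line in samples:
    print(line, "->", fix_satellite_keys(line))
```

Running the same function over a copy of merged/ and diffing the result against the sed output would show whether the two patterns disagree anywhere in the real data.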

hmuniz commented 4 years ago

I will close this issue. The data from princeton in the merged directory was updated with the sed command from my previous comment.