UniversalDependencies / UD_Irish-IDT

Irish data
Other
6 stars 7 forks source link

Progress on fixing flats #73

Closed kscanne closed 3 years ago

kscanne commented 3 years ago

All my changes to dev file are done, and I made it through sentence 1867 in the train file. @tlynn747: It's probably a good idea to merge this PR before you start your own editing on the second half of the train file since I did some "global" fixes on common phrases like "Dún na nGall", "Fianna Fáil", "Fine Gael", which should all be correct now. Also fixed the adjective "Eorpach" everywhere which is usually mistagged.

tlynn747 commented 3 years ago

Good to see that there weren't too many changes needed wrt to head attachments and most of the cases were covered by the generalised script!

Will work on the rest of the training file this afternoon.

tlynn747 commented 3 years ago

Sentences from 1868 to 3366 (train) have been reviewed for removal of flat and any other named entities that should be flat