mivoq / hunpos

Automatically exported from code.google.com/p/hunpos
11 stars 7 forks source link

hurrá is UTT-INT, not a noun #6

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
hurrá   NOUN<CAS<TRA>>
jaj     UTT-INT
húrrá   NOUN<CAS<TRA>>

What is the expected output? What do you see instead?
hurrá  expected: UTT-INT

What version of the product are you using? On what operating system?
Most current 2009, april 24, linux debian

Please provide any additional information below.
hurrá is not a conjugated noun, but an UTT-INT (indulatszó). 

Original issue reported on code.google.com by krie...@gmx.de on 25 Apr 2009 at 8:13

GoogleCodeExporter commented 9 years ago
Hungarian Hunpos is trained on Szeged Corpus. "Hurrá" can't be found in that 
corpus. What you can do is to 
provide a morphtable to hunpos as it can read in a static possible tags table.

Original comment by hala...@gmail.com on 6 May 2009 at 10:40

GoogleCodeExporter commented 9 years ago
I would provide a morphtable, if I knew its format. Where can I find 
documentation
and/or a sample morphtable?

Original comment by krie...@gmx.de on 6 May 2009 at 12:16