giellalt / shared-smi

Shared Sámi lexical resources
GNU General Public License v3.0

+URL does not get the correct format in hfst-tokenise #6

Closed albbas closed 6 years ago

albbas commented 6 years ago

This issue was created automatically with bugzilla2github

Bugzilla Bug 2474

Date: 2018-05-08T09:32:40+02:00
From: Lene Antonsen
To: Sjur Nørstebø Moshagen
CC: lene.antonsen, linda.wiechetek, sjur.n.moshagen, trond.trosterud

Last updated: 2018-08-30T14:50:54+02:00

albbas commented 6 years ago

Comment 12791

Date: 2018-05-08 09:32:40 +0200 From: Lene Antonsen

+URL does not get the correct format in HFST

echo UiT:s lea http://uit.no čujuhussan | hfst-tokenize --giella-cg --weight-classes=1 ~/main/langs/sme/tools/tokenisers/tokeniser-disamb-gt-desc.pmhfst

"<UiT:s>"
	"UiT" N Prop Sem/Org ACR Sg Loc
	"UiT" N Sem/Org Prop ACR Sg Loc
"<lea>"
	"leat" V IV Ind Prs Sg3
:
"<http://uit.no>"
	"http://uit.no"+URL
:
"<čujuhussan>"
	"čujuhus" N Sem/Plc-abstr Ess
	"čujuhus" N Sem/Plc-abstr Sg Acc PxSg1
	"čujuhus" N Sem/Plc-abstr Sg Gen PxSg1
	"čujuhus" N Sem/Plc-abstr Sg Nom PxSg1
:\n
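The malformed reading above can be spotted mechanically. The following is only a minimal sketch: it assumes that a well-formed reading separates the lemma from its tags with a space (as the other readings in the output do) and that the defect always looks like a quoted lemma immediately followed by `+Tag`. The helper name `find_glued_tags` is hypothetical and not part of HFST or the Giella infrastructure.

```python
import re

# Assumption: the defect is a quoted lemma directly followed by "+Tag",
# e.g. "http://uit.no"+URL, where a space-separated tag would be expected.
GLUED_TAG = re.compile(r'"([^"]+)"\+(\S+)')


def find_glued_tags(cg_output: str) -> list[tuple[str, str]]:
    """Return (lemma, tag) pairs where a tag is glued to the lemma with '+'."""
    hits = []
    for line in cg_output.splitlines():
        stripped = line.strip()
        # Skip cohort lines ("<wordform>") and the ':' separator lines.
        if stripped.startswith('"<') or stripped.startswith(':'):
            continue
        hits.extend(GLUED_TAG.findall(stripped))
    return hits


sample = '\t"leat" V IV Ind Prs Sg3\n:\n\t"http://uit.no"+URL\n'
print(find_glued_tags(sample))
```

Piping the tokeniser output through a check like this would flag the URL reading while leaving ordinary readings such as `"leat" V IV Ind Prs Sg3` alone.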

albbas commented 6 years ago

Comment 12932

Date: 2018-08-11 23:38:23 +0200 From: Trond Trosterud

This is still a problem. +URL was not declared in root.lexc, but declaring it does not help. Where is the source code for this component? As it stands, it breaks the grammar checker.

albbas commented 6 years ago

Comment 12954

Date: 2018-08-30 14:50:54 +0200 From: Sjur Nørstebø Moshagen

Fixed in commits 170172-170177 for all languages.