giellalt / bugzilla-dummy

0 stars 0 forks source link

Recognise obfuscated e-mail addresses (Bugzilla Bug 2618) #1766

Closed albbas closed 5 years ago

albbas commented 5 years ago

This issue was created automatically with bugzilla2github

Bugzilla Bug 2618

Date: 2019-10-10T14:04:20+02:00 From: Børre Gaup <> To: Sjur Nørstebø Moshagen <> CC: linda.wiechetek, sjur.n.moshagen, thomas.omma, trond.trosterud, unhammer+apertium

Last updated: 2019-10-22T12:30:14+02:00

albbas commented 5 years ago

Comment 13740

Date: 2019-10-10 14:04:20 +0200 From: Børre Gaup <>

snf(at)trollnet.no

albbas commented 5 years ago

Comment 13741

Date: 2019-10-10 14:15:27 +0200 From: Sjur Nørstebø Moshagen <>

Finst det andre mønster vi bør fanga opp?

albbas commented 5 years ago

Comment 13742

Date: 2019-10-10 14:25:10 +0200 From: Sjur Nørstebø Moshagen <>

No har eg lagt inn desse variantane:

(at) domain ; ! obfuscation .at. domain ; ! obfuscation % at% domain ; ! obfuscation

Trengst det fleire?

albbas commented 5 years ago

Comment 13743

Date: 2019-10-10 21:40:11 +0200 From: Børre Gaup <>

(In reply to Sjur Nørstebø Moshagen from comment #2)

No har eg lagt inn desse variantane:

(at) domain ; ! obfuscation .at. domain ; ! obfuscation % at% domain ; ! obfuscation

Trengst det fleire?

Har ikke sett annet enn det første mønsteret i korpuset vårt så langt.

albbas commented 5 years ago

Comment 13764

Date: 2019-10-22 12:30:14 +0200 From: Sjur Nørstebø Moshagen <>

Det funkar:

$ echo 'snf(at)trollnet.no' | hfst-tokenise -g tools/tokenisers/tokeniser-gramcheck-gt-desc.pmhfst "<snf(at)trollnet.no>" "snf(at)trollnet.no" URL :\n

Og:

$ echo 'snf.at.trollnet.no' | hfst-tokenise -g tools/tokenisers/tokeniser-gramcheck-gt-desc.pmhfst "" "snf.at.trollnet.no" URL :\n

Eg avsluttar.