Closed AurielleP closed 4 years ago
are Sk
and Mah
i18n abbreviations?
no - they are just street address part abbrev
mah. - mahallesi (district)
mh. - mahallesi (district)
blv. - bulvarı (boulevard)
cad. - caddesi (road)
cd. - caddesi (road)
sk. - sokak (alley)
ap. - apartmanı (apartment)
kat - floor. Kat 1 is 2nd floor by American method
they were tagged fine - i just excluded them to reduce the example output to only the part that was a bug (the slash parsing)
hey, this appears fixed now, in 13.2.0
const doc = nlp(`Ramazanoğlu Mah. Mahsus Sk. No:1 Pendik / İSTANBUL / TÜRKİYE`)
console.log(doc.terms().out('array'))
[ 'Ramazanoğlu',
'Mah.',
'Mahsus',
'Sk.',
'No:1',
'Pendik',
'/',
'İSTANBUL',
'/',
'TÜRKİYE' ]
maybe we should try to join lonsesome slashes in the future, but for now I think that's the desired behaviour let me know if i'm wrong cheers