OSMLatvija / Osmalyzer

Parsing OSM data in Latvia against various data sources
https://osmlatvija.github.io/Osmalyzer/
GNU General Public License v3.0
2 stars 1 forks source link

Check names of streets in other languages #44

Open markalex2209 opened 5 months ago

markalex2209 commented 5 months ago

Check that the names of streets in other languages are transliterations and not translations, to prevent situations like Brīvības iela -> Freedom street or улица Свободы.

Additionally, same check might be enacted for other objects and tags, not only streets and names.

HellMapGoesCoding commented 5 months ago

See https://osmlatvija.github.io/Osmalyzer/Improper%20translations%20report.html ; main info on Zulip

markalex2209 commented 4 months ago

Regarding

other names?

I think following might be checked similarly to already implemented approach: place=*, boundary = administrative, railway = station. AFAIK, those are expected to be transliterated and not translated too.

One more candidate is route=road, but it might result in a lot of false positives (or ballooning up translation dictionary) due to presence of general terms, that are not supposed to be translated. For example, European route, border with + name of the country, or coded reference like E22

HellMapGoesCoding commented 4 months ago

I wonder, are there actually that many Russian (or other language) names that it is worth the effort? Even street names didn't turn out to be that many. Might be worth just checking what sort of Russian names even exist.

markalex2209 commented 4 months ago

For suggested: around 570 objects.

route=road adds 95 more.

HellMapGoesCoding commented 4 months ago

Looking at the objects, I guess that sounds reasonable then.