whosonfirst-data / whosonfirst-data-admin-fr

Whos On First admin data for FR, homepage: https://whosonfirst.org
Other
0 stars 7 forks source link

Issue with the spelling of a French town - La Queue-lez-Yvelines #59

Closed missinglink closed 2 years ago

missinglink commented 2 years ago

As originally reported in https://github.com/pelias/pelias/issues/918, the user reported that the locals spell the place name using the lez form in place of les.

I did a little digging and wasn't able to confirm that this is an official name change (although I could be wrong or it may be very recent).

Either way, I believe this is a valid alias considering the town hall uses the 'z' version: http://www.la-queue-lez-yvelines.fr/

stepps00 commented 2 years ago

Thanks @missinglink

According to Wikipedia, the "official" spelling (what WOF would use in preferred name properties) is "La Queue-les-Yvelines": https://en.wikipedia.org/wiki/La_Queue-les-Yvelines

WOF does maintain the "La Queue-lez-Yvelines" name as a French variant.. I think we're good here, but let me know if there is something I'm missing. Happy to add additional variants where needed.

missinglink commented 2 years ago

Agh you're right sorry I missed that it's in the variant names.

We recently decided to stop importing variant names because they contain a lot of colloquialisms (eg. Thebigapple Bigapple for NYC, Fresco for SF etc) which increased recall but at the expense of causing precision issues in Pelias, I also noticed this comment and felt like we didn't want to bring in spelling mistakes from the variant field due to lifecycle rules.

In this case the variant is a legitimate alias we'd like to import so I'm not sure how best to qualify which names from those fields are desirable and which aren't.

stepps00 commented 2 years ago

Yeah, for the most part, the variant names are a "grab bag" of names that are sometimes inclusive of colloquial terms. Longer term, we can discuss maintaining the colloquial terms in just the _x_colloquial properties.. but I think in this case there is no data work to do, so I'll close.