https://github.com/dbpedia/extraction-framework/issues/314 proposes to collapse raw prop name "propNblah" where N is a digit to "prop", just like it does for "propN" (i.e. to consider "Nblah" a parasitic suffix like "N" is considered).
@jcsahnwaldt objects "I think there are some properties that contain a digit somewhere in the middle of their name".
https://github.com/dbpedia/extraction-framework/issues/314 proposes to collapse raw prop name "propNblah" where N is a digit to "prop", just like it does for "propN" (i.e. to consider "Nblah" a parasitic suffix like "N" is considered). @jcsahnwaldt objects "I think there are some properties that contain a digit somewhere in the middle of their name".
So we need to investigate this. Look at http://wiki.dbpedia.org/Downloads2014 sec "Raw Infobox Property Definitions". We get eg:
There certainly are some curiosities, eg
Or this, which quizzically just about makes some sense :-)
On the other hand, there are some legit cases. Eg these list rider attributes for 2 classes of motorbikes:
So I think we should precise the "parasitic suffix" rule like this: "digits followed by a single letter".
Looking at the result, a lot are good candidates for collapsing. But there are also imaginatively named props like this (who does that?):
What do you think?