101companies / 101repo

101companies contributions
http://101companies.org
MIT License

Bring up 101meta coverage for suffixes #13

Open rlaemmel opened 12 years ago

rlaemmel commented 12 years ago

Hi Martin,

as we discussed, please try to bring up 101meta coverage for suffixes.

Here is all the information needed.

Have a look at the suffixes101meta module [1]. The README.md describes how to use the corresponding dump [2], possibly including citations [3]. I went out of my way to also describe the methodology for using such suffix information, so please have a look. Then look at the dump [2], specifically at -> suffixes -> unmatched, and see whether you can resolve some of the most popular unmatched suffixes, perhaps thereby also automagically bringing up our technology and language coverage.

Again, let's not try to fight down the list of unmatched suffixes completely. If you do the 10-20 most popular ones that's great.
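A minimal sketch of how the most popular unmatched suffixes could be pulled out of the dump. The layout is an assumption inferred only from the "-> suffixes -> unmatched" path above: it presumes that `unmatched` maps each suffix to a count. The function name `top_unmatched` and the sample data are hypothetical; the real dump [2] may be shaped differently.

```python
import json
from collections import Counter


def top_unmatched(dump, n=20):
    """Return the n most frequent unmatched suffixes as (suffix, count) pairs.

    Assumed layout (not confirmed by the dump itself):
    {"suffixes": {"unmatched": {"<suffix>": <count>, ...}}}
    """
    unmatched = dump.get("suffixes", {}).get("unmatched", {})
    return Counter(unmatched).most_common(n)


# Hypothetical sample standing in for the real suffixes.json dump.
sample = json.loads(
    '{"suffixes": {"unmatched": '
    '{".pref": 12, ".ini": 30, ".properties": 45, ".foo": 1}}}'
)

for suffix, count in top_unmatched(sample, n=3):
    print(suffix, count)
```

With the real dump, the `sample` value would be replaced by the parsed content of suffixes.json, and `n` set somewhere between 10 and 20.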

As a side note, if you end up liking the "citation" idea [3], perhaps you could also add some citations to existing rules so that we better exercise this emerging best practice.

Thanks, Ralf

[1] https://github.com/101companies/101worker/tree/master/modules/suffixes101meta

[2] http://black42.uni-koblenz.de/production/101worker/dumps/suffixes.json

[3] http://101companies.org/index.php/Language:101meta#Citations_for_metadata_rules

rlaemmel commented 12 years ago

We discussed .pref, .properties, and .ini files. We take the view that these files share a broader format based on key-value pairs. We call the corresponding language "Preferences". The corresponding wiki page could link to a few resources such as: http://www.nodevice.com/extensions/pref.html
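To illustrate the shared key-value core, here is a rough sketch of one reader that accepts both .properties-style and .ini-style text. It is deliberately simplified: section headers are dropped, and line continuations and escapes are not handled. The function name `parse_key_values` and the sample inputs are hypothetical.

```python
import re


def parse_key_values(text):
    """Collect key/value pairs, skipping blanks, comments, and INI sections."""
    pairs = {}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip empty lines and comment lines ('#' and '!' in .properties, ';' in .ini).
        if not line or line[0] in "#;!":
            continue
        # Skip INI section headers such as "[app]" in this simplified view.
        if line.startswith("[") and line.endswith("]"):
            continue
        # Split at the first '=' or ':' and trim surrounding whitespace.
        parts = re.split(r"\s*[=:]\s*", line, maxsplit=1)
        key = parts[0]
        value = parts[1] if len(parts) > 1 else ""
        pairs[key] = value
    return pairs


properties_text = "# comment\napp.name = Demo\napp.lang: en\n"
ini_text = "; comment\n[app]\nname = Demo\nlang = en\n"

print(parse_key_values(properties_text))
print(parse_key_values(ini_text))
```

Both inputs reduce to a flat dictionary, which is the sense in which the formats could be treated as one language for 101meta purposes.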

martinleinberger commented 12 years ago

Maybe "Preferences" isn't the right word for those languages. Please take a short look at http://en.wikipedia.org/wiki/.properties and http://en.wikipedia.org/wiki/INI_files#Example . I feel like separating them doesn't make much sense, although .properties files are sometimes used differently than .ini files (e.g. for localization).

rlaemmel commented 12 years ago

Agreed, better names for the language may be these then:

Now, I agree that it may be sometimes hard or generally not so interesting to separate these cases.

My favorite is "Settings".
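As a sketch of what the resulting rules might look like, the snippet below generates one rule per suffix, all assigning the language "Settings". The rule shape (a `suffix` constraint paired with a `metadata` unit holding a `language` key) is an assumption about the 101meta format and should be checked against the 101meta documentation; the helper name `settings_rules` is hypothetical.

```python
import json


def settings_rules(suffixes, language="Settings"):
    """Build one rule per suffix; the key names are assumed, not confirmed."""
    return [
        {"suffix": suffix, "metadata": {"language": language}}
        for suffix in suffixes
    ]


rules = settings_rules([".pref", ".properties", ".ini"])
print(json.dumps(rules, indent=2))
```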

rlaemmel commented 12 years ago

The suffix statistics aren't being generated. This needs to be re-enabled.