nicolas-raoul / Wikipedia-Reliable-Sources

Configuration for WRS, the search engine that aims to only return results from reliable websites
https://en.wikipedia.org/wiki/User:Syced/Wikipedia_Reference_Search
MIT License
4 stars 3 forks source link

Sort the list #4

Open nicolas-raoul opened 5 years ago

nicolas-raoul commented 5 years ago

The exclusions should come right after the site they are about, for instance:

*.livius.org/*  _cse_galkqgoksq0
*.microsoft.com/*   _cse_galkqgoksq0
*.microsoft.com/en-us/store/*   _cse_exclude_galkqgoksq0
*.nfl.com/* _cse_galkqgoksq0

In general, it could be a simple alphabetic sort, or something more subtle, domain-based:

*.spectator.co.uk/* _cse_galkqgoksq0
*.literature.tufts.edu/*    _cse_galkqgoksq0
*.science.tufts.edu/*   _cse_galkqgoksq0
*.jobs.science.tufts.edu/*  _cse_exclude_galkqgoksq0
*.umich.edu/*   _cse_galkqgoksq0

A Python or bash script would be ideal, but a pull request that sorts the list manually would be accepted too if it does not add/remove/modify any line at the same time.

nicolas-raoul commented 2 weeks ago

(the same applies to the new XML format)