MichaelAquilina / Reddit-Recommender-Bot

Indentifying Interesting Documents for Reddit using Recommender Techniques
7 stars 0 forks source link

Check out `normalize-incoming-links` branch #108

Closed MichaelAquilina closed 10 years ago

MichaelAquilina commented 10 years ago

The methods employed basically penalise pages with a large number of global incoming links but a small number or incoming links in its results. The idea is to prevent large pages from being consistently returned in the results.

Other problems which will be found include extremely large pages being consistently returned as part of the get_documents() results due to the fact that Counter will always be high for large documents like "New Feauters in Windows XP"

MichaelAquilina commented 10 years ago

Does not provide any significant improvement and increases code complexity. Closing this as irrelevant.