The methods employed penalise pages that have a large number of global incoming links but only a small number of incoming links from within the result set. The idea is to prevent large pages from being consistently returned in the results.
Another problem is that extremely large pages are consistently returned in the get_documents() results, because Counter will always be high for large documents such as "New Features in Windows XP".
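The penalty described above could be sketched roughly as follows. This is only an illustration, not the actual scoring code: the function name penalised_ranking, its parameters, and the formula (local count multiplied by the local-to-global ratio) are all assumptions made for the example, and collections.Counter stands in for whatever Counter refers to in the original code.

```python
from collections import Counter


def penalised_ranking(result_links, global_inlinks, top_n=10):
    """Rank pages by incoming links from the result set, penalising pages
    whose links come mostly from outside it.

    result_links   -- iterable of (source, target) pairs found in the results
    global_inlinks -- dict mapping page -> incoming links across the whole corpus
    (Both inputs and the scoring formula are hypothetical.)
    """
    # Incoming links per page, counted only within the result set.
    local = Counter(target for _source, target in result_links)

    scores = {}
    for page, local_count in local.items():
        # The global count can never be smaller than the local count.
        global_count = max(global_inlinks.get(page, local_count), local_count)
        # Assumed penalty: scale the local count by the share of the page's
        # links that come from the results. Globally popular pages shrink.
        scores[page] = local_count * (local_count / global_count)

    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda p: (-scores[p], p))[:top_n]


links = [("a", "big"), ("b", "big"), ("c", "big"), ("a", "niche"), ("b", "niche")]
totals = {"big": 1000, "niche": 2}
ranking = penalised_ranking(links, totals)
```

With these made-up numbers, "big" has more incoming links in the results than "niche" (3 against 2), but almost all of its 1000 global links come from elsewhere, so "niche" is ranked first. A raw count would have put "big" on top every time.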