pythonhacker / harvestman-crawler

Automatically exported from code.google.com/p/harvestman-crawler
1 stars 3 forks source link

Depth of any url #34

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Hi,
I've seen in the code that HarvestManUrl doesn't store the depth from the
seed url ( I mean the number of jumps you need to get the url from seed and
it is heavier calculate it visiting all the ascendant urls) and I've store
it in rdepth (I've seen is not used). I attached the patch. If you find it
interesting you can use it.

Original issue reported on code.google.com by sr.migue...@gmail.com on 19 May 2010 at 4:24

Attachments: