Closed GoogleCodeExporter closed 9 years ago
Hello Mansur,
I am also trying to implement broken-link detection in my code. Can you give
me a pointer on how to log the data against a WebURL? Or how do you do it?
Regards
Original comment by w3engine...@gmail.com
on 19 Jan 2012 at 2:38
Hello,
I just save the links that could not be fetched to a text file.
However, there are some subtle issues:
1. A page being fetched may be login protected.
2. A page may be dead, with the request redirected to an error page.
Or do you need my logic?
Regards
Original comment by mansur.u...@gmail.com
on 19 Jan 2012 at 3:23
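For anyone looking for a starting point, the text-file approach described above could be sketched roughly as follows. This is only an illustration using the Java standard library, not code from crawler4j or from this thread: the class BrokenLinkLog, its Verdict enum, and the tab-separated log format are all hypothetical names and choices. It assumes the crawler reports an HTTP status code for each URL, and it keeps login-protected pages (401/403) and redirects (3xx) apart from plainly broken links so they can be reviewed by hand.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Hypothetical helper (not part of crawler4j): logs URLs that were not fetched cleanly. */
class BrokenLinkLog {

    /** Coarse classification of a fetch result, based only on the HTTP status code. */
    enum Verdict { OK, LOGIN_PROTECTED, REDIRECTED, BROKEN }

    private final Path logFile;

    BrokenLinkLog(Path logFile) {
        this.logFile = logFile;
    }

    /** Maps an HTTP status code to a verdict. The thresholds are an assumption of this sketch. */
    static Verdict classify(int statusCode) {
        if (statusCode == 401 || statusCode == 403) {
            return Verdict.LOGIN_PROTECTED; // page exists but needs credentials
        }
        if (statusCode >= 300 && statusCode < 400) {
            return Verdict.REDIRECTED;      // may lead to an error page; review manually
        }
        if (statusCode >= 400) {
            return Verdict.BROKEN;          // remaining 4xx and all 5xx codes
        }
        return Verdict.OK;
    }

    /** Appends one tab-separated line (status, verdict, URL) for every non-OK URL. */
    Verdict record(String url, int statusCode) throws IOException {
        Verdict verdict = classify(statusCode);
        if (verdict != Verdict.OK) {
            String line = statusCode + "\t" + verdict + "\t" + url + System.lineSeparator();
            Files.write(logFile, line.getBytes(StandardCharsets.UTF_8),
                    StandardOpenOption.CREATE, StandardOpenOption.APPEND);
        }
        return verdict;
    }
}
```

One would call record(url, statusCode) from wherever the crawler exposes the status code of each fetched URL. Note that a dead page whose redirect is followed silently and ends in a 200 response on an error page would still be classified as OK here, so that case needs a separate check.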
I'm closing this issue because, with the new status code handling, broken links
can be handled properly. See the example here:
http://code.google.com/p/crawler4j/source/browse/src/test/java/edu/uci/ics/crawler4j/examples/statushandler/
-Yasser
Original comment by ganjisaffar@gmail.com
on 23 Jan 2012 at 12:12
Original issue reported on code.google.com by
mansur.u...@gmail.com
on 16 Jan 2012 at 3:49