We start from a seed site and want to restrict the crawler to following only
links from a list that is specified dynamically. We'd like to pass that list
to WebCrawler via a method such as setAllowedHostnames(), since we don't want
to hard-code the list of sites in shouldVisit(). What's the best way to
accomplish this?
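
To make the request concrete, here is a rough sketch of the kind of helper we have in mind. AllowedHosts and its two methods are hypothetical names and are not part of crawler4j. The list is held in a static field on the assumption that the controller creates the crawler instances itself, so there is no obvious place to call an instance setter.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Collection;
import java.util.Collections;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

/** Hypothetical helper (not part of crawler4j): a host allow-list that can be replaced at runtime. */
final class AllowedHosts {
    // Immutable snapshot, swapped as a whole so crawler threads never see a half-updated list.
    private static volatile Set<String> hosts = Collections.emptySet();

    private AllowedHosts() {}

    /** Replaces the current allow-list; hostnames are compared case-insensitively. */
    static void setAllowedHostnames(Collection<String> hostnames) {
        Set<String> copy = new HashSet<>();
        for (String h : hostnames) {
            copy.add(h.toLowerCase(Locale.ROOT));
        }
        hosts = Collections.unmodifiableSet(copy);
    }

    /** Returns true only if the URL parses and its host is on the current allow-list. */
    static boolean isAllowed(String url) {
        try {
            String host = new URI(url).getHost();
            return host != null && hosts.contains(host.toLowerCase(Locale.ROOT));
        } catch (URISyntaxException e) {
            return false; // unparsable links are never followed
        }
    }
}
```

shouldVisit() would then only need to return AllowedHosts.isAllowed(url.getURL()), assuming WebURL exposes the link as a string that way, and the list could be changed before or during the crawl with AllowedHosts.setAllowedHostnames(...).
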
Original issue reported on code.google.com by eduf...@gmail.com on 6 May 2010 at 1:37