nengine closed this issue 10 years ago
follow_links_like runs its regex against url.path, which does not contain the query string.
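To illustrate why a pattern with a query string cannot match there, here is a minimal sketch using only Ruby's standard URI library. The URL is made up for the example, and the sketch assumes the crawler hands you URI objects; it does not call the crawler itself.

```ruby
require "uri"

# Hypothetical URL, used only for illustration.
url = URI("http://example.com/show.php?id=ABC")

# Pattern that expects a query string after the script name.
pattern = /show\.php\?id=[A-Z]+$/

path_only = url.path  # path component only, without the query string
full_url  = url.to_s  # full URL, including the query string

matches_path = pattern.match?(path_only)
matches_full = pattern.match?(full_url)

puts "path  #{path_only.inspect} matched: #{matches_path}"
puts "full  #{full_url.inspect} matched: #{matches_full}"
puts "query #{url.query.inspect}"
```

The query string lives in url.query, so any pattern that relies on it has to be tested against the full URL (or the query) rather than the path.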
You can use focus_crawl instead and apply your own logic to select the links on the page that you are interested in:
crawler.focus_crawl do |page|
  page.links.select { |url| url.to_s =~ /show\.php\?id=[A-Z]+$/ }.uniq
end
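The selection logic inside the block can be tried on its own, without running a crawl. The following is a rough sketch: interesting_links is a hypothetical helper name, the sample URLs are invented, and it assumes page.links yields URI objects (which the url.to_s call above suggests).

```ruby
require "uri"

# Hypothetical helper (not part of the crawler): keeps only links whose
# full URL, query string included, matches the pattern, without duplicates.
def interesting_links(links, pattern = /show\.php\?id=[A-Z]+$/)
  links.select { |url| url.to_s =~ pattern }.uniq
end

# Invented sample data standing in for page.links.
sample_links = [
  "http://example.com/show.php?id=ABC",
  "http://example.com/show.php?id=ABC", # duplicate, removed by uniq
  "http://example.com/show.php?id=abc", # lowercase id, rejected by [A-Z]+
  "http://example.com/about.php"        # no id parameter, rejected
].map { |s| URI(s) }

selected = interesting_links(sample_links)
puts selected.map(&:to_s)
```

Inside the crawler, the block passed to focus_crawl would then return interesting_links(page.links).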
When I use a regex like the one below, it does not crawl.
However, if I remove the id parameter, it works.
Please let me know whether regexes that match dynamic parameters are supported. Many thanks!