Closed daBee closed 2 years ago
I don't see in the issue description any kind of references to Ferrum-related things at all.
In this particular case, we have Watir that uses Selenium to manage Chrome - so, there is nothing about Ferrum.
The problematic point for this source is crawler protection. To put it simply, when we trying to parse the webpage than that closes itself and prevent any other requests:
Access Denied
You don't have permission to access "http://www.justdial.com/ca/NS-Halifax/Grocery-Stores" on this server.
Reference #...
I'd recommend using API parsing here, for example, here is the link:
https://www.justdial.com/ca/data/result/getdata?uri=result&city=Halifax&search=Grocery-Stores&sortBy=&state=NS&page=page-4&v=9.23
that we can paginate by page=page-4
query param to fetch all needed data.
So, I'll close this one, as no issue here.
Sorry about that.
How did you find that? There's no mention of any API.
@daBee
How did you find that?
getdata
request getdata
requestOK I thought something more was up. That URL failed with the same later, so I think they're counting hits. And this was during testing as well.
I have a browser that is misbehaving.
pagelimit = 5 pagelimit.times do a.send_keys :page_down sleep 6 end
a.close