Closed gsempe closed 2 years ago
As a complementary information, it seems the bug happens very rarely on the first crawl after the chrome launch but all the time on the second one
Does the behavior persist if you use dockerized Chrome?
For example, from this image microbox/chromium-headless:77.0.3844.0
?
I just tried with alpeware/chrome-headless-stable:latest
and the problem does not happen with this dockerized Chrome version.
From my point I consider it as a good workaround. What do you want to do with this issue from the project point of view?
Great! Well, the problem is that Chrome is not very stable and there are specific versions that work well. Here you can read how Puppeteer team solved this problem.
I guess I need to add this to README that not all Chrome versions are equally good for Ferret.
I'm getting this same error with a site. The FQL script is the same as other sites I'm also crawling, the only things that change are the selectors, and I only get the error for this one site.
I'm using the montferret/chromium
image in a docker compose setup. I tried updating it and also using it directly without compose, manually executing the script. I always get the error. Any idea on what to do/check to fix it?
Hmm, is it happening on Google Search page only?
It's not on Google, it's a real estate site.
Are you using the latest version of Ferret?
Could you give me an example of your query?
I get nothing with ferret -version
but I ran go get -u
and I'm in theory using v1.5.0 of cli. ferret is v0.16.3, updated it to try again just in case.
Here a working query: https://marcgonzález.com/p/ferret/immo-properties-index.fql Here basically the same query failing on another site: https://marcgonzález.com/p/ferret/7claus-properties-index.fql
I added SCROLL_BOTTOM to the second query because I thought the problem might be that some elements appear on scroll, but the error persists.
I think the problem is that the 'Next' element you are trying to click on is not visible despite using .visible-xs
selector. Your selector works only when the size of the screen equals mobile phone size.
Bottom line: change .visible-xs
to .hidden-xs
.
I tried modifying it and the error is gone, thank you. I didn't even think you could not click elements with display: none
honestly.
Now I get an operation timed out
error but I guess it's another issue.
Describe the bug The google example does not work anymore with the version 0.10
To Reproduce Steps to reproduce the behavior:
Expected behavior It is expected to have the same behavior.
Screenshots If applicable, add screenshots to help explain your problem.
Desktop (please complete the following information):
Additional context Chrome is launched with the command line: