-
I was debugging some logic I was adding to an OnRequest (stopping a crawl at a timeout) and as I walked through the code I noticed something that felt a little off:
This method is invoked by a go c…
-
![image](https://github.com/gocolly/colly/assets/90125263/deeef4b4-3931-46ed-a5d9-615e281096fd)
when the high performance situation ,the colly chan will be blocked, cause the signal in channel …
-
Looks like there is a difference visiting local files on linux vs windows.
On Linux: html file is visited fine using FileTransport and default
On Windows html file is visited using default - but n…
-
I use this code to get statistic
```
c := colly.NewCollector()
log.Println(c.String())
```
But c.String() return string, I want to get requestCount ,responseCount etc separately, then conbine t…
-
Would be nice to have an OnFinally handler that will be executed after both OnError and OnScraped handlers
-
Hi go-colly team,
Is there any way to scrap website that needs to log in first using github account?
-
问题一:采集显示的链接错误,带“%”的都是错误链接。
```
package dev
import (
"strings"
"megaCrawler/crawlers"
"megaCrawler/extractors"
"github.com/gocolly/colly/v2"
)
func init() {
engine := crawl…
-
after 6 books it shows this error
icf20 updated
4 years ago
-
I use the huge proxies to crawl amazon website, but in onerror function, i can not get the response.Request.Proxy Url because it was not always display. I want to remove all the not working proxies. T…
-
People are using the library for different use cases. Some are using it for better readability of websites, others for migrating content. Knowing the use cases helps to prioritize features and plugins…