gocolly Search Results - Githubissues

300 results
for gocolly

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

gocolly/colly #696

Go coroutines are created but blocked by parallelism logic

I was debugging some logic I was adding to an OnRequest (stopping a crawl at a timeout) and as I walked through the code I noticed something that felt a little off: This method is invoked by a go c…

williamjulianvicary updated 2 years ago
2
gocolly/colly #778

request chan error

![image](https://github.com/gocolly/colly/assets/90125263/deeef4b4-3931-46ed-a5d9-615e281096fd) when the high performance situation ,the colly chan will be blocked, cause the signal in channel …

JBossBC updated 1 year ago
1
gocolly/colly #260

Parsing local files not working on windows

Looks like there is a difference visiting local files on linux vs windows. On Linux: html file is visited fine using FileTransport and default On Windows html file is visited using default - but n…

stigmelling updated 3 years ago
1
gocolly/colly #411

how to get colly.Collector.responseCount and modify colly.Co…

I use this code to get statistic ``` c := colly.NewCollector() log.Println(c.String()) ``` But c.String() return string, I want to get requestCount ,responseCount etc separately, then conbine t…

makelove updated 4 years ago
4
gocolly/colly #305

Add OnFinally support

Would be nice to have an OnFinally handler that will be executed after both OnError and OnScraped handlers

icamys updated 5 years ago
3
gocolly/colly #591

Scape website that need to log in using github

Hi go-colly team, Is there any way to scrap website that needs to log in first using github account?

cikupin updated 3 years ago
2
foxwhite25/megaCrawler #144

问题2

问题一：采集显示的链接错误，带“%”的都是错误链接。 ``` package dev import ( "strings" "megaCrawler/crawlers" "megaCrawler/extractors" "github.com/gocolly/colly/v2" ) func init() { engine := crawl…

Homework-DAD updated 3 weeks ago
3
luckylittle/blinkist-m4a-downloader #2

exit status 1

after 6 books it shows this error

icf20 updated 4 years ago
15
gocolly/colly #708

problem to get the response.Request.ProxyUrl field value bec…

I use the huge proxies to crawl amazon website, but in onerror function, i can not get the response.Request.Proxy Url because it was not always display. I want to remove all the not working proxies. T…

chjf2008 updated 1 year ago
2
JohannesKaufmann/html-to-markdown #63

💬 Who is using it?

People are using the library for different use cases. Some are using it for better readability of websites, others for migrating content. Knowing the use cases helps to prioritize features and plugins…

JohannesKaufmann updated 1 week ago
10

上一页 1...3 4 5 6 7 8 9...30 下一页

300 results for gocolly

300 results
for gocolly