-
Running this code, the program exits immediately with no output and no errors.
But when I remove this code, it runs normally.
```go
package main

import (
	"github.com/gocolly/colly"
	"log"
	"go_spider/bolt_storage…
```
aimuz updated 6 years ago
-
I found a strange case where setting the user agent causes the crawl to not work, while `curl` gets a response from the same site just fine with the same user agent. I provided example code below, which you can run with a…
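One plausible cause (an assumption, since the site and user-agent string in the issue are truncated) is server-side user-agent sniffing: the server varies its response on the `User-Agent` header, so one string is blocked while another succeeds. A minimal stdlib-only sketch reproducing that class of behavior, with a hypothetical blocked UA:

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
)

// newUASniffingServer simulates a site that varies its response on the
// User-Agent header: one hypothetical UA string is rejected, everything
// else gets 200. The UA value is illustrative, not from the issue.
func newUASniffingServer() *httptest.Server {
	return httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if r.Header.Get("User-Agent") == "BadBot/1.0" {
			http.Error(w, "forbidden", http.StatusForbidden)
			return
		}
		fmt.Fprint(w, "ok")
	}))
}

// fetchStatus performs a GET with the given User-Agent and returns the
// HTTP status code, mirroring what curl or a collector would see.
func fetchStatus(rawURL, ua string) int {
	req, err := http.NewRequest("GET", rawURL, nil)
	if err != nil {
		panic(err)
	}
	req.Header.Set("User-Agent", ua)
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		panic(err)
	}
	resp.Body.Close()
	return resp.StatusCode
}

func main() {
	srv := newUASniffingServer()
	defer srv.Close()
	fmt.Println(fetchStatus(srv.URL, "BadBot/1.0")) // blocked UA: 403
	fmt.Println(fetchStatus(srv.URL, "curl/8.0"))   // any other UA: 200
}
```

Comparing the collector's exact request headers against curl's (e.g. via a debugger or a request bin) usually narrows down which header the server is keying on.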
-
Quick question.
I need to scrape documentation from web sites. After I have grabbed the documentation from a site, I need to run jobs that go back every day and fetch only what is new.
Do you…
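Independent of colly, one common way to "only get anything new" on a daily re-crawl is to remember a hash of each page's body and skip pages whose hash is unchanged. A hedged sketch (the store here is in-memory; a real daily job would persist it to a file or database between runs):

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// SeenStore maps URL -> hex-encoded SHA-256 of the last body seen.
// In-memory for the sketch; persist it between runs in practice.
type SeenStore map[string]string

// IsNew reports whether the body at url changed since the last run,
// and records the new hash so the next run can compare against it.
func (s SeenStore) IsNew(url string, body []byte) bool {
	h := sha256.Sum256(body)
	hexHash := hex.EncodeToString(h[:])
	if s[url] == hexHash {
		return false // body unchanged since last crawl: skip
	}
	s[url] = hexHash
	return true
}

func main() {
	seen := SeenStore{}
	fmt.Println(seen.IsNew("https://example.com/docs", []byte("v1"))) // first visit: new
	fmt.Println(seen.IsNew("https://example.com/docs", []byte("v1"))) // unchanged: skip
	fmt.Println(seen.IsNew("https://example.com/docs", []byte("v2"))) // changed: new
}
```

A response callback would call `IsNew` before doing any expensive processing; checking `Last-Modified`/`ETag` headers first can avoid even downloading unchanged pages.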
ghost updated 6 years ago
-
Hi,
According to the [colly documentation](http://go-colly.org/docs/best_practices/distributed/#distributed-scrapers) on distributed scrapers:
```
the best you can do is wrapping the scraper in a server. …
```
-
I am passing information between collectors and setting scraped data using the context. Actually I do something like `ctx.Put("collector.From", X)` and `ctx.Put("object.Name", X)`. This adds some overhead t…
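One way to cut the per-key cost of many string-keyed `Put` calls is to bundle related values into a single typed struct stored under one key, so there is one lookup and one type assertion instead of several. A minimal sketch using a map-based stand-in for a string-keyed scrape context (the real `colly.Context` API is not reproduced here; this only illustrates the idea):

```go
package main

import "fmt"

// Ctx is a stand-in for a string-keyed scrape context: every value costs
// a map lookup plus interface{} boxing and unboxing.
type Ctx map[string]interface{}

func (c Ctx) Put(k string, v interface{}) { c[k] = v }
func (c Ctx) GetAny(k string) interface{} { return c[k] }

// Meta bundles values that would otherwise live under separate keys
// ("collector.From", "object.Name", ...) into one typed struct, so a
// single Put/GetAny and one type assertion cover all of them.
type Meta struct {
	From string
	Name string
}

func main() {
	ctx := Ctx{}
	ctx.Put("meta", Meta{From: "listCollector", Name: "product-42"})

	if m, ok := ctx.GetAny("meta").(Meta); ok {
		fmt.Println(m.From, m.Name) // listCollector product-42
	}
}
```

The struct also gives compile-time field names instead of stringly-typed keys, which tends to matter more than the raw lookup cost.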
-
Hi, thank you for building such a powerful library!
I was testing Colly with various websites, and it seems colon-separated URLs are sometimes ignored, especially on paginated URLs.
For example, …
-
This would essentially replicate the behavior of the old bot, which scrapes Google for GIFs. I know it's not best practice to scrape the web; however, I believe this has the best results for getting GIF…
-
I'm having a slight issue with App Engine and wondering whether I've missed a config step to get colly working correctly. Sorry if this is the wrong place to be posting this.
I created my code …
-
```go
package main

import (
	"fmt"
	"time"

	"github.com/gocolly/colly"
	"github.com/gocolly/colly/debug"
)

func main() {
	c := colly.NewCollector(
		colly.UserAgent("Mozilla/5.0 (Windo…
```
-
I'm relatively new to Go, so while I realize the URL parsing is done using Go's core libraries, I have still found an issue that may be valuable to solve in a crawling project.
If you look a…