colly Search Results - Githubissues

619 results
for colly

Best match


gocolly/colly #689

How to set the "Host" header in the "OnRequest" Callback?

I have setup an OnRequest callback like so: ``` c.OnRequest(func(r *colly.Request) { for header, value := range headers { r.Headers.Set(header, value) } }) ``` One of these headers is the …

ErikOwen updated 2 years ago
1
foxwhite25/megaCrawler #127

Question

# News page # https://www.iema.net/resources/news/ # Problem # 1. The script does not run (see image below) 2. The site's pagination href is "#" (see image below) # Script code # ```go package dev import ( "megaCrawler/crawlers" "megaCrawler/extractors" …

Homework-DAD updated 1 week ago
1
gocolly/colly #4

Feature Request: JavaScript execution

Any plan to add JavaScript engines to this framework. A few projects that might help are https://github.com/robertkrimen/otto https://github.com/lazytiger/go-v8 https://github.com/dop251/goja …

vosmith updated 2 months ago
30
gocolly/colly #759

Proxies are not rotated

I'm using an array of HTTP proxies and setting up the collector as described in the example: ```go c := colly.NewCollector( colly.MaxDepth(cfg.MaxDepth), colly.URLFilters( …

regnull updated 1 year ago
1
metakgp/iqps-go #91

Make the crawler concurrent

**Is your feature request related to a problem? Please describe.** Currently the crawler sequentially fetches each paper's details, parses them, and downloads the paper. This can be made a lot faster using …

rajivharlalka updated 3 weeks ago
14
geziyor/geziyor #48

Is scraping shadow DOM an option?

Hi, I'm trying to scrape [YouTube charts](https://charts.youtube.com/?hl=pt), unsuccessfully, because they use Polymer / shadow DOM. With Geziyor, could I do that? I'm using colly, and they don'…

jtlimo updated 2 years ago
6
gocolly/colly #569

AllowedDomains and host with port

I crawl a website like http://127.0.0.1:8080 with AllowedDomains set to 127.0.0.1:8080, and the crawl does not work. In the code, github.com/gocolly/colly/v2.(*Collector).isDomainAllowed at colly.go:752 if d2 == domain { /…

uyid updated 1 year ago
2
gocolly/colly #254

Cookiejar does not respect several domains

I'm attempting to use colly to write a program that first logs in to a site, and then obtains some information. The site uses 2 different domains to handle authentication, say `www.site.com` and `auth…

jespern updated 5 years ago
1
gocolly/colly #432

cryptocoinmarketcap crawling example doesn't work.

Hi, I'm a newbie and I have a question. First of all, thanks for making and sharing this cooool project! I'm trying to run some examples but the 'cryptocoinmarketcap.go' file doesn't work. I think …

dlwogns0128 updated 3 years ago
1
co0p/x-scrap #8

Bug: wrong counts with multiple urls

single url#1 ``` > go run cmd/xscrap/main.go -urls https://kreuzwerker.de/post/aws-summit-2022-berlin-have-some-pie -tags Elasticsearch url: https://kreuzwerker.de/post/aws-summit-2022-berlin-hav…

glenacota updated 1 year ago
1

