subdomain-crawler Search Results

262 results for subdomain-crawler


Norconex/crawlers #410

ReferenceFilter and stayOnDomain Being Ignored?

I'm attempting to resolve an error I see when doing an initial test crawl, and I'm seeing some strange behavior. First, here are the relevant parts of my config file: https://www.myredact…

dhildreth updated 6 years ago
8
Norconex/crawlers #394

To handle sub domains

I want to crawl pages linked on subdomains, but 'stayOnDomain' doesn't treat them as the same domain. Case: root URL: http://www.some.com, child URL: http://hellosub.some.com/page.html …

popthink updated 6 years ago
5
oasis-tcs/sarif-spec #43

Consider: Remove Dynamic specific properties and terminology

Properties such as threadId are only relevant to dynamic analysis. Should they be first class properties in this format, or left for users to put in catch-all property bags? Also, I think one coul…

DerSaidin updated 6 years ago
2
simplecrawler/simplecrawler #363

Crash on invalid robots.txt redirect

Hi! Thanks for a very useful module. I'm unfortunately experiencing an exception when trying to parse a url where the robots.txt download redirects to an invalid url. The source url is `http://99ra…

ghost updated 7 years ago
9
Norconex/crawlers #365

norconex connector issue

I am trying to set up a norconex connector for a site, and my issue is that the URLs under the div portion are not getting crawled. Attaching the configuration code here: ```xml #set($http…

Navaminavu updated 7 years ago
9
StreisandEffect/discussions #37

Feature idea: nghttpx for HTTP/2 proxy

Hi, I am a big fan of both Ansible and all kinds of VPN / proxy software, so I am thrilled to find such an awesome, well-documented project as Streisand. I am thinking about contributing a …

wzyboy updated 7 years ago
12
scrapinghub/frontera #211

frontera converts keys in scrapy's response meta to bytes

While running the crawler in https://github.com/sibiryakov/frontera-google I get the following error. ``` 2016-10-04 23:38:19 [scrapy] ERROR: Spider error processing (referer: None) Traceback (most rece…

voith updated 7 years ago
5
stacks-network/stacks-core #627

regtest mock insight api requires authentication

The current mock implementation of the regtest insight api requires authentication while our production api doesn't require authentication. Behavior should be the same.

larrysalibra updated 6 years ago
9
USC-CSSL/TACIT #538

Frontiers crawler: missing subdomain

The Research Methods and Analytics subdomain is missing from the engineering section.

joh03067 updated 8 years ago
1
liip/TheA11yMachine #63

Issues on Windows

Hey there, I have multiple issues on Windows. First, a non-Windows-specific issue: `git` is required, as some packages are linked to GitHub repos. Then, `a11ym` behaves strangely on Windows. When I t…

krtek4 updated 7 years ago
10

