-
It refuses to accept the links from the archive.org sitemaps:
20:36:12,023 WARN [crawlercommons.sitemaps.SiteMapParser] (IdxTask) URL: https://archive.org/details/ARCHIVEIT-3490-TWELVE_HOURS-FHIFE…
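In case it helps triage: crawler-commons' SiteMapParser applies strict location checking by default, rejecting URLs that don't live under the sitemap's own location. A rough sketch of that kind of check follows (illustrative Python, not the actual Java implementation; the `example.com` URLs and the helper name are made up):

```python
from urllib.parse import urlparse
import posixpath

def url_allowed_by_sitemap(sitemap_url: str, candidate_url: str) -> bool:
    """Accept candidate_url only if it sits under the sitemap's directory.

    This mimics the strict cross-submission rule many sitemap parsers
    enforce; it is a simplified illustration, not crawler-commons' code.
    """
    sm = urlparse(sitemap_url)
    cand = urlparse(candidate_url)
    # Scheme and host must match exactly.
    if (sm.scheme, sm.netloc) != (cand.scheme, cand.netloc):
        return False
    # The URL's path must start with the directory containing the sitemap.
    base_dir = posixpath.dirname(sm.path)
    return cand.path.startswith(base_dir)

print(url_allowed_by_sitemap("https://example.com/a/sitemap.xml",
                             "https://example.com/a/page.html"))  # True
print(url_allowed_by_sitemap("https://example.com/a/sitemap.xml",
                             "https://example.com/b/page.html"))  # False
```

If the archive.org sitemap lists URLs outside its own path prefix, a check like this would explain the warnings.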
-
Hello, I'm using the dockerized version of goaccess. Static file generation mostly works, but I can't get the Docker image up and running.
My config file contains both the SSL certificate and the SSL key, l…
-
Using OSX 10.10.5, Python 2.7.11:
I ran `make localbuild` as specified in https://github.com/linkcheck/linkchecker/blob/master/doc/development.mdwn, and got the following build progress and warning…
-
My code:
```
import robots from 'robots';
const robotsParser = new robots.RobotsParser();
const url = 'https://google.com/robots.txt';
return new Promise((resolve, reject) =…
-
### Steps to reproduce
After running `rails new rails_playground --webpack` I'm getting this error:
```
Using --database=postgresql from /Users/yurii/.railsrc
create
create …
-
Navigate to: https://web.archive.org/web/http://bugs.chromium.org/p/project-zero/issues/detail?id=1139
See that the Wayback Machine says it's blocked by robots.txt:
![image](https://cloud.githubusercontent.…
-
I apologize if this is a generator-angular issue, but I've gone back and forth with so many fixes it's hard to keep all the wires straight.
```
yo --version && echo $PATH $NODE_PATH && node -e 'conso…
-
The first time we process a robots.txt file, or when we re-process it, we should see if there's a sitemap (or sitemaps). If so, then we could output the sitemap URL(s) as well as the URL being checked…
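That directive scan can be sketched as follows (hypothetical helper name; `Sitemap:` lines are global, case-insensitive, and may appear anywhere in the file, which is also what Python's own `urllib.robotparser` surfaces via `site_maps()` on 3.8+):

```python
def extract_sitemaps(robots_txt: str) -> list[str]:
    """Return every Sitemap URL declared in a robots.txt body."""
    sitemaps = []
    for line in robots_txt.splitlines():
        # Split on the first colon: "Sitemap: <url>".
        key, _, value = line.partition(":")
        if key.strip().lower() == "sitemap" and value.strip():
            sitemaps.append(value.strip())
    return sitemaps

sample = """User-agent: *
Disallow: /private/
Sitemap: https://example.com/sitemap.xml
sitemap: https://example.com/news-sitemap.xml
"""
print(extract_sitemaps(sample))
# ['https://example.com/sitemap.xml', 'https://example.com/news-sitemap.xml']
```

Note that `value.strip()` only recovers the URL up to the first colon split, which works because `partition` splits once and leaves the `https://…` colon intact in the value.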
-
Steps to reproduce:
1. `python setup.py sdist`
2. `virtualenv /tmp/env && /tmp/env/bin/pip install sdist/LinkChecker*`
Expected behavior:
- linkchecker is installed successfully
Actual beha…
-
Hello everyone,
thanks for looking into my issue!
- [x] This is a question about using the theme.
- [ ] This is a feature request.
- [ ] I believe this to be a bug with the theme.
- [x] I hav…