automated-crawler Search Results

791 results for automated-crawler

openactive/status-dashboard #1

Better User Agent and reduce transfer amounts for large book…

Hi all, We've had to unpublish a lot of our feeds as we've been experiencing an issue which I believe is related to this dashboard. Firstly, it doesn't appear to publish a reasonable UA so this is …

nathansalter updated 1 month ago
3
Letractively/aost #391

Research: Add html crawler to automated create UI modules fr…

add crawler to parse html source and create html skeleton, then build UI modules based on the skeleton. Original issue reported on code.google.com by `John.Jian.Fang@gmail.com` on 16 Feb 201…

GoogleCodeExporter updated 8 years ago
2
agrynchuk/noodle-ng #45

Unit Testing

At the moment we do not use any form of automated testing in our code. The basic structure for testing the web application is already there in TurboGears, we should expand that. But I think we …

GoogleCodeExporter updated 9 years ago
1
jsanahuja/InstagramFeed #56

Error Cross-Origin Read Blocking (CORB)

The [CORS fix](https://github.com/jsanahuja/InstagramFeed/issues/55) creates a new problem for me: `Cross-Origin Read Blocking (CORB) blocked cross-origin response https://www.instagram.com/p/CL7rMUKD…

RomainDW updated 3 years ago
26
trincema/ui-testing-poc #13

[Bug] Unreliable mechanism for crawlers/bots prevention

**Bug Type:** Reliability **Test Method:** Automation **Description/Summary** Login is sometimes blocked by a warning message during the execution of automated tests in production. This behavior…

trincema updated 1 year ago
1
iipc/qa2019 #14

Diagnosis of crawl versus replay problems

A common issue is that it is not clear if a problem with a site is due to gaps in the crawl, or replay-time rewriting limitations. It should be possible to use proxy playback mode to evaluate the craw…

anjackson updated 5 years ago
1
webrecorder/archiveweb.page #40

Is there a way to record all tabs simultaneously?

I need to make an archive that requires a login (I wish I could use Pywb but the OneLogin service has issues with it) and need to save a whole bunch of links, so pressing the start button over and ove…

YousufSSyed updated 3 years ago
2
ottofabian/NLP4Web_Project #19

Improve TwitterCrawler

ToDo: - Automate Crawl List: - Implement Data Structure to Map Twitter handles to Domains - Implement method to look for already crawled authors and update only new tweets / crawl new autho…

mrnyc54 updated 6 years ago
2
PostHog/posthog #14004

Have a way to filter out bots that a user identifies

## Is your feature request related to a problem? From a slack convo with a user - it would be nice to have a way for users to place identified bots https://posthogusers.slack.com/archives/C01GLBKH…

fuziontech updated 1 year ago
3
aim42/htmlSanityCheck #219

avoid bad 403 and 405 results

e.g. Amazon always returns 405 upon HEAD requests. We should send a GET after all suspicious error codes (esp. 403 and 405) to get better results.

gernotstarke updated 6 years ago
4
