custom-crawler Search Results

1000+ results
for custom-crawler

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

SHHOOD442/memcached #414

memcached 1.4.24 segfaults

``` What steps will reproduce the problem? 1. SLES 11.3 with slightly patched 3.16 kernel Linux memcached9 3.16.3-4.1.100-default #1 SMP Thu Sep 18 06:32:16 UTC 2014 (d2bbe7f) x86_64 x86_64 x86_64 GN…

GoogleCodeExporter updated 8 years ago
2
scrapy/scrapy #5988

DNSCACHE_ENABLED not respected when in Spider.custom_setting…

### Description It's observed that custom [`DNSCACHE_ENABLED`](https://docs.scrapy.org/en/latest/topics/settings.html#dnscache-enabled) is not respected when specified as part of [`Spider.custom_…

starrify updated 1 year ago
2
continuedev/continue #1453

Analogue of the @Web as in the cursor.sh

### Validations - [X] I believe this is a way to improve. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions - [X] I'm not able to find an [open issue](https://githu…

Agnostion updated 2 months ago
1
kriasoft/react-firebase-starter #133

SEO issue with Facebook/Twitter

So SEO was discussed in issue #101 for the Google crawler, but this is concerned with the Facebook and Twitter crawlers. They both do not interpret JavaScript, which makes it impossible to dynamically…

theplatapi updated 7 years ago
5
johanneszab/TumblThree #175

Parse the tumblr search, tumblr tag search and tumblr like/b…

It should be possible to enhance the current implementations by parsing the results from the crawler into proper html. Right now the crawler only load the whole pages into a large string and extract p…

johanneszab updated 6 years ago
1
DataCrawl-AI/datacrawl #12

Feature: Add a feature to only crawl the given list of urls

- Accept a argument from the user. Something like `url_list` - Crawl only the urls provided by the users as an argument and nothing else.

indrajithi updated 3 months ago
5
gtalarico/revitapidocs #107

Block IP from crawlers

Server is getting slammed with crawlers trying to find stuff + 404 errors Gunicorn Server Hooks http://stackoverflow.com/questions/40951861/how-to-use-variables-created-in-gunicorns-server-hooks …

gtalarico updated 7 years ago
1
awsdocs/aws-doc-sdk-examples #4598

Swift Glue MVP

Implement the following for the Swift SDK. ## Service actions Service actions can either be pulled out as individual functions or can be incorporated into the scenario, but each service action m…

shepazon updated 1 year ago
1
rmusser01/tldw #54

Improvement: Improve URL Scraping/Ingestion

Issue to track improvements/ideas for URL Scraping & Ingestion - [ ] Add custom cookie support - [ ] Instructions for adding custom browser-addons to the scraping browser - [ ] Support for identi…

rmusser01 updated 6 days ago
8
openzim/zim-requests #988

CIA World factbook is incomplete

### ZIM(s) location https://library.kiwix.org/viewer#theworldfactbook_en_all_2023-12/A/www.cia.gov/the-world-factbook/ ### Recipe(s) URL https://farm.openzim.org/recipes/CIAworldfactbook_en_all/edi…

Popolechien updated 3 months ago
6

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for custom-crawler

1000+ results
for custom-crawler