scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
https://scrapy.org
BSD 3-Clause "New" or "Revised" License
51.16k stars
10.35k forks
issues
Install typing stubs for boto3 and botocore.
#6370
wRAR
opened
8 hours ago
1
persist_file() can return a Deferred that is never awaited
#6369
wRAR
opened
9 hours ago
0
Closes #6365. Fix overridable methods in MediaPipeline
#6368
kumar-sanchay
opened
10 hours ago
1
Update expectations of cookies after redirects.
#6367
wRAR
closed
2 days ago
2
Latest allowed_domains behavior breaks middlewares that rewrite urls
#6366
rvandam
opened
3 days ago
4
Fix overridable methods in MediaPipeline
#6365
wRAR
opened
3 days ago
1
Undeprecate and add back to defaults the off-domain spider middleware
#6364
Gallaecio
closed
3 days ago
2
Merge 2.11.2
#6363
Gallaecio
closed
4 days ago
1
Possible improvement: this check depends on the Scrapy response being properly converted to an XMLResponse instance
#6362
DharmeshPandav
opened
4 days ago
3
Remove top-level reactor imports from CrawlerProcess/CrawlerRunner examples
#6361
wRAR
opened
4 days ago
2
Is there any way to add a decorator to the parse function?
#6360
xiaobai987292
closed
4 days ago
5
Release notes for 2.11.2
#6359
Gallaecio
closed
5 days ago
1
Fix the offsite middleware missing some requests
#6358
Gallaecio
closed
5 days ago
1
Allow user-defined secure cookies
#6357
Gallaecio
closed
3 days ago
0
Full typing for scrapy/spiders.
#6356
wRAR
closed
5 days ago
1
Fails to fetch request with hyphens in 3rd and 4th position of domain
#6355
vlln
closed
6 days ago
0
AttributeError: 'SelectReactor' object has no attribute '_handleSignals'
#6354
windY1Y
closed
6 days ago
2
Use ParamSpec for callables.
#6353
wRAR
closed
5 days ago
2
Closes #6340. Deprecate the spider argument to Downloader._get_slot_key()
#6352
kumar-sanchay
closed
1 week ago
4
Document stats produced by Scrapy
#6351
mohmad-null
opened
1 week ago
0
CachingHostnameResolver with CONCURRENT_REQUESTS_PER_IP fails
#6350
mohmad-null
opened
1 week ago
7
Closes #6343. Make certain args of ScrapyAgent and TunnelingAgent required
#6349
kumar-sanchay
closed
1 week ago
6
Closes #6343. Make certain args of ScrapyAgent and TunnelingAgent required
#6348
kumar-sanchay
closed
1 week ago
0
Closes #6342. Setting METAREFRESH_IGNORE_TAGS to ['noscript'] by default
#6347
aisha-partha
closed
1 week ago
1
Closes #6342. Setting METAREFRESH_IGNORE_TAGS to ["noscript"] by default
#6346
aisha-partha
closed
1 week ago
0
SCRAPER_SLOT_MAX_ACTIVE_SIZE - documentations
#6345
mohmad-null
closed
1 week ago
1
Update MANIFEST.in.
#6344
wRAR
closed
1 week ago
1
Make certain args of `ScrapyAgent` and `TunnelingAgent` required
#6343
wRAR
closed
1 week ago
1
Set METAREFRESH_IGNORE_TAGS to ["noscript"] by default
#6342
Gallaecio
closed
1 week ago
3
More typing for scrapy/core/downloader
#6341
wRAR
closed
1 week ago
0
Deprecate the `spider` argument to `Downloader._get_slot_key()`
#6340
wRAR
closed
1 week ago
3
Problem encountered during server deployment
#6339
Hao1617
closed
1 week ago
2
Add Link to BSD-3 License
#6338
jtoallen
closed
4 days ago
4
Full typing for scrapy/linkextractors.
#6337
wRAR
closed
1 week ago
1
Full typing for scrapy/http/cookies.py.
#6336
wRAR
closed
1 week ago
0
Use the Self type hint in from_crawler/from_settings.
#6335
wRAR
closed
1 week ago
1
Closes #6328. Document 'json' selector type
#6334
kumar-sanchay
closed
1 week ago
2
Full typing for scrapy/*.py
#6333
wRAR
closed
1 week ago
1
Add support for multipart/form-data to FormRequest
#6332
alexandresgf
opened
2 weeks ago
4
Provide an addon for Broad Crawls
#6331
kmike
opened
2 weeks ago
2
LinkExtractor changing case of URL (but didn't use to)
#6329
mohmad-null
opened
2 weeks ago
3
'json' selector type not documented
#6328
mohmad-null
closed
1 week ago
3
Issue #6321: Link extractor all tags and attributes option
#6327
PJ1256
opened
2 weeks ago
2
Typing for build_from_*.
#6326
wRAR
closed
2 weeks ago
2
Full typing for scrapy/extensions, part 3.
#6325
wRAR
closed
2 weeks ago
1
issue #6323: add SpiderLoggerAdapter, change Spider.logger to return SpiderLoggerAdapter
#6324
bloodforcream
closed
5 days ago
4
Spider.logger not logging custom extra information
#6323
bloodforcream
closed
5 days ago
0
Remove the auto-generated copyright years from the docs footer.
#6322
wRAR
closed
2 weeks ago
1
Option to include all tags and attrs in LinkExtractor with specified exclusions
#6321
User087
opened
3 weeks ago
6
Scrapy Spider Fails to Process All URLs from CSV on Large URL Sets
#6320
mjid13
closed
4 weeks ago
1