issues
search
scrapy
/
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
https://scrapy.org
BSD 3-Clause "New" or "Revised" License
50.99k
stars
10.34k
forks
source link
issues
Recently updated
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
More typing for scrapy/core/downloader
#6341
wRAR
opened
8 hours ago
0
Add support for multipart/form-data to FormRequest
#6332
alexandresgf
opened
2 days ago
4
Add Link to BSD-3 License
#6338
jtoallen
opened
1 day ago
4
Full typing for scrapy/extensions, part 2.
#6279
wRAR
closed
1 month ago
1
Full typing for scrapy/extensions, part 3.
#6325
wRAR
closed
1 week ago
1
Typing for build_from_*.
#6326
wRAR
closed
1 week ago
2
Deprecate the `spider` argument to `Downloader._get_slot_key()`
#6340
wRAR
opened
16 hours ago
1
Full typing for scrapy/*.py
#6333
wRAR
closed
16 hours ago
1
Use the Self type hint in from_crawler/from_settings.
#6335
wRAR
closed
16 hours ago
1
Full typing for scrapy/linkextractors.
#6337
wRAR
closed
16 hours ago
1
Full typing for scrapy/http/cookies.py.
#6336
wRAR
closed
16 hours ago
0
Option to include all tags and attrs in LinkExtractor with specified exclusions
#6321
User087
opened
1 week ago
6
Issue #6321: Link extractor all tags and attributes option
#6327
PJ1256
opened
4 days ago
2
Closes #6328. Document 'json' selector type
#6334
kumar-sanchay
closed
16 hours ago
2
'json' selector type not documented
#6328
mohmad-null
closed
16 hours ago
3
LinkExtractor changing case of URL (but didn't used to)
#6329
mohmad-null
opened
3 days ago
3
issue #6323: add SpiderLoggerAdapter, change Spider.logger to return SpiderLoggerAdapter
#6324
bloodforcream
opened
1 week ago
2
服务器部署遇到问题
#6339
Hao1617
closed
19 hours ago
2
Remove the auto-generated copyright years from the docs footer.
#6322
wRAR
closed
1 week ago
1
Provide an addon for Broad Crawls
#6331
kmike
opened
3 days ago
0
twisted.python.failure.Failure twisted.internet.error.ConnectionLost: Connection to the other side was lost in a non-clean fashion: Connection lost
#3103
ghost
closed
6 years ago
25
Spider.logger not logging custom extra information
#6323
bloodforcream
opened
1 week ago
0
dupefilter skips a request when a page is redirected to itself
#1225
immerrr
opened
8 years ago
20
Scrapy and Great Expectations: Error - __provides__
#6307
culpgrant
closed
1 week ago
13
feat: add disallowed_domains option to OffsiteMiddleware
#5922
felipecustodio
opened
1 year ago
0
Make the build reproducible
#5019
lamby
closed
1 week ago
4
Warn about br handling if brotlipy is not installed
#4697
Gallaecio
opened
3 years ago
11
Twisted and asyncio
#6219
abebus
closed
1 week ago
7
Adding support for Path objects to APIs that take paths
#5739
wRAR
closed
1 week ago
19
Implement get_import _path
#6225
Gallaecio
opened
2 months ago
3
Edits to media_downloaded in files.py to handle 201 response status (#1615 and #1806)
#6309
zoemyatt
opened
4 weeks ago
3
"Content-Encoding" header gets stripped from response headers
#1988
mborho
opened
7 years ago
14
fix: LxmlLinkExtractor unique_list missing key
#6221
jxlil
closed
2 weeks ago
8
LxmlLinkExtractor unique_list missing key
#3273
nikan1996
closed
2 weeks ago
0
Scrapy Spider Fails to Process All URLs from CSV on Large URL Sets
#6320
mjid13
closed
2 weeks ago
1
Per spider DNS_RESOLVER doesn't work
#6319
synodriver
closed
2 weeks ago
1
fix test expectations
#6316
ghost
closed
2 weeks ago
4
Stringify path
#6318
labrocadabro
closed
2 weeks ago
0
test_get_func_args() expectation changes in new Python point releases
#6312
wRAR
closed
2 weeks ago
6
OpenSSLError 'unexpected eof while reading' openssl
#5835
necronet
closed
1 year ago
7
issue#6305 Replace deprecated ast.NameConstant usage and add tests
#6306
lizdenhup
opened
1 month ago
0
chore: fix some typos in comments
#6317
TechVest
closed
2 weeks ago
0
Media Pipeline is not filtering the duplicate file requests
#6314
Ehsan-U
closed
2 weeks ago
3
Test workflow
#6315
OwenJRJones
closed
2 weeks ago
0
Receiving 403 while using proxy server and a valid user agent
#6313
devfox-se
closed
2 weeks ago
1
Refactor _get_inputs to reduce complexity
#6237
noon-io
closed
3 weeks ago
1
Cookiejars exposed
#6218
GeorgeA92
opened
2 months ago
6
Add
#6311
ioannastantzou
closed
3 weeks ago
0
execution of asyncio.ensure_future(coro()) ignored on close_spider() pipelines call
#6238
abebus
closed
2 months ago
5
Changes for Improve the docs about Crawler initialization changes
#6076
Amik-Sen-Fun
closed
5 months ago
2
Next