issues
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
https://scrapy.org
BSD 3-Clause "New" or "Revised" License
50.9k stars, 10.33k forks
Scrapy Spider Fails to Process All URLs from CSV on Large URL Sets
#6320
mjid13
closed
6 days ago
1
Per spider DNS_RESOLVER doesn't work
#6319
synodriver
closed
1 week ago
1
Stringify path
#6318
labrocadabro
closed
1 week ago
0
chore: fix some typos in comments
#6317
TechVest
closed
1 week ago
0
fix test expectations
#6316
kokobhara
closed
1 week ago
4
Test workflow
#6315
OwenJRJones
closed
1 week ago
0
Media Pipeline is not filtering the duplicate file requests
#6314
Ehsan-U
closed
1 week ago
3
Receiving 403 while using proxy server and a valid user agent
#6313
devfox-se
closed
1 week ago
1
test_get_func_args() expectation changes in new Python point releases
#6312
wRAR
closed
1 week ago
6
Add
#6311
ioannastantzou
closed
1 week ago
0
Fix WrappedRequest.get_header raising TypeError if default is None
#6310
VMRuiz
closed
2 weeks ago
0
Edits to media_downloaded in files.py to handle 201 response status (#1615 and #1806)
#6309
zoemyatt
opened
2 weeks ago
3
Different behavior of `get_header` between urllib.Request and WrappedRequest
#6308
marinelay
closed
2 weeks ago
4
Scrapy and Great Expectations: Error - __provides__
#6307
culpgrant
opened
2 weeks ago
10
issue#6305 Replace deprecated ast.NameConstant usage and add tests
#6306
lizdenhup
opened
3 weeks ago
0
ast.NameConstant is deprecated and will be removed in Python 3.14; use ast.Constant instead
#6305
wRAR
opened
3 weeks ago
1
Not able to use requests inside with scrapy.
#6304
virenramani
closed
3 weeks ago
1
addradon&black
#6303
Sintivrousai
closed
3 weeks ago
1
intergrateradonandblack
#6302
Sintivrousai
closed
3 weeks ago
0
addradonandblack
#6301
Sintivrousai
closed
3 weeks ago
0
addradonandblack
#6300
Sintivrousai
closed
3 weeks ago
0
installationdoc
#6299
Sintivrousai
closed
3 weeks ago
1
Handle robots.txt files not UTF-8 encoded
#6298
lorenzoverardo
closed
3 weeks ago
2
converted string's concat to f-strings
#6296
igeni
closed
3 weeks ago
2
Failed to scrape data from Auction website with Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min) error
#6295
nith-ch
closed
1 month ago
3
Decompressor' object has no attribute 'process'
#6294
Icefloji
closed
1 month ago
1
SitemapSpider will ignore sitemap with URLs like https://website.com/filename.xml?from=7155352010944&to=7482320519360
#6293
seagatesoft
opened
1 month ago
3
Handle robots.txt files not utf-8 encoded
#6292
fkromer
closed
3 weeks ago
3
AttributeError: 'Decompressor' object has no attribute 'process'
#6291
phillipshaong
closed
1 month ago
1
Fix WindowsRunSpiderCommandTest skip outside Windows for older Twisted
#6290
Gallaecio
closed
1 month ago
2
GZipPlugin does not work with S3
#6289
masaez
closed
1 month ago
3
More documentation needed about the robots.txt protocol
#6288
josegicar
closed
1 month ago
3
Added comments to the file robotstxt.py
#6287
josegicar
closed
1 month ago
0
WindowsRunSpiderCommandTest isn't skipped properly in the pinned envs
#6286
wRAR
closed
1 month ago
0
Fix some comments
#6285
pengqiseven
closed
1 month ago
1
Update __init__.py
#6284
Runder55
closed
1 month ago
0
Update startproyect.py
#6283
Runder55
closed
1 month ago
1
added "lang='en" and "xml:lang='eng'" attributes to selectors-sample1.html
#6282
adrdiavaz
closed
1 month ago
1
Added "lang" and "xml:lang" attributes to this "<html>" element
#6281
fabrobher
closed
1 month ago
1
Update selectors-sample1.html
#6280
Runder55
closed
1 month ago
1
Full typing for scrapy/extensions, part 2.
#6279
wRAR
closed
1 month ago
1
Document the SpiderState extension
#6278
wRAR
opened
1 month ago
0
[24733854] Added Item Processor Pipeline
#6277
sahabyte
closed
3 weeks ago
0
Full typing for scrapy/extensions, part 1.
#6276
wRAR
closed
1 month ago
1
Full typing for scrapy/exporters.py.
#6275
wRAR
closed
1 month ago
1
Improve typing for Spider.parse().
#6274
wRAR
opened
1 month ago
1
add support for custom exporter class
#6273
guillermo-bondonno
opened
1 month ago
7
chore: Removing tests/requirements.txt and adding dependencies to the tox.ini file
#6272
lucas-belo
closed
1 month ago
14
Add an extra-deps job for pypy
#6271
Gallaecio
opened
1 month ago
0
Remove tests/requirements.txt
#6270
Gallaecio
closed
1 month ago
0