issues
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
https://scrapy.org
BSD 3-Clause "New" or "Revised" License
50.99k stars, 10.34k forks
CSS selectors
#176
barraponto
closed
10 years ago
67
Python 3 support
#263
extesy
closed
7 years ago
65
Scrapy.selector Enhancement Proposal
#906
zwidny
closed
8 years ago
60
New selector method: extract_first()
#568
shirk3y
closed
9 years ago
54
[GSoC 2019] Support for Different robots.txt Parsers
#3656
whalebot-helmsman
closed
4 years ago
48
Support for socks5 proxy
#747
cydu
opened
9 years ago
48
Identical requests sent by Scrapy vs Requests module returning different status codes
#4951
pmdbt
opened
3 years ago
46
Integrating xtractmime into Scrapy
#5204
akshaysharmajs
opened
2 years ago
45
enable ANSI color (instead of ANSI color codes) in the Windows terminal #4393
#4403
akshaysharmajs
closed
3 years ago
45
Missing the Scrapy entry in Wikipedia in many languages
#4233
noviluni
opened
4 years ago
44
Issue with running scrapy spider from script.
#2473
tituskex
closed
2 years ago
42
' error: command 'x86_64-linux-gnu-gcc' failed with exit status 1 '
#2115
euler16
closed
7 years ago
41
Centralized Request fingerprints
#900
kmike
closed
1 year ago
40
Python 3.7 support
#3143
lopuhin
closed
5 years ago
39
Speedup & fix URL parsing
#1306
kmike
opened
8 years ago
39
Per request delay
#802
chekunkov
opened
9 years ago
38
Support relative urls better
#548
kmike
closed
4 years ago
38
Scrapy chokes on HTTP response status lines without a Reason phrase
#345
tonal
closed
6 years ago
37
General Message Queues as Storage for Requests
#4326
whalebot-helmsman
opened
4 years ago
36
Ability to control consumption of start_requests from spider
#3237
whalebot-helmsman
opened
6 years ago
36
sslv3 alert handshake failure when making a request
#1764
lagenar
closed
7 years ago
36
Add-ons
#1272
jdemaeyer
closed
9 months ago
36
GSoC 2021: Feeds enhancements
#4963
ejulio
closed
1 year ago
35
Error when install scrapy in window by using pip install scrapy
#2881
lhkthomas
closed
1 year ago
34
[MRG+1] Feed exports: beautify JSON and XML
#2456
elacuesta
closed
6 years ago
33
[MRG] Selectors unified API
#426
dangra
closed
10 years ago
32
[MRG+1] Support auth credentials in netloc for HTTP and FTP handlers
#1466
umrashrf
opened
8 years ago
30
[MRG+1] Migrating selectors to use parsel
#1409
eliasdorneles
closed
8 years ago
30
Make it easier to use Scrapy in Jupyter Notebook
#4299
Gallaecio
opened
4 years ago
29
Support for dataclass and attrs items
#3881
elacuesta
closed
3 years ago
29
SSL issue when scraping website
#1429
gmeans
closed
7 years ago
29
[MRG +1] bpython support (followup on #270)
#1100
nyov
closed
8 years ago
29
Wrong type(response) for binary responses
#4240
ejulio
opened
4 years ago
28
add support for a nested loaders
#1467
dacjames
closed
8 years ago
28
Support for async callbacks
#4978
wRAR
closed
1 year ago
27
TLS handshake failure
#2717
povilasb
closed
4 years ago
27
Adding domain
#3160
tianhuil
closed
4 years ago
26
HTTP 2 support
#1854
povilasb
closed
3 years ago
26
IPv6ThreadedResolver based on socket.getaddrinfo
#1104
nyov
closed
4 years ago
26
[MRG+1] Added JmesSelect
#1016
SudShekhar
closed
9 years ago
26
Security enhancement when following a "redirect"
#457
mvsantos
opened
10 years ago
26
Change extensions/spiders/settings initialisation order, v2
#6038
wRAR
closed
7 months ago
25
[MRG+1] Issue #2919: Fix FormRequest.formdata with GET method duplicates same key in query string
#3579
maramsumanth
closed
2 years ago
25
twisted.python.failure.Failure twisted.internet.error.ConnectionLost: Connection to the other side was lost in a non-clean fashion: Connection lost
#3103
ghost
closed
6 years ago
25
ImagePipeline breaks on invalid images
#5079
Gallaecio
closed
2 years ago
24
python requests scrapes correctly but scrapy cant
#4883
maxwill-max
closed
3 years ago
24
SSL handshake failure
#2424
briehanlombaard
closed
6 years ago
24
[MRG+1] Added option to turn off ensure_ascii for JSON exporters
#2034
dracony
closed
7 years ago
24
Allow multiple items through pipelines?
#1915
dxue2012
closed
7 years ago
24
[MRG+1] Fix for KeyError in robots.txt middleware
#1735
ArturGaspar
closed
8 years ago
24