issues
search
elliotgao2
/
gain
Web crawling framework based on asyncio.
GNU General Public License v3.0
2.04k
stars
207
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
The project is dead
#51
itallmakesense
opened
4 years ago
0
Gain Improvements - Stable
#50
kwuite
closed
5 years ago
6
demo error
#49
AaronConlon
opened
5 years ago
0
bug
#48
luotom
opened
5 years ago
0
SSL handshake failed on verifying the certificate
#47
38602629
opened
6 years ago
1
Gain Improvements - Ludaro
#46
kwuite
opened
6 years ago
5
What does this statement mean?
#45
StupidTAO
closed
6 years ago
1
Add hooks before download and after download.
#44
songww
opened
6 years ago
3
Does it work on OSX?
#43
littlepea
opened
6 years ago
1
The ``sciencenet_spider.py`` example does not (seem to) work for python 3.6
#42
endafarrell
opened
6 years ago
5
aiofiles BUG
#41
willshion
closed
6 years ago
1
Queue timeout and python 3.6 support.
#40
yc0
closed
6 years ago
0
Repeated bug
#39
allphfa
closed
6 years ago
0
add cssParser
#38
allphfa
closed
6 years ago
0
Add document's own parsing
#37
allphfa
closed
6 years ago
1
add encoding
#36
allphfa
closed
6 years ago
1
gain API update to reflect proposed usage
#35
origliante
closed
6 years ago
0
Change candidate urls implementation from list to asyncio queue and little fixes
#34
babykick
closed
6 years ago
0
Change candidate urls implementation from list to asyncio queue and little fixes
#33
babykick
closed
7 years ago
0
Update XPathParser and proxy usage in README.
#32
babykick
closed
7 years ago
1
Add proxy support
#31
babykick
closed
7 years ago
2
Add PhantomJS support.
#30
elliotgao2
closed
6 years ago
1
Limit the interval between two requests.
#29
elliotgao2
closed
6 years ago
7
fix format to match PEP8
#28
wuqiangroy
closed
6 years ago
0
Another try for parsing multiple items
#27
hyfc
closed
6 years ago
0
Fix #24
#26
wisecsj
closed
6 years ago
0
TypeError: write() argument must be str, not dict
#25
hyfc
closed
7 years ago
1
Css selector add attr not work correctly
#24
wisecsj
closed
6 years ago
1
Unescape html contains HTML Entities
#23
wisecsj
closed
7 years ago
1
Please do a decent code review before accepting pull requests
#22
perklet
closed
6 years ago
1
Test failed on Windows
#21
hyfc
closed
7 years ago
1
Some Suggestions
#20
wisecsj
closed
7 years ago
4
Add homepage.
#19
elliotgao2
closed
7 years ago
1
File downloader.
#18
elliotgao2
closed
7 years ago
1
IP proxy.
#17
elliotgao2
closed
7 years ago
4
Add full documentation.
#16
elliotgao2
closed
6 years ago
0
Add logo.
#15
elliotgao2
closed
7 years ago
0
Add architecture diagram.
#14
elliotgao2
closed
7 years ago
0
Master
#13
wisecsj
closed
7 years ago
0
Fix travis-ci
#12
c1ay
closed
7 years ago
0
Revert "Add built-in save result class"
#11
elliotgao2
closed
7 years ago
0
Add built-in save result class
#10
c1ay
closed
7 years ago
1
Add support to handle the value of each field of an item.
#9
howie6879
closed
7 years ago
2
Building 'pybloomfilter' extension on Windows.
#8
adonge
closed
7 years ago
1
Parse multiple item from each page.
#7
elliotgao2
closed
6 years ago
0
RuntimeError: uvloop does not support Windows at the moment.
#6
sumarsky
closed
7 years ago
3
Custom header.
#5
elliotgao2
closed
7 years ago
0
Add some built-in save() methods.
#4
elliotgao2
closed
7 years ago
3
Separate Follower from Parser.
#3
elliotgao2
closed
7 years ago
1
Regex selector support.
#2
elliotgao2
closed
7 years ago
0
Next