taganaka/polipus
Polipus: distributed and scalable web-crawler framework
MIT License
92 stars
32 forks
issues
Use #add_url in #takeover to add seed urls
#22
tmaier
closed
10 years ago
2
[WIP] HTTP compression handling
#21
taganaka
closed
10 years ago
2
Gzip decoded body not used anywhere
#20
tmaier
closed
10 years ago
1
Show code coverage and results of quality analysis
#19
tmaier
closed
10 years ago
0
Remove #overflow_adapter. Closes #11
#18
tmaier
closed
10 years ago
0
Minor changes to code style
#17
tmaier
closed
10 years ago
1
Change logging format. Closes #14
#16
tmaier
closed
10 years ago
0
Internet connection lost; Page still stored and processed
#15
tmaier
closed
10 years ago
6
Change logging format
#14
tmaier
closed
10 years ago
1
Travis ci
#13
taganaka
closed
10 years ago
0
Robots.txt option
#12
ABrisset
closed
10 years ago
3
#queue_overflow_adapter and #overflow_adapter; Same thing?
#11
tmaier
closed
10 years ago
1
Incremental Crawling
#10
nengine
closed
10 years ago
3
Thread seems to hang in HTTP Call
#9
hendricius
opened
10 years ago
9
How to setup in a cluster environment?
#8
dbuarque
closed
10 years ago
2
URL patching
#7
nengine
closed
10 years ago
5
RegularExpression To Follow a Link
#6
nengine
closed
10 years ago
1
Where is the Gem
#5
nengine
closed
10 years ago
1
Domain aliases
#4
taganaka
closed
10 years ago
0
allow following uris with www
#3
hendricius
closed
10 years ago
3
fix wrong variable on stats reset
#2
hendricius
closed
10 years ago
0
Dofollow links
#1
hendricius
closed
10 years ago
4