ArchiveTeam/grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Other
1.31k stars, 129 forks
issues
Newest
#140 | add missing build-depends | anarcat | opened 5 years ago | 2
#139 | Better alignment values for larger crawl | scumola | opened 5 years ago | 3
#138 | AttributeError: 'cython_function_or_method' object has no attribute 'endswith' | scumola | closed 5 years ago | 3
#137 | Old resolver is still used after changing /etc/resolv.conf | ivan | opened 5 years ago | 0
#136 | dashboard: Home/PgUp/PgDn/End keys usually fail in Firefox | ivan | opened 5 years ago | 1
#135 | grab-site 2.x upgrade guide | ivan | closed 5 years ago | 0
#134 | grab-site spends a lot of time in dupespotter | ivan | opened 5 years ago | 1
#133 | Fix websocket reconnection to gs-server | ivan | closed 5 years ago | 3
#132 | grab-site grabs urls with session id against my will | fin-atem | closed 5 years ago | 2
#131 | Document --no-warc-compression | Dri0m | closed 2 years ago | 4
#130 | Add feature to avoid queuing any more URLs | ivan | closed 5 years ago | 1
#129 | Failed DNS resolutions are retried forever with --wpull-args=--retry-dns-error | ethus3h | closed 5 years ago | 4
#128 | Benchmark cpython 3.4 vs Nuitka-built grab-site.exe | ivan | closed 5 years ago | 1
#127 | imgur images are not grabbed after imgur redirects | ivan | opened 5 years ago | 0
#126 | tumblr archives may not play back properly | RomeSilvanus | opened 5 years ago | 9
#125 | Missing linked files. | ZizzyDizzyMC | closed 5 years ago | 5
#124 | Can no longer archive tumblr blogs from Europe | ivan | closed 6 years ago | 1
#123 | Does gs-server really need to run? | hofmand | closed 6 years ago | 4
#122 | Cannot get gs-server to start | brentoage | closed 6 years ago | 2
#121 | Error installing on MacOS 10.13.4 High Sierra | EtienneBerlin | closed 6 years ago | 4
#120 | scrape a page more than once | notslang | opened 6 years ago | 3
#119 | Added Centos7 Installation | raspher | closed 3 years ago | 1
#118 | Add archive.org to global ignore set | brandongalbraith | closed 6 years ago | 5
#117 | wpull spends a lot of time in add_cookie_header | ivan | opened 6 years ago | 0
#116 | box-shadow on dashboard makes Chrome use twice as much CPU | ivan | closed 6 years ago | 1
#115 | Add support for Cloudflare DDoS protection screen | ivan | opened 6 years ago | 2
#114 | Update README to point to newer deadsnakes PPA | ivan | closed 6 years ago | 1
#113 | Clarify "start a new shell" in README | ivan | closed 6 years ago | 1
#112 | Automatically slow down for a domain on 429 Too many requests | ivan | opened 6 years ago | 1
#111 | Crash in wpull/dns.py -> dns/inet.py -> is_multicast | ivan | closed 6 years ago | 2
#110 | Add toggleable mode that shows all URLs being queued | ivan | opened 6 years ago | 0
#109 | googleplus igset: Ignore more login URLs | jodizzle | closed 6 years ago | 4
#108 | Nonsensical [Errno 8] Exec format error | ivan | opened 6 years ago | 1
#107 | Add a default get_urls hook to get :orig quality images on Twitter | ivan | closed 5 years ago | 6
#106 | Windows Subsystem for Linux: gs-dump-urls fails on active crawl | ivan | opened 6 years ago | 1
#105 | Call Script To Rewrite URL Matching Specific Regex? | brandongalbraith | closed 6 years ago | 2
#104 | dashboard: use CSS scroll-boundary-behavior in supported browsers | ivan | closed 6 years ago | 1
#103 | Add queue and pending | raspher | closed 6 years ago | 1
#102 | Segfault when run under Windows Subsystem for Linux | ivan | closed 6 years ago | 1
#101 | grab-site users please upgrade for important dashboard fix | ivan | closed 7 years ago | 0
#100 | Duplicate URLs with different Request headers not stored | atiro | opened 7 years ago | 4
#99 | Support Restarting Crawl Prematurely Terminated | brandongalbraith | closed 7 years ago | 2
#98 | Update the README to assume grab-site is installed in a virtualenv | ivan | closed 6 years ago | 1
#97 | Help on 403 Forbidden errors | Svekla | closed 7 years ago | 7
#96 | grab-site benchmark with cPython 3.4.5 vs PyPy3 5.5.0 on Ubuntu 16.04.1 | ivan | closed 7 years ago | 0
#95 | ImportError: No module named 'dns.resolver' | ivan | closed 7 years ago | 2
#94 | Tumblr redirect | RomeSilvanus | closed 7 years ago | 2
#93 | Add Dockerfile to simplify installation | notslang | opened 7 years ago | 20
#92 | Upgrade to wpull 2.0 | luckcolors | closed 7 years ago | 1
#91 | Add config file using configparser | 12As | closed 7 years ago | 1