ArchiveTeam/grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Other
1.31k stars
129 forks
issues
Number | Title | Author | Status | Comments
#240 | Fix FB-RE2 build error in setup.py | dannypage | opened 1 day ago | 0 comments
#239 | Failed building wheel for fb-re2 | mariospicross | opened 3 months ago | 1 comment
#238 | Dashboard | drzo | opened 3 months ago | 0 comments
#237 | xFormers Support? | Astra060 | closed 4 months ago | 1 comment
#236 | Fallback to re if re2 can't be imported | rebane2001 | opened 4 months ago | 0 comments
#235 | Fix --which-wpull-command not working correctly with certain paths | rebane2001 | opened 4 months ago | 0 comments
#234 | fb-re2 dependency clang compile error on macOS Sonoma | xor-gate | opened 6 months ago | 1 comment
#233 | Is it possible to crawl only a domain and its subdomains? | ghost | closed 5 months ago | 3 comments
#232 | Support python 3.9-3.12 | fruzitent | opened 7 months ago | 0 comments
#231 | Add instructions for when using nix profiles | tripleo1 | opened 8 months ago | 0 comments
#230 | Add instructions for when using nix profiles | tripleo1 | closed 8 months ago | 1 comment
#229 | Grab site is not actually compatible with python 3.8 | cenodis | opened 1 year ago | 2 comments
#228 | is it possible to output regular files instead of warc? | ftc2 | opened 1 year ago | 6 comments
#227 | grab-site not displaying any content on Port 29000, but installed and running | DominicBilke | opened 1 year ago | 2 comments
#226 | Add upload option | upintheairsheep | closed 1 year ago | 6 comments
#225 | Debian/Ubuntu install instructions fail on Raspbian | Billybangleballs | closed 1 year ago | 5 comments
#224 | Add a --no-global-igset option | ivan | closed 1 year ago | 1 comment
#223 | Can't grab Wikimedia thumbnails, even when global is removed from igset file | BrinBellway | closed 1 year ago | 2 comments
#222 | Record grab-site version in WARC headers | JustAnotherArchivist | closed 2 years ago | 1 comment
#221 | Log settings changes and ignores | JustAnotherArchivist | opened 2 years ago | 2 comments
#220 | RuntimeError: To use txaio, you must first select a framework with .use_twisted() or .use_asyncio() | PadraigEire | closed 2 years ago | 3 comments
#219 | No messege on Dashboard | CircleCrop | closed 2 years ago | 3 comments
#218 | Nix-based macOS install does not work because of failing Yapsy tests | ivan | opened 2 years ago | 0 comments
#217 | install error in macOS Catalina | LeeBinder | closed 2 years ago | 10 comments
#216 | Update macOS install script to reflect Python 3.8.x (rather than 3.7) | LeeBinder | closed 2 years ago | 5 comments
#215 | Make it work again in Python 3.10 | iacore | opened 2 years ago | 5 comments
#214 | Syntax Error on run | trentwiles | closed 2 years ago | 2 comments
#213 | Should we add an anti-porn igset? | TheTechRobo | opened 2 years ago | 4 comments
#212 | Dubious quickmod2 SMF forum ignore | TheTechRobo | opened 2 years ago | 0 comments
#211 | README: remove outdated "non-SMF forums" | TheTechRobo | opened 2 years ago | 0 comments
#210 | Resuming a WARC after hard "No space left on device" error message? | Preservation-Quest | closed 2 years ago | 1 comment
#209 | Update README.md | Preservation-Quest | closed 2 years ago | 6 comments
#208 | multiple --wpull-args | TheTechRobo | opened 2 years ago | 0 comments
#207 | How do you add custom hooks now? | TheTechRobo | opened 2 years ago | 3 comments
#206 | Pause gracefully if OSError (No space left on device) | TheTechRobo | opened 2 years ago | 4 comments
#205 | Cloudflare-protected site responds with 503 Service Temporarily Unavailable | rmfkdehd | opened 2 years ago | 5 comments
#204 | Add some Tumblr ignores to global igset | TheTechRobo | opened 2 years ago | 0 comments
#203 | Add SimpleMachineForum ignores to `forums` igset | TheTechRobo | closed 2 years ago | 6 comments
#202 | On the dashboard, make the background colour ACTUALLY a background colour | TheTechRobo | closed 2 years ago | 1 comment
#201 | Add SimpleMachineForums igsets | TheTechRobo | closed 2 years ago | 0 comments
#200 | No module named 'autobahn' | vitacell | opened 2 years ago | 1 comment
#199 | Backslash to Forward slash correction | acrois | opened 2 years ago | 2 comments
#198 | Fix ludios_wpull to support SQLAlchemy 1.4 | ivan | closed 5 months ago | 7 comments
#197 | Dupe spotter user-defined list of expressions / separation of default dupe spotter expressions | acrois | opened 2 years ago | 1 comment
#196 | Error while starting a crawl in docker container | Z2Up1UwcaYOyZq | closed 2 years ago | 2 comments
#195 | Full Docker support | acrois | opened 2 years ago | 15 comments
#194 | infinite recursion on offsite links? | TheTechRobo | opened 2 years ago | 3 comments
#193 | Ignore errors and keep crawling | TowardMyth | opened 2 years ago | 8 comments
#192 | Project Evolution | acrois | opened 2 years ago | 4 comments
#191 | What does the ID do? | TheTechRobo | closed 2 years ago | 3 comments