issues
search
internetarchive
/
brozzler
brozzler - distributed browser-based web crawler
Apache License 2.0
653
stars
96
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Implement Seed-level video capture setting handling
#288
gretchenleighmiller
opened
3 days ago
0
prometheus metrics for brozzler, plus yt-dlp
#287
galgeek
opened
4 days ago
0
updates for proxyrack mostly
#286
galgeek
opened
4 days ago
0
feat: Detect connection failures forwarded from warcprox and retry th…
#285
adam-miller
opened
2 weeks ago
1
fix: handle exceptions when requesting page headers for content-type …
#284
adam-miller
closed
3 weeks ago
0
prioritize video resolution — stop preferring single file download from yt-dlp
#283
galgeek
closed
1 month ago
0
websocket-client update, plus yt-dlp update
#282
galgeek
closed
1 month ago
0
Upgrade websocket-client dependency
#281
vbanos
opened
1 month ago
2
refine April ytdlp_last update
#280
galgeek
closed
1 month ago
0
Update Ansible and Vagrant for Debian 12
#279
insom
opened
3 months ago
1
behavior to capture opengov.nsw.gov document pages
#278
galgeek
closed
4 months ago
0
skip ytdlp for selected seeds
#277
galgeek
closed
3 months ago
0
run yt-dlp after brozzling a page (if at all)
#276
galgeek
closed
4 months ago
2
Brozzler does not work with the newest chromium.
#275
dElogics
opened
5 months ago
0
Use --headless=new or --headless=chrome if supported
#274
vbanos
closed
1 month ago
0
add eldo.lu cookies dialog selector to defaults
#273
galgeek
closed
6 months ago
0
Unable to playback bsky.app pages
#272
ArchivingToolsForWBM
opened
7 months ago
0
Use black, enforce with GitHub Actions
#271
avdempsey
closed
7 months ago
1
MOMA behavior
#270
galgeek
closed
7 months ago
1
update brozzler setup.py
#269
galgeek
closed
6 months ago
0
skip yt-dlp for PDFs
#268
galgeek
closed
9 months ago
0
Brozzler-easy issue after start
#267
yacylover
opened
10 months ago
0
unpack ie_result for youtube:tab too
#266
galgeek
closed
10 months ago
0
format-sort fix
#265
galgeek
closed
10 months ago
0
improve yt-dlp import
#264
galgeek
closed
10 months ago
0
yt-dlp: capture postprocessor "Merger" videos
#263
galgeek
closed
11 months ago
0
update doublethink & rethinkdb imports
#262
galgeek
closed
11 months ago
0
Update rethinkdb dependency
#261
vbanos
closed
11 months ago
0
headless chrome
#260
galgeek
closed
11 months ago
0
handle m3u8s
#259
galgeek
closed
11 months ago
0
yt-dlp capture replayable mp4s again
#258
galgeek
closed
1 year ago
0
Do not try to get a screenshot if status is 4xx, 5xx
#257
vbanos
closed
1 year ago
2
brozzle-page Not Working With Recent Version of Google Chrome
#256
treid003
opened
1 year ago
2
draft: add Thorium support
#255
galgeek
opened
1 year ago
0
configurable browser window height & width
#254
galgeek
closed
1 year ago
0
add socket_timeout opt for yt-dlp
#253
galgeek
closed
1 year ago
0
feat: implementing browserless
#252
andyMrtnzP
opened
1 year ago
2
Starting and Stopping
#251
usmanQNL
opened
1 year ago
0
Evaluation of brozzler's scalability?
#250
goelayu
opened
2 years ago
4
zlib compression for warcprox_meta blocks
#249
galgeek
closed
2 years ago
0
Add more stealth evasions
#248
vbanos
closed
2 years ago
2
--stealth for brozzler_worker
#247
galgeek
closed
2 years ago
0
Add stealth parameter to avoid antibot systems
#246
vbanos
closed
2 years ago
1
yt-dlp: use 'youtube_dl' logger
#245
galgeek
closed
2 years ago
0
yt-dlp should skip live streams
#244
galgeek
closed
2 years ago
0
Adds hop path support
#243
adam-miller
closed
2 years ago
0
updates for yt-dlp 2022.03.08.2 and more?
#242
galgeek
closed
2 years ago
0
yt-dlp for brozzler
#241
galgeek
closed
2 years ago
1
yt-dlp for brozzler
#240
galgeek
closed
2 years ago
1
Adds hop path support
#239
adam-miller
closed
2 years ago
0
Next