everypolitician/scraped_page_archive
Create an archive of HTML pages scraped by a Ruby scraper
MIT License
1 star, 0 forks
issues
#65 Tidy for newer rubocop (tmtmtmtm, closed 7 years ago, 0 comments)
#64 chomp MORPH_SCRAPER_CACHE_GITHUB_REPO_URL (tmtmtmtm, closed 7 years ago, 0 comments)
#63 Make it easier to see what the archiver is doing (chrismytton, opened 7 years ago, 0 comments)
#62 Make gem store PDFs as correct type (ondenman, opened 7 years ago, 1 comment)
#61 Check storage is valid before using it (chrismytton, opened 7 years ago, 0 comments)
#60 [WIP] Add a local git storage class (chrismytton, opened 7 years ago, 0 comments)
#59 Remove old release docs from README (chrismytton, closed 8 years ago, 2 comments)
#58 Fix release date for v0.5.0 in CHANGELOG (chrismytton, closed 7 years ago, 1 comment)
#57 Make the gem work with Mechanize (octopusinvitro, opened 8 years ago, 0 comments)
#56 Enable RubyGems deployment from Travis CI (chrismytton, closed 8 years ago, 4 comments)
#55 Version 0.5.0 (chrismytton, closed 8 years ago, 3 comments)
#54 Make creating an adapter easier (chrismytton, opened 8 years ago, 0 comments)
#53 Don't re-clone repo on each request (chrismytton, closed 8 years ago, 2 comments)
#52 Git repo is re-cloned on every request (chrismytton, closed 8 years ago, 0 comments)
#51 Extract library for saving to the archive (chrismytton, opened 8 years ago, 0 comments)
#50 Produce a more helpful error if origin is an scp-style URL (mhl, closed 7 years ago, 5 comments)
#49 Remove reference to variable that doesn't exist (chrismytton, closed 8 years ago, 1 comment)
#48 Error when using GitStorage (chrismytton, closed 8 years ago, 0 comments)
#47 chomp the MORPH_SCRAPER_CACHE_GITHUB_REPO_URL before use (tmtmtmtm, closed 7 years ago, 0 comments)
#46 Improve the error message when the repo url is wrong (octopusinvitro, closed 8 years ago, 3 comments)
#45 improve error message when git repo is missing (davewhiteland, closed 8 years ago, 0 comments)
#44 Add Rubocop (chrismytton, closed 8 years ago, 1 comment)
#43 Add rubocop (tmtmtmtm, closed 8 years ago, 0 comments)
#42 Extract git operations into a separate storage class (chrismytton, closed 8 years ago, 3 comments)
#41 Switch to using pry in bin/console (chrismytton, closed 8 years ago, 0 comments)
#40 Scrub GitHub access token from errors (chrismytton, opened 8 years ago, 1 comment)
#39 0.4.1 release broken (struan, opened 8 years ago, 0 comments)
#38 Handle rejected pushes (tmtmtmtm, opened 8 years ago, 0 comments)
#37 Handle ssh remotes (octopusinvitro, opened 8 years ago, 1 comment)
#36 Update README to reorder usage examples (octopusinvitro, closed 8 years ago, 0 comments)
#35 Error when running with Poltergeist (chrismytton, opened 8 years ago, 0 comments)
#34 Offer a way to turn off the archive (chrismytton, opened 8 years ago, 1 comment)
#33 What should happen when the same url returns different html? (chrismytton, opened 8 years ago, 1 comment)
#32 Remove references to page in capybara adapter (chrismytton, closed 8 years ago, 0 comments)
#31 Ensure storage location is set before running VCR (chrismytton, closed 8 years ago, 0 comments)
#30 Error when using Capybara support (chrismytton, closed 8 years ago, 0 comments)
#29 Fix Ruby 2.0.0 compatibility (chrismytton, closed 8 years ago, 0 comments)
#28 Doesn't work with ruby 2.0.0 (chrismytton, closed 8 years ago, 2 comments)
#27 Get GitHub url from environment variable first (chrismytton, closed 8 years ago, 0 comments)
#26 Session details in URL causing pages to be saved multiple times (struan, opened 8 years ago, 0 comments)
#25 Clean Github token from errors (tmtmtmtm, opened 8 years ago, 0 comments)
#24 Change ScrapedPageArchive to be a class (chrismytton, closed 8 years ago, 0 comments)
#23 Add method for getting pages back out of the archive (chrismytton, closed 8 years ago, 0 comments)
#22 Reverse the order of the Usage (tmtmtmtm, closed 8 years ago, 0 comments)
#21 Getting data back out of the archive (chrismytton, opened 8 years ago, 0 comments)
#20 Avoid storing file changes that are due to always changing page elements (struan, opened 8 years ago, 2 comments)
#19 add support for capybara poltergeist based scrapers (struan, closed 8 years ago, 4 comments)
#18 Retry github timeouts (tmtmtmtm, opened 8 years ago, 0 comments)
#17 Don't rescue commit errors with pry (chrismytton, closed 8 years ago, 0 comments)
#16 Set git user.name option after cloning (chrismytton, closed 8 years ago, 0 comments)