issues
search
propublica
/
upton
A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)
MIT License
1.61k
stars
112
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Doc: Add installation instructions
#43
vfonic
closed
6 years ago
1
Fix markdown syntax in readme
#42
bschne
closed
6 years ago
1
Update README URLs based on HTTP redirects
#41
ReadmeCritic
opened
9 years ago
0
New version?
#40
nofxx
closed
9 years ago
2
scrape_to_csv method should write to the CSV incrementally
#39
jeremybmerrill
opened
9 years ago
0
make scrape method return an enumerator
#38
jeremybmerrill
opened
9 years ago
0
Pagination always double-downloads first page
#37
jaypinho
closed
10 years ago
3
problem scraping index page (Scraping 0 instances)
#36
okliv
opened
10 years ago
1
Make Scraper instances additive
#35
jeremybmerrill
opened
10 years ago
1
Implement @pagination_interval
#34
caseycesari
closed
10 years ago
1
HTML Comment on stashed pages with info
#33
jeremybmerrill
closed
10 years ago
1
Create ScrapedPage object
#32
jeremybmerrill
opened
10 years ago
1
Helper methods for scraping one page and for scraping multiple
#31
jeremybmerrill
opened
10 years ago
5
Nokogiri::CSS::SyntaxError: unexpected '$' after ''
#30
irosenb
closed
10 years ago
3
The example in README.md does not work
#29
paos
closed
10 years ago
2
pagination doesn't respect sleep time
#28
jeremybmerrill
closed
10 years ago
7
Warn users of slug collisions
#27
jeremybmerrill
opened
10 years ago
0
Setting @pagination should be @paginated
#26
ArthurClemens
closed
10 years ago
1
Switch from concatenating HTML to putting it in an array when paginating
#25
jeremybmerrill
closed
10 years ago
2
Made changes to get_instance method
#24
esagara
closed
11 years ago
1
Recursive function causing a stack overflow
#23
esagara
closed
10 years ago
5
Use content-type to skip non-HTML instance pages
#22
swapab
closed
11 years ago
4
add a flew ruby style practice
#21
helloworld-cat
closed
11 years ago
1
Improving url_to_filename
#20
dannguyen
opened
11 years ago
7
Added support for pagination of index pages. Resolves issue #17.
#19
bxjx
closed
11 years ago
2
find by xpath
#18
abacha
closed
11 years ago
5
Handle pagination out-of-the-box
#17
bxjx
closed
10 years ago
2
relative url edge cases
#16
jeremybmerrill
closed
11 years ago
4
Downloading and caching extracted
#15
kgrz
closed
11 years ago
12
Added Utils.resolve_url and some tests
#14
dannguyen
closed
11 years ago
5
Adding some basic RSpec unit tests
#13
dannguyen
closed
11 years ago
1
fix broken/incomplete test
#12
kgrz
closed
11 years ago
5
Deprecating :selector_method parameter
#11
dannguyen
closed
11 years ago
1
Downloading and Caching part
#10
kgrz
closed
11 years ago
7
Rspec specific changes
#9
kgrz
closed
11 years ago
1
relative URLs
#8
jeremybmerrill
closed
11 years ago
2
fixed typos in upton.rb
#7
jkokenge
closed
11 years ago
0
More test coverage, more idiomatic tests
#6
brianflanagan
opened
11 years ago
15
Refactor API
#5
adelevie
opened
11 years ago
20
Update Gemfile
#4
shail
closed
11 years ago
1
Update TODO.md
#3
shail
closed
11 years ago
1
using gh-flavored markdown in readme
#2
adelevie
closed
11 years ago
1
Issue requiring utils?
#1
dankeemahill
closed
11 years ago
1