JBGruber/paperboy
A comprehensive (eventually) collection of webscraping scripts for news media sites
45 stars, 2 forks
issues
parsing script type="application/ld+json"
#25
schochastics
closed
3 weeks ago
2
add german media (#23)
#24
schochastics
opened
3 weeks ago
5
german news media
#23
schochastics
opened
3 weeks ago
2
fixed faz #21
#22
schochastics
closed
3 weeks ago
2
faz broken
#21
schochastics
closed
3 weeks ago
0
improve rss reader (close #19)
#20
JBGruber
closed
3 months ago
0
Extend RSS parsers
#19
JBGruber
closed
3 months ago
0
Adding Open Graph scraping
#18
kasperwelbers
closed
11 months ago
3
Cloudflare
#17
JBGruber
closed
11 months ago
1
usethis style function for new parsers
#16
JBGruber
opened
1 year ago
0
Illegal characters
#15
JBGruber
opened
1 year ago
0
Testing strategy
#14
JBGruber
opened
1 year ago
1
Refactor collect
#13
JBGruber
closed
1 year ago
0
Refactor collect (#11)
#12
JBGruber
closed
1 year ago
1
refactor collect
#11
JBGruber
closed
1 year ago
2
Refactor simple pb deliver paper
#10
JBGruber
closed
1 year ago
0
refactor pb_deliver.data.frame to make pb_deliver_paper function less complex
#9
JBGruber
closed
1 year ago
2
Improve cookie handling
#8
JBGruber
closed
1 year ago
1
made collection more robust
#7
JBGruber
closed
1 year ago
0
R-CMD-check fails
#6
JBGruber
closed
1 year ago
0
Write vignette for developers to show them how they can contribute a parser
#5
JBGruber
closed
1 year ago
0
Write default parser that tries known approaches
#4
JBGruber
closed
2 years ago
0
washingtonpost.com GDPR consent
#3
JBGruber
closed
2 years ago
3
forbes.com returns garbage
#2
JBGruber
opened
3 years ago
0
Implement initial set of scrapers to get things going
#1
JBGruber
opened
3 years ago
1