issues
search
ecprice
/
newsdiffs
Automatic scraper that tracks changes in news articles over time.
Other
497
stars
135
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Active Maintaining
#65
mjspeck
opened
4 months ago
2
Update for NYT / User Agent change
#64
iamvishnurajan
opened
2 years ago
1
ModuleNotFoundError: No module named 'website'
#63
nearbrogithub
opened
3 years ago
0
Initial docker setup
#62
0xrin1
opened
4 years ago
0
URLError: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:727)>
#61
dwedigital
opened
4 years ago
0
Add Canadian news
#60
dleslie
opened
4 years ago
0
Update for "2020"
#59
iamvishnurajan
opened
4 years ago
1
Fix BBC scraper heading and date
#58
tomwieck
closed
2 years ago
0
Adding header information in http requests to avoid 403 errors
#57
msbt
closed
6 years ago
1
Add theguardian.com
#56
jasoncartwright
opened
6 years ago
0
/browse takes 20-30sec to respond
#55
jasoncartwright
opened
6 years ago
0
Add foxnews.com
#54
jasoncartwright
opened
6 years ago
0
newsdiffs.org isn't served over HTTPS
#53
jasoncartwright
opened
6 years ago
0
BBC scraper no longer works
#52
jasoncartwright
opened
6 years ago
0
Upgrade to Django 1.6 and add CONN_MAX_AGE: 60
#51
carlgieringer
closed
2 years ago
0
NYT scraper no longer works
#50
iamvishnurajan
opened
6 years ago
3
Fix NYTParser for new article format
#49
carlgieringer
closed
6 years ago
20
Docs fail to mention check on robots.txt
#48
will-martin
opened
6 years ago
0
Updates to help me prioritize critical bugs
#47
carlgieringer
closed
2 years ago
0
WIPR: DO NOTCOMMIT. Django 1.10 & remove git
#46
awong-dev
closed
7 years ago
0
Newsdiffs wh frontend
#45
ezramechaber
closed
7 years ago
0
Clean up README.md and make work with virtualenv.
#44
awong-dev
opened
7 years ago
2
format readme, include link
#43
martenson
closed
8 years ago
1
Fixed issues with https:// and disabled the functionality to browse old days' data
#42
ExLupi
closed
8 years ago
0
frontend/views.py REJECTING valid urls
#41
wizardishungry
opened
8 years ago
0
Fix protocol relative urls
#40
wizardishungry
opened
8 years ago
0
Fixes for getting newsdiffs working on OSX
#39
wizardishungry
opened
8 years ago
0
Formatted README.md
#38
mgarciaisaia
opened
8 years ago
1
Heroku?
#37
amandabee
opened
9 years ago
3
Change log pleading ignorance, but it knows.
#36
amandabee
closed
9 years ago
2
TypeError: sequence item 0: expected string or Unicode, NoneType found
#35
ryantate
opened
9 years ago
0
disambuguate URLs
#34
amandabee
opened
9 years ago
0
patched to fix issue #25 https://github.com/ecprice/newsdiffs/issues/25
#33
cwage
opened
9 years ago
0
Django templating
#32
catcosmo
opened
9 years ago
2
Frontend/django template integration
#31
catcosmo
closed
9 years ago
0
delete this
#30
anjakammer
opened
9 years ago
1
Backend/parsers
#29
RobertPiwonski
opened
9 years ago
0
politico scraper fails on "print.cfm"
#28
Fil
opened
9 years ago
0
Fix README formatting
#27
cllns
opened
9 years ago
0
add .md extension to README
#26
cllns
closed
9 years ago
0
ImportError: Could not import settings 'website.settings' (Is it on sys.path?): No module named website.settings
#25
ShahriyarR
opened
10 years ago
6
change input type to email
#24
mhuerster
closed
10 years ago
1
DatabaseError: table Articles has no column named git_dir
#23
ndarville
closed
10 years ago
9
Update about.html
#22
StevenMaude
closed
10 years ago
0
rate-limiting the scraper
#21
faried
opened
10 years ago
0
Track WikiLeaks changes
#20
douglaslucas
opened
11 years ago
0
Fixed baseparser to handle relative URLs
#19
thomasjoulin
closed
11 years ago
0
Parser for www.guardian.co.uk
#18
brkcmd
closed
6 years ago
0
Add support for scraping "section fronts" in addition to the publication's home page
#17
mherdeg
closed
11 years ago
2
Roadmap...?
#16
toyg
opened
11 years ago
0
Next