issues
search
anyweez
/
newsy
News content extraction.
0
stars
0
forks
source link
Empty bodies: prevent them from actually being saved unless there's a body
#6
Closed
anyweez
closed
7 years ago
anyweez
commented
7 years ago
In newsy
anyweez
commented
7 years ago
Using a scraping library now so should have fewer of these in general.
Blocking out empty responses from scraper.
Also excluding articles where no text is extracted. Raw HTML is kept but no records are generated elsewhere.
In newsy