montera34 / pageonex

PageOneX. Analyzing front pages
http://pageonex.com
GNU Affero General Public License v3.0

Missing newspaper that is already in kiosko_media_list.csv #168

Closed by numeroteca 11 years ago

numeroteca commented 11 years ago

A user reported that the La Nación (Argentina) newspaper was missing.

I checked kiosko_media_list.csv and the newspaper was there. I ran "rake scraping:update_media" and everything was up to date. Then I counted the options offered when creating a thread: there are 716, versus 721 entries in kiosko_media_list.csv.

Reviewing the list, I saw this:

<option value="26">Argentina - La Gaceta</option>
<option value="28">Argentina - La Nueva Provincia</option>

Oops, value 27 is missing here. It shows up elsewhere in the list as:

<option value="27">Paraguay - La Nación</option>

I think "Paraguay - La Nación" overwrote "Argentina - La Nación" because they both have the same name, "nacion". We should check both the newspaper name and the country before updating. @rahulbot?
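To illustrate the suspected overwrite, here is a minimal sketch in plain Ruby. It is not the actual update_media code: the `Row` struct and its fields (`country`, `name`, `display_name`) are hypothetical stand-ins for whatever the scraper really stores. It only shows why keying on the name alone would lose one of the two entries, while keying on country plus name keeps both.

```ruby
# Hypothetical row shape; the real kiosko_media_list.csv columns may differ.
Row = Struct.new(:country, :name, :display_name)

rows = [
  Row.new("Argentina", "nacion", "La Nación"),
  Row.new("Paraguay", "nacion", "La Nación"),
]

# Keyed by name only: the later row replaces the earlier one.
by_name = {}
rows.each { |r| by_name[r.name] = r }

# Keyed by country and name: both rows survive.
by_country_and_name = {}
rows.each { |r| by_country_and_name[[r.country, r.name]] = r }

puts by_name.size              # 1
puts by_country_and_name.size  # 2
```

If the update task looks records up by name before saving, adding the country to that lookup would be the equivalent change.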

Newspapers with common names usually carry a prefix in kiosko.net, but not in this case. Which other newspapers are missing?
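One way to answer that question would be to list every name that appears under more than one country. The sketch below assumes the CSV has `country` and `name` header columns, which may not match the real kiosko_media_list.csv; the sample data and the `colliding_names` helper are made up for illustration.

```ruby
require "csv"

# Hypothetical sample; assumed headers are "country", "name", "display_name".
SAMPLE = <<~ROWS
  country,name,display_name
  Argentina,gaceta,La Gaceta
  Argentina,nacion,La Nación
  Paraguay,nacion,La Nación
ROWS

# Returns a hash of name => list of countries, for names used in 2+ countries.
def colliding_names(csv_text)
  rows = CSV.parse(csv_text, headers: true)
  rows.group_by { |r| r["name"] }
      .select { |_name, group| group.map { |r| r["country"] }.uniq.size > 1 }
      .transform_values { |group| group.map { |r| r["country"] } }
end

p colliding_names(SAMPLE)
```

Running the same helper on the real file contents (with the header names adjusted) should list the names that collide across countries, which are the likely candidates for the gap between 716 and 721.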

Bug reported by maru: http://blog.pageonex.com/report-bugs/

rahulbot commented 11 years ago

Verified on dev: after git pull, rake scraping:update_media, and touch tmp/restart.txt, I see "Argentina - La Nación" in the list.

rahulbot commented 11 years ago

Did the same on production, and it works there too.