paul-tqh-nguyen / arxiv_as_a_newspaper

arxiv.org portrayed as if it were a newspaper.

Repair Author Links In DB #27

Closed paul-tqh-nguyen closed 5 years ago

paul-tqh-nguyen commented 5 years ago

Our ETL scrapes the mirrored pages so that we don't get blocked.

The mirrored pages have incorrect author links.

Let's run our ETL process again to fix this bad data so that we don't have bad links on our front end.

paul-tqh-nguyen commented 5 years ago

I can no longer find dead links like the ones I saw when I originally filed this ticket. I'll close it for now and reopen it if the issue comes up again.

paul-tqh-nguyen commented 5 years ago

find/q-bio/1/au:+Clement_N/0/1/0/all/0/1 is an example of a link that doesn't work.

Check out the Biomolecules research field.

paul-tqh-nguyen commented 5 years ago

Ok, it seems that this is mostly due to our machine specifically being blocked sometimes (but not always? Interesting...). Closing for now.
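
One way to tell a genuinely dead author link from one that only fails while our machine is blocked is to try each link several times and compare the outcomes. The sketch below is illustrative only and is not code from this repo: `classify_links` and the `fetch_ok` callback are hypothetical names, and `fetch_ok` is assumed to wrap whatever HTTP request the ETL already makes and return True on success.

```python
from typing import Callable, Dict, Iterable


def classify_links(
    links: Iterable[str],
    fetch_ok: Callable[[str], bool],  # hypothetical: returns True if the link loaded
    attempts: int = 3,
) -> Dict[str, str]:
    """Label each link as 'ok', 'intermittent', or 'dead'.

    'intermittent' means some attempts succeeded and some failed, which
    points at blocking or rate limiting rather than bad data in the DB.
    """
    results: Dict[str, str] = {}
    for link in links:
        # Try the same link several times and record each outcome.
        outcomes = [fetch_ok(link) for _ in range(attempts)]
        if all(outcomes):
            results[link] = "ok"
        elif any(outcomes):
            results[link] = "intermittent"
        else:
            results[link] = "dead"
    return results
```

Only links that come back as 'dead' on every attempt would be worth re-scraping. In real use it would probably make sense to pause between attempts so the checks themselves don't trigger more blocking.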