peterhn / cs121-crawler

web crawler for ics.uci.edu
0 stars 0 forks source link

URL Traps #3

Closed pyamsoft closed 9 years ago

pyamsoft commented 9 years ago

Leave this issue open as a reference please.

Explicitly skip all URL Traps URL Traps:

URLs suffixed with (?) are unconfirmed

archive.ics.uci.edu calendar.ics.uci.edu ngs.ics.uci.edu

pyamsoft commented 9 years ago

ngs.ics.uci.edu is a single professors page which redirects to itself, confirmed URL trap.

pyamsoft commented 9 years ago

evoke.ics.uci.edu (?)