poeml / mirrorbrain

MirrorBrain
http://mirrorbrain.org/
Other
75 stars 37 forks source link

unparseable index at http://mirror.bjtu.edu.cn/tdf/ #115

Open poeml opened 9 years ago

poeml commented 9 years ago
                                                                                [          ]

Issue migrated (2015-06-05) from old issue tracker http://mirrorbrain.org/issues/issue115

Title    unparseable index at http://mirror.bjtu.edu.cn/tdf/
 Priority   bug            Status      in-progress
Superseder               Nosy List     floeff, poeml
Assigned To poeml         Keywords
msg404 (view) Author: floeff Date: 2012-07-20.16:29:28

It seems that http://mirror.bjtu.edu.cn/tdf/ changed their layout, resulting in the following error message:

mirror.bjtu.edu.cn: unparseable HTML index in

msg491 (view) Author: poeml Date: 2014-01-31.00:02:03

I suggest a workaround:

Use rsync://mirror.bjtu.edu.cn/tdf/ for scanning, which should give better results and should also be more efficient in general.

msg492 (view) Author: poeml Date: 2014-01-31.00:04:07

There's CSS in front of the HTML which contains