Closed GoogleCodeExporter closed 8 years ago
fixed, added this to cleanHTML():
-----
elif re.search('<article id="WikiaMainContent" class="WikiaMainContent">', raw):
raw = raw.split('<article id="WikiaMainContent" class="WikiaMainContent">')[1].split('</article>')[0]
Original comment by emi...@gmail.com
on 8 Sep 2011 at 8:03
Original comment by emi...@gmail.com
on 8 Sep 2011 at 8:04
Original issue reported on code.google.com by
emi...@gmail.com
on 8 Sep 2011 at 8:00