wolverine2k / crunchy

Automatically exported from code.google.com/p/crunchy
0 stars 0 forks source link

Parsing limitations with Python 3.0 #73

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
Crunchy now works partly with Python 3.0.  However, it is extremely fussy
as to how html files are written; all tags have to be properly closed, and
no special character is handled properly unless they are escaped using, eg.
é instead of é.

As a long term goal, running Crunchy under Python 3.0 should be as easy as
running it with Python 2.x - but we may have to wait for a 3.0 version of
BeautifulSoup.

Original issue reported on code.google.com by andre.ro...@gmail.com on 26 Dec 2007 at 10:30

GoogleCodeExporter commented 8 years ago
Fixed via a hack: a subprocess is launched (assuming that "python" launches a 
2.x
version) which cleans up the file using ElementSoup/BeautifulSoup.  It is not 
ideal,
but will have to do for now.

Original comment by andre.ro...@gmail.com on 30 Dec 2007 at 9:47

GoogleCodeExporter commented 8 years ago

Original comment by andre.ro...@gmail.com on 30 Dec 2007 at 9:50