yibinlin / hw2-team18

hw2 for 11791

Removing HTML tags #19

Closed HaohanWang closed 11 years ago

HaohanWang commented 11 years ago

HTML tags should be removed manually, because removing them automatically would delete some useful information.

HaohanWang commented 11 years ago

There are two cases to handle. The first is to replace HTML character references such as &#223 with beta; this is already done by jsoup.parse. The second is to replace image tags such as <....beta.gif....> with beta; I have implemented a function, replaceDotGif, for that.
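
For illustration, the second case could look roughly like the sketch below. This is not the actual replaceDotGif from the repository, whose code is not shown in this thread. It is a minimal regex-based guess that assumes each symbol appears as a single tag referencing a file named <symbol>.gif; the class name DotGifSketch and the sample input are made up.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class DotGifSketch {
    // Assumption: a symbol image is one tag that mentions "<name>.gif",
    // e.g. <img src="/images/beta.gif" border="0">. The name before ".gif"
    // is captured so it can stand in for the whole tag.
    private static final Pattern DOT_GIF_TAG =
            Pattern.compile("<[^<>]*?([A-Za-z0-9]+)\\.gif[^<>]*>", Pattern.CASE_INSENSITIVE);

    // Hypothetical stand-in for replaceDotGif: replaces every tag that
    // references <name>.gif with the bare name, leaving other tags untouched.
    public static String replaceDotGif(String html) {
        Matcher m = DOT_GIF_TAG.matcher(html);
        return m.replaceAll("$1");
    }

    public static void main(String[] args) {
        String in = "TGF-<img src=\"/images/beta.gif\" border=\"0\"> receptor";
        System.out.println(replaceDotGif(in)); // prints "TGF-beta receptor"
    }
}
```

Running this before the remaining tags are stripped keeps the symbol name in the text, which is the information that automatic tag removal would otherwise lose.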