jhy / jsoup

jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
https://jsoup.org
MIT License
10.89k stars 2.17k forks source link

Not parsing correctly a page from a url #391

Closed ScottMacRay closed 9 years ago

ScottMacRay commented 10 years ago

I am trying to parse a page given its url, but the footer of the page is not well parsed, although, if I download the web page as a html file (local) then jsoup parses it correctly ¿Why? Here you have the related url: http://www.uma.es/ingenieria-informatica/cms/base/ver/base/basecontent/10594/estudios-main/ I have tried to find the problem in the web page, but I could not (no bad-closed tag, for example) ¿Can you help me?

jhy commented 9 years ago

What do you mean by not well parsed -- what's wrong with it? Can you give me a code and snippet example of what's wrong and what you expected?

I parsed the URL and it looked OK at first glance.

http://try.jsoup.org/~IrB1CnXTMx08xzhuMIvdHO3MYZU

ScottMacRay commented 9 years ago

It is a long time since that (months), Jonathan. I am not working at that, now. But I can remember it was something relative to conflicts with javascript.

2014-10-12 0:06 GMT+02:00 Jonathan Hedley notifications@github.com:

What do you mean by not well parsed -- what's wrong with it? Can you give me a code and snippet example of what's wrong and what you expected?

I parsed the URL and it looked OK at first glance.

http://try.jsoup.org/~IrB1CnXTMx08xzhuMIvdHO3MYZU

— Reply to this email directly or view it on GitHub https://github.com/jhy/jsoup/issues/391#issuecomment-58766258.

jhy commented 9 years ago

OK -- sorry it took so long to get to. I'll close this, please feel free to open a new issue if you come across it again.