Closed ravindranathakila closed 10 years ago
Sorry, had to truncate the results to be within the Github length for issues.
Tested on this feed too. Results are same.
<link></link> becomes <link />
Not a bug per se. There's a proper way to parse XML: Use _Parser.xmlParser()_
final Document document = Jsoup.parse(new URL(feedUrl).openStream(), "UTF-8", feedUrl, Parser.xmlParser());
Yes, you need to use the XML parser for XML. Otherwise, the HTML parser applies HTML rules. See the correct parse at http://feeds.bbci.co.uk/news/technology/rss.xml
URL: http://feeds.bbci.co.uk/news/technology/rss.xml Try this code (output below): (At the very bottom is the original source viewed on Google Chrome)
_Note what happens to the link tags_
Output:
Google Chrome View Source: