What steps will reproduce the problem?
1. open any Wikipedia dump
2. navigate to an entry with external links (i.e. links outside Wikipedia)
3. click the link -- your browser is directed to a URL with the
*description* of the link following the actual URL.
What is the expected output? What do you see instead?
External Links should separate URL from description, and thus both display
correctly and be valid links.
I attach a patch that fixes this, as well as improves the progress report
for the tokenizing process.
Original issue reported on code.google.com by asaf.bartov on 1 Dec 2008 at 6:56
Original issue reported on code.google.com by
asaf.bartov
on 1 Dec 2008 at 6:56Attachments: