searope / jwpl

Automatically exported from code.google.com/p/jwpl

getPage("brother") doesn't retrieve the proper page #95

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Have a copy of the Wikipedia 20120502 or 20110405 dump (possibly the same on 
other versions of the data dump, but I only tested these two)
2. Do: "new Wikipedia().getPage("brother")"
3. Inspect the page returned

What is the expected output? What do you see instead?
According to Wikipedia, this should return the "Sibling" page, but in fact the 
"Brother (2000 film)" page is returned.
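
To make the expected behaviour concrete, here is a minimal sketch of the lookup I would expect. It is a plain in-memory stand-in, not JWPL code: the class `RedirectSketch`, the `REDIRECTS` map and the `resolve` helper are hypothetical names, and the only data in it are the redirects mentioned in this report.

```java
import java.util.HashMap;
import java.util.Map;

class RedirectSketch {
    // Hypothetical stand-in for a redirect table (not JWPL's schema).
    // Entries are the redirects this report expects: source title -> target title.
    static final Map<String, String> REDIRECTS = new HashMap<>();
    static {
        REDIRECTS.put("Brother", "Sibling");
        REDIRECTS.put("Sister", "Sibling");
    }

    // English Wikipedia capitalises the first letter of a page title,
    // so "brother" and "Brother" name the same page.
    static String normalize(String title) {
        if (title.isEmpty()) {
            return title;
        }
        return Character.toUpperCase(title.charAt(0)) + title.substring(1);
    }

    // Follow a single redirect if one exists, otherwise return the normalised title.
    static String resolve(String query) {
        String title = normalize(query);
        return REDIRECTS.getOrDefault(title, title);
    }

    public static void main(String[] args) {
        System.out.println(resolve("brother")); // prints "Sibling"
        System.out.println(resolve("sister"));  // prints "Sibling"
    }
}
```

With JWPL, the "sister" case behaves like this sketch, while the "brother" case ends up at "Brother (2000 film)" instead.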

What version of the product are you using? On what operating system?
JWPL 0.9.1, for both the DataMachine and the API, on 64-bit Linux.

Thanks

Original issue reported on code.google.com by ziqizhan...@googlemail.com on 16 May 2012 at 2:00

GoogleCodeExporter commented 9 years ago
I can reproduce this issue: "brother" fetches the "Brother (2000 film)" page 
instead of the "Sibling" page that "brother" should redirect to. On the other 
hand, if you search for "sister", for example, the "Sibling" page that "sister" 
redirects to is returned, and not the "Sister (Band)" page or similar. 

I don't know whether this is an isolated error, or how JWPL prioritises matches 
for a query string. The question is: why is "brother" not resolved as a redirect 
when "sister" is?

Original comment by SamyAt...@googlemail.com on 1 Jun 2012 at 1:17

GoogleCodeExporter commented 9 years ago
Actually, "brother" is a redirect (wiki.getPage("brother").isRedirect() returns 
true), but it seems to redirect to the wrong target, i.e. "Brother (2000 film)". 
So I am wondering whether there is a bug in the DataMachine...

Original comment by ziqizhan...@googlemail.com on 1 Jun 2012 at 1:26

GoogleCodeExporter commented 9 years ago
We have filed a new issue that might solve this problem: Issue 99

Original comment by oliver.ferschke on 3 Jul 2012 at 2:01