davideuler / bzreader

Automatically exported from code.google.com/p/bzreader
0 stars 0 forks source link

Wikilinks containing quote marks are misparsed #9

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. find a wikilink containing a " character (e.g. Hebrew מפא"י in Hebrew
wikipedia.
2. click it -- get page not found

Such entries are common in Hebrew, as it uses the " character in acronyms,
whereas English uses capital letters.

I'll be fixing this soon.

Original issue reported on code.google.com by asaf.bartov on 3 Dec 2008 at 2:46

GoogleCodeExporter commented 9 years ago
Fixed and commited the display issue, but the search problem remains, and is a 
known
limitation of Lucene.NET.  We use version 1.9.0.005, and Lucene.NET only began
supporting escaped characters in query terms in its version 2.1.

So we need to upgrade the Lucene.NET code we bundle, but that should be a 
separate
effort, with other possible side effects.

Original comment by asaf.bartov on 3 Dec 2008 at 6:02

GoogleCodeExporter commented 9 years ago
Specifically, we need this change:

LUCENE-573: QueryParser now allows backslash escaping in
    quoted terms and phrases. (Michael Busch via Yonik Seeley)

Seen here:
https://svn.apache.org/repos/asf/incubator/lucene.net//trunk/C%23/src/CHANGES.tx
t

Original comment by asaf.bartov on 3 Dec 2008 at 6:07

GoogleCodeExporter commented 9 years ago
Also make sure you escape the query correctly while constructing the query 
string - I use the quotes there as 
well.

Original comment by halle...@gmail.com on 5 Dec 2008 at 10:25

GoogleCodeExporter commented 9 years ago
Closing this as this actual issue is fixed now, will leave issue 10 open.

Original comment by halle...@gmail.com on 8 Feb 2009 at 11:09