InCuboEC / ebookdroid

Automatically exported from code.google.com/p/ebookdroid
0 stars 0 forks source link

Search function, hyphenation (PDF) #667

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1.Open the demo file here enclosed
2.Search for the word "hyphenation".
3.

What is the expected output?
Two occurences should be found.

What do you see instead?
The second one is not found, because of the hyphenation. A search for "hyphe" 
would be OK (two occurences found).

What version of the product are you using?
v2.1.0-rc5

On what operating system?
Android 4.2.2

What device are you using?
Tablet ASUS ME173X

Please provide any additional information below.

Original issue reported on code.google.com by philippe...@gmail.com on 20 Dec 2013 at 4:12

Attachments:

GoogleCodeExporter commented 8 years ago
Hi! This problem is still present in new releases of Ebookdroid. I guess it is 
a complicated problem, difficult to solve. I know that hyphenation belong to 
the category of "experimental" features and that it does not work yet. Is there 
a chance for a solution some day? In the meantime, the best (and only?) 
solution for not losing some occurence in a research operation is to 
de-activate the hyphenation function. Am I correct?

Original comment by philippe...@gmail.com on 27 Mar 2014 at 1:15

GoogleCodeExporter commented 8 years ago
hyphenation not experimental (if it is marked so in settings, it's a bug). Main 
problem that text extracted from DJVU or PDF is a set of textboxes: i.e. 
coordinates and char sequence. It is a limitation of used library. In some 
files hyphenation mark is part of textbox in some it is standalone textbox. So, 
for correct search we need to reconstruct original lines, and this task is very 
complicated and not always possible.

Maybe somewhen we will solve this.

Original comment by Andrei.K...@gmail.com on 27 Mar 2014 at 2:34