tdclemens / pdf2htmlEX

Convert PDF to HTML without losing text or format.
http://coolwanglu.github.com/pdf2htmlEX/

Improve Word Position Calculation Performance #6

Open tdclemens opened 11 years ago

tdclemens commented 11 years ago

Right now, the position of each word is calculated by iterating through the list of character positions from the beginning and redoing the math for every word.

Improve that algorithm by keeping track of the position of the last word, so that the calculation does not have to start from the beginning each time.
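
A rough sketch of the idea, not the project's actual code. It assumes the "math" amounts to summing per-character advances up to the word's first character, and that words are visited in increasing order of their start index. The names `word_positions_naive`, `word_positions_incremental`, `advances`, and `word_starts` are all made up for illustration.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical baseline: rescans the advances from index 0 for every word,
// so the total work grows with (number of words) x (number of characters).
std::vector<double> word_positions_naive(const std::vector<double>& advances,
                                         const std::vector<std::size_t>& word_starts) {
    std::vector<double> positions;
    positions.reserve(word_starts.size());
    for (std::size_t start : word_starts) {
        double x = 0.0;
        for (std::size_t i = 0; i < start && i < advances.size(); ++i) {
            x += advances[i];
        }
        positions.push_back(x);
    }
    return positions;
}

// Hypothetical improvement: remembers where the previous word's scan stopped
// (cursor) and the offset accumulated so far (x), then resumes from there.
// Assumes word_starts is sorted in increasing order.
std::vector<double> word_positions_incremental(const std::vector<double>& advances,
                                               const std::vector<std::size_t>& word_starts) {
    std::vector<double> positions;
    positions.reserve(word_starts.size());
    std::size_t cursor = 0;  // next character index that has not been summed yet
    double x = 0.0;          // sum of advances[0 .. cursor)
    for (std::size_t start : word_starts) {
        for (; cursor < start && cursor < advances.size(); ++cursor) {
            x += advances[cursor];
        }
        positions.push_back(x);
    }
    return positions;
}
```

With this shape, each character advance is added once across the whole pass instead of once per word that follows it. If words can arrive out of order, the cursor would have to be reset (or the words sorted first), which this sketch does not handle.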