mrvisser closed 10 years ago
Assigning to @sathomas for investigation.
I'm using Chrome 30 on Mac OS X.
Some info about the version of pdf2htmlEX that processed this, in case it is something that was recently fixed in GitHub master from our reports:
branden@qa0:~# dpkg -s pdf2htmlex
Package: pdf2htmlex
Status: install ok installed
Priority: extra
Section: universe/web
Installed-Size: 489
Maintainer: WANG Lu <coolwanglu@gmail.com>
Architecture: amd64
Version: 0.11-1~git201310172203re1b11-0ubuntu1~precise1
Depends: libc6 (>= 2.14), libfontforge1, libgcc1 (>= 1:4.1.1), libpoppler27 (>= 0.20.3), libpython2.7 (>= 2.7), libstdc++6 (>= 4.6), libpng12-0, libjpeg8
Suggests: ttfautohint
Description: Converts PDF to HTML without losing format
 pdf2htmlEX converts PDF to HTML while retaining text, format & style as much as possible
Homepage: http://github.com/coolwanglu/pdf2htmlEX
The pages also don't seem to be properly centred horizontally.
Can you provide a link to the PDF?
I've attached it in an email.
Can't reproduce using my local instance (Chrome 30 on Mac OS X as well). Are you using a server that is accessible over the Internet?
The pdf2htmlEX version is several commits behind the latest, but I don't think that's the problem here.
I was using the QA server: oae.oae-qa0.oaeproject.org
Looks like this can be closed now.
I uploaded one of our familiar documents, eels.docx (I can email it to you if you like), which contains a bunch of multi-byte characters and ltr-language text.
This is the document preview that was generated: http://screencast.com/t/05LQUkQIZM4
There should not be so much space between these pages.