yuhongfang / papercrop

Automatically exported from code.google.com/p/papercrop
GNU General Public License v2.0
1 stars 2 forks source link

Processing a book scanned with two book pages appearing on each pdf page #34

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Trying to automatically process a book scanned with two book pages appearing 
on each pdf page, with a thick black line in the middle.
2.
3.

What is the expected output? What do you see instead?
>> I expect papercrop to process two book pages on a pdf page like it would 
process a two columns paper. I think that the problem is that the thick black 
line in the middle (that is the book binding fold appearing in the middle of 
the two book pages on each pdf page) takes the entire page height, and makes it 
seem that there is an image on the entire page, rather than paragraphs on the 
left, a line in the middle, and paragraphs on the right.

I say so because on each page where the black line in the middle happens not to 
be continuous on the whole page (because the scan didn't register as strongly), 
papercrop works perfect and treats paragraphs as paragraphs, and the parts of 
the middle line that we can see are treated as images.

Is there any solution to this?
Thanks!

Let me know if there is anything unclear.

What version of the product are you using? On what operating system?
>> v.0.47 on Windows 7

Please provide any additional information below.

Original issue reported on code.google.com by o.w.i.m...@gmail.com on 9 May 2012 at 5:08