adminchen / papercrop

Automatically exported from code.google.com/p/papercrop
GNU General Public License v2.0

Suggestion: is it possible to leave text in string format? #4

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
I don't know where to post a suggestion, so I am posting it here.
If this is not the right place, please delete it.

The text in the original PDF is converted to an image, is that right?
I can't find any option to make papercrop keep the text in raw format, even in 
config.lua.
Is it possible to keep the text as strings instead of converting it to images? 
That would make the text searchable and allow annotation.
Thanks for providing this excellent app.

Original issue reported on code.google.com by qiu...@gmail.com on 20 Aug 2010 at 6:08

GoogleCodeExporter commented 8 years ago
No, currently it's not possible. It is on my to-do list, but these days I don't 
have enough time to update papercrop.

Original comment by taesoob...@gmail.com on 20 Aug 2010 at 12:21

GoogleCodeExporter commented 8 years ago
This feature would be my personal perfect Christmas gift!
Anyway, thanks for your effort.

Original comment by ilir...@gmail.com on 4 Dec 2010 at 10:35

GoogleCodeExporter commented 8 years ago
Yeah,
I think your program is perfect. The only thing missing is that when a crop is 
auto-segmented, it should scan the segment to decide whether its content is 
purely text or "graphics/equations/graph/picture/diagram." If the content is 
purely text, it should keep it as text instead of making an image of that 
segment. This would help with highlighting important text when reading a book.
If automatically determining a cropped segment's content is hard, maybe user 
intervention could be used to decide which segments to keep as text and which 
to turn into images.

Original comment by daman...@gmail.com on 5 Oct 2011 at 6:13
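
The suggestion above (keep a segment as text only when everything inside it is text, with an optional manual override) could be sketched roughly as follows. This is only an illustration and not papercrop's actual code: the `PageObject` type, the `"text"`/`"graphic"` labels, and the `classify_segment` function are all hypothetical names, and it assumes the PDF's objects and their bounding boxes have already been extracted by some other means.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# (x0, y0, x1, y1) in page coordinates; hypothetical convention for this sketch
Rect = Tuple[float, float, float, float]


@dataclass
class PageObject:
    kind: str   # "text" or "graphic" (hypothetical labels, not a papercrop type)
    bbox: Rect


def intersects(a: Rect, b: Rect) -> bool:
    """True if the two rectangles overlap with non-zero area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def classify_segment(segment: Rect,
                     objects: List[PageObject],
                     override: Optional[str] = None) -> str:
    """Return "keep_text" or "rasterize" for one cropped segment.

    A user-supplied override wins; otherwise the segment is kept as text
    only if it contains at least one object and all of them are text.
    """
    if override in ("keep_text", "rasterize"):
        return override
    inside = [o for o in objects if intersects(segment, o.bbox)]
    if inside and all(o.kind == "text" for o in inside):
        return "keep_text"
    return "rasterize"


# Example page: one paragraph of text above one figure.
page = [
    PageObject("text", (0, 0, 100, 20)),
    PageObject("graphic", (0, 50, 100, 120)),
]
text_only = classify_segment((0, 0, 100, 30), page)
with_figure = classify_segment((0, 40, 100, 130), page)
forced = classify_segment((0, 40, 100, 130), page, override="keep_text")
```

Falling back to rasterizing whenever a segment is mixed or empty keeps the current image-based behavior as the safe default.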

GoogleCodeExporter commented 8 years ago
The difficult part is not the detection algorithm but my lack of spare time. 
I need at least a week of full-time work to implement this feature, and I 
cannot afford that these days.

Original comment by taesoob...@gmail.com on 5 Oct 2011 at 3:07

GoogleCodeExporter commented 8 years ago
I have finished implementing this functionality. It is currently in an early 
alpha stage. Ubuntu_installer for Ubuntu Linux should work after installing 
openjdk7-jre.
This functionality is enabled only if the "vector PDF" output device is chosen. 
Only the default preset works.

I will upload a Windows binary after some testing, when I have time.

Original comment by taesoob...@gmail.com on 6 Jun 2012 at 7:06