4lex4 / scantailor-advanced

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
GNU General Public License v3.0

Select content is not so accurate and output processing is slow #151

Open codeitout32 opened 3 years ago

codeitout32 commented 3 years ago

I've been using the latest 2019 early access version. The content selection feature often misses things such as page numbers in the corners. My main concern is that it is very slow: processing 40 scanned pages takes around 4 minutes on a dual-core, 4-thread machine. I'm using 600 DPI for the output. Is there any setting to reduce the time required? Thanks.

Piolie commented 3 years ago

Hi. The content selection algorithm currently has those limitations. If you double-click near the part that was left out (for example, the page number), you can quickly change the content rectangle. It is manual work, but much quicker than grabbing the borders. (Read the docs.)

The output step takes a while, but I think that 4 minutes for just 40 pages is not right. In my experience, when the output step takes too long, it's because the input files are way too big or their DPI is wrong (generally too low). There is an option (Tools > Fix DPI) to fix the latter. You could also try another batch of images from a different source, to see whether the problem is with your machine or not.
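To see why a DPI that is too low could slow things down, here is a rough back-of-the-envelope sketch. It assumes the output size is derived from the page's physical size (input pixels divided by the declared DPI) multiplied by the output DPI; that is an assumption for illustration, not a description of ScanTailor's actual code, and the function name `estimated_output_size` is made up for this example.

```python
def estimated_output_size(width_px, height_px, declared_dpi, output_dpi):
    """Estimate output pixel dimensions for one page.

    Assumption (not taken from ScanTailor's source): the physical page size
    is input pixels / declared DPI, and the output is rendered at output_dpi.
    """
    if declared_dpi <= 0 or output_dpi <= 0:
        raise ValueError("DPI values must be positive")
    out_w = round(width_px * output_dpi / declared_dpi)
    out_h = round(height_px * output_dpi / declared_dpi)
    return out_w, out_h


# An A4 page scanned at 300 DPI is roughly 2480 x 3508 pixels.
correct = estimated_output_size(2480, 3508, declared_dpi=300, output_dpi=600)

# Same pixels, but the file metadata wrongly claims 72 DPI.
wrong = estimated_output_size(2480, 3508, declared_dpi=72, output_dpi=600)

for label, (w, h) in (("declared 300 DPI", correct), ("declared 72 DPI", wrong)):
    print(f"{label}: {w} x {h} px, about {w * h / 1e6:.0f} megapixels")
```

Under that assumption, the page with the wrong 72 DPI tag ends up with many times more output pixels than the correctly tagged one, which would be consistent with the output step slowing down until the DPI is fixed.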

Good luck.

zahinwadud commented 1 year ago

I faced the same problem. 'Select Content' wasn't accurate and 'Output' was way too slow. @Piolie Thanks for your reply. DPI was the culprit. Once I fixed the DPI settings, everything went back to normal.