4lex4 / scantailor-advanced

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
GNU General Public License v3.0
1.15k stars 128 forks source link

What is the recognition zone for? #116

Closed d4fe closed 4 years ago

d4fe commented 4 years ago

What is the recognition zone for? When is this feature useful? (Automatic translation).

Piolie commented 4 years ago

The automatic translation is not helping here. What is the "recognition zone"? To which step are you referring?

d4fe commented 4 years ago

111

jeremydmoore commented 4 years ago

The "recognition zone" during the Select Content step sets the border of the page that will be used for "original margins" in Step 5. I use this feature to set the page size for all of my images, but I don't think it's necessary unless you're trying to match the original page layout.

d4fe commented 4 years ago

That is, this is the zone with which the program works, and everything else just forms from consideration? "Original fields" - your expression, or is it called in the program?

Piolie commented 4 years ago

I had to experiment a while to figure out how it works.

The Page Box draws an orange rectangle that represents the physical page. That is, the page contents + white margins. The Content Box is the blue rectangle that indicates where the content (text + drawings) is located in the page. Everything outside the blue rectangle is filled white.

As jeremydmoore said, if each rectangle is correctly positioned and Auto Margins from step 5 is activated, STA produces a page that closely matches the original layout.

Suppose you are processing a book that measures 20 cm x 24 cm. Then you can set a Page Box with Width = 200 mm and Height = 240 mm, press Apply To and select All pages. If you then use Auto Margins the output images will measure 4724 pix x 5669 pix (which at 600 dpi means 20 cm x 24 cm) and the contents will be positioned approximately in the same place as in the original scan.

Reference pictures

PageBox
AutoMargins

d4fe commented 4 years ago

I do not understand what benefit this function can give in practice?

Piolie commented 4 years ago

The benefit you get (if used correctly) is the images in the out folder will have the dimensions of the physical pages, and the contents aligned in exactly the same way as the physical pages you scanned.

For some people achieving this level of fidelity is irrelevant. I personally find it very satisfying. To each its own.

d4fe commented 4 years ago

That is, to keep the original position of the text (and pictures) relative to the edges of the page?

Piolie commented 4 years ago

Yes. And also to keep the original size of the pages.