4lex4 / scantailor-advanced

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
GNU General Public License v3.0
1.18k stars 129 forks source link

feature_request(split_pages): split by horizontally #68

Closed Kristinita closed 5 years ago

Kristinita commented 5 years ago

1. Summary

It would be nice, if would be possible cut pages by horizontally, not by vertically solely, in Split Pages stage.

2. Example of expected behavior

Pay attention to blue horizontal line in image top:

Horizontal line

It would be nice, if user can remove area above it for further page processing, use Apply cut option of Split Pages filter.

3. Data

4. Argumentation

4.1. Summary

Less manual working for users, more automatic.

4.2. Details

Problems for scans, that I need to process — black borders in top, left and right of the books pages. They can have different size and positions. Examples:

24 borders

55 borders

Select Content filter can select this borders. I need to make manual actions, that deselect them. If would be possible cut area above top border, Select Content will not select borders; user will have more chances automatically get correct selected area.

5. Current behavior

24

44

See also pages 10, 12, 14, 32, 64, 66, 70, 80 and so on in KiraSuperheroSelectContent.ScanTailor. If would be possible split pages by horizontally, possibly, selected area would be more expected.

6. Environment

Thanks.

4lex4 commented 5 years ago

That's because your source is not from a scanner and has already been incorrectly processed with a program. For correct scans ST automatically excludes the outside scanner area on finding content as it is.

Instead of moving split lines try to use https://github.com/4lex4/scantailor-advanced#auto-adjusting-content-area

4lex4 commented 5 years ago

Just remembered, use page area instead of split lines, as the page area feature is intended to solve problems like yours and this is more effective than that you're suggesting!

Go to the fourth stage:

  1. Set page area to manual mode and adjust orange frame so that trash must be outside the orange frame
  2. Apply page area to all pages via checking Apply page box checkbox (you can use apply to every other page to separately set the page area for even and odd pages)
  3. Check (and adjust if necessary) the page area for each page (the page area frame can be moved via Shift button pressed just like the content one), trash must be outside the orange frame.
  4. Set content detection to Auto, Apply to all the pages (having unchecked Apply page box!) and then run batch processing. Auto content will be searched only inside the page area.


Some extra info and a similar situation can be found here (in Russian):
https://www.forum.ru-board.com/topic.cgi?forum=5&bm=1&topic=32945&start=3024#lt