The ScanTailor version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions and adds new features and fixes.
ScanTailor is an interactive post-processing tool for scanned pages. It performs operations such as:
You give it raw scans, and you get pages ready to be printed or assembled into a PDF or DjVu file. Scanning, optical character recognition, and assembling multi-page documents are out of scope of this project.
Go to this repository and follow the instructions given there.
Unfortunately, the repository of @4lex4 seems to be no longer active.
For this reason I have created this fork. It contains:
PRs are highly welcome ;-)
The Auto margins feature keeps page content in its original place. In the Margins step you can choose from Auto, Manual (default) and Original modes. Manual mode is the original behavior. Auto mode tries to decide whether it is better to align the page to the top, bottom or center. Original mode keeps the page at its original vertical position.
This feature has been improved. See page area feature description. Also see ScanTailor Advanced fixes & improvements
The Page detect feature detects the page within black margins, or lets you switch off page content detection and keep the original page layout.
This feature has been reworked. See ScanTailor Advanced fixes & improvements for more information.
The Deviation feature highlights pages that differ from the others. Pages are highlighted in red in the Deskew filter when their skew is too high, in the Select Content filter when their content size differs, and in the Margins filter when they do not match the others.
This feature has been reworked. See ScanTailor Advanced fixes & improvements for more information.
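One simple way to flag such outlier pages is to compare each page's value (for example its skew angle) with the average over all pages. The sketch below does that with a standard-deviation cutoff. It is only an illustration: the function name and the `coefficient` parameter are hypothetical, and the criterion the program actually uses may differ.

```python
from statistics import mean, pstdev
from typing import List


def high_deviation_pages(values: List[float], coefficient: float = 1.5) -> List[int]:
    """Return indices of pages whose value is far from the average.

    `values` holds one number per page (e.g. skew angle in degrees).
    `coefficient` is a hypothetical cutoff in standard deviations.
    """
    average = mean(values)
    spread = pstdev(values)
    if spread == 0:
        # All pages are identical, so nothing stands out.
        return []
    return [i for i, v in enumerate(values) if abs(v - average) > coefficient * spread]


# Skew angles for five pages; only the fourth page is clearly off.
print(high_deviation_pages([0.1, 0.2, 0.1, 5.0, 0.15]))
```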
The Picture shape feature adds an option for mixed pages to choose between free-shape and rectangular-shape images. This patch does not improve the original algorithm; it builds rectangles from the detected "blobs" and joins the rectangles that intersect into one.
This feature has been reworked. See rectangular picture shape feature description.
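Joining intersecting rectangles, as described for the original patch, can be sketched as repeated pairwise merging until no two rectangles overlap. This is a minimal illustration under assumed conventions (rectangles as `(left, top, right, bottom)` tuples, hypothetical function names), not the code used by the program.

```python
from typing import List, Tuple

# (left, top, right, bottom); right and bottom are exclusive.
Rect = Tuple[int, int, int, int]


def intersects(a: Rect, b: Rect) -> bool:
    """True if the two rectangles share any area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def union(a: Rect, b: Rect) -> Rect:
    """Smallest rectangle that contains both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def merge_intersecting(rects: List[Rect]) -> List[Rect]:
    """Join intersecting rectangles until none of them overlap."""
    merged = list(rects)
    changed = True
    while changed:
        changed = False
        for i in range(len(merged)):
            for j in range(i + 1, len(merged)):
                if intersects(merged[i], merged[j]):
                    merged[i] = union(merged[i], merged[j])
                    del merged[j]
                    changed = True
                    break
            if changed:
                break  # restart the scan, the merged box may now touch others
    return merged


print(merge_intersecting([(0, 0, 10, 10), (5, 5, 15, 15), (20, 20, 30, 30)]))
```

The scan restarts after every merge because a grown rectangle can start to overlap a neighbour it did not touch before.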
This lets you expand and undock the thumbnails view to see more thumbnails at a time.
This feature had performance and drawing issues and has been reworked.
You can move the topmost (bottom-most) horizontal blue line of the dewarping mesh up and down as a whole by grabbing it at the leftmost (rightmost) red point while holding down the CTRL key.
You can create rectangular picture zones by holding down the CTRL key. Holding down the CTRL key also lets you move the corners of (rectangular) picture zones orthogonally.
This feature has been reworked and is now a part of new zone interaction modes feature.
Check "auto-save project" in the Settings menu and your project will be auto-saved, provided you have saved the new project at least once. This also works during batch processing.
This feature had performance issues and has been optimized.
Another rectangular picture zone shape. This option is based on Picture shape, Square picture zones. It squeezes every Picture shape zone down to the real rectangular picture outline and then replaces it (the resulting raster zone) by a vector rectangular zone, so that a user could easily adjust it afterwards (by moving its corners in an orthogonal manner).
This feature has been reworked. See rectangular picture shape feature description.
An automatic dewarping mode. Works ONLY with such raw scans that have the top and bottom curved page borders (on the black background). It automatically sets the red points of the blue mesh along these borders (to create a distortion model) and then dewarps the scan according to them. Works best on the low-curved scans.
Note: Other features of this version, such as Export, Dont_Equalize_Illumination_Pic_Zones and Original_Foreground_Mixed, haven't been moved because of their messy implementation. Their functionality is fully covered by the full control over settings on output and splitting output features.
Improvements for the thumbnail view.
Alt+Wheel.
Fixed some bugs of the official and Enhanced versions.
Portability.
The settings and program files are stored in the folder with the application.
Note: If installed into a system directory, where config and data files can't be written into the
folder with the application executable, ScanTailor Advanced works as a standalone app and stores
its settings and application data in the appropriate system specific paths.
Page splitting settings now affect the output by filling the offcut. A Fill offcut option has been added.
Page layout and all the other views now consider splitting settings. Corresponding improvements are done for thumbnails.
Changed ScanTailor behavior on page split stage.
Reworked the multi-column thumbnails view feature from ver. Enhanced. Thumbnails are now shown evenly.
Added option to control highlighting the thumbnails of pages with high deviation with red asterisks. The option refreshes the thumbnails instantly.
Deviation feature reworked.
Page/content boxes and auto margins features fixes & improvements.
Shift+LMB combination.
Auto and Original alignment modes reworked:
Changed the way the despeckle strength is adjusted.
It is now set via a slider, which makes it possible to adjust the despeckle strength more smoothly and precisely.
Value 1.0 matches the old cautious mode, 2.0 - normal and 3.0 - aggressive.
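The correspondence between the old discrete modes and the new slider values can be written down as a small lookup. The helper names below are hypothetical and only illustrate the mapping stated above.

```python
# Slider values that correspond to the old discrete despeckle modes.
LEGACY_DESPECKLE_STRENGTH = {"cautious": 1.0, "normal": 2.0, "aggressive": 3.0}


def strength_from_legacy_mode(mode: str) -> float:
    """Slider value equivalent to an old mode name."""
    return LEGACY_DESPECKLE_STRENGTH[mode]


def closest_legacy_mode(strength: float) -> str:
    """Old mode whose slider value is nearest to the given strength."""
    return min(
        LEGACY_DESPECKLE_STRENGTH,
        key=lambda mode: abs(LEGACY_DESPECKLE_STRENGTH[mode] - strength),
    )


print(strength_from_legacy_mode("normal"))
print(closest_legacy_mode(2.6))
```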
Improvements on the thumbnails view and navigation:
Saving selection of pages on filter switch.
Separate highlighting for selection leader in thumbnails.
Navigating between selected pages.
Use Shift+PgUp/Q and Shift+PgDown/W to navigate between selected pages.
Added buttons to navigate between pages.
Multi page selection mode without using keyboard.
A prompt asking whether to cancel multi-page selection.
Go to a page by its number (Ctrl+G shortcut).
Added options in the settings to manage the quality and size of thumbnails.
It's possible to switch thumbnail quality while working on a project with the changes applied immediately.
For every quality chosen a cache is created.
Usage example: you can use this feature for previewing, which is faster than simply navigating between pages.
Undock the thumbnails panel and set the quality to, for example, 700 and the size to 1000. You can mark problem pages with Ctrl+Click on a thumbnail and navigate between these selected pages afterwards to fix them. To finish the preview, change the quality and size values back and dock the panel.
Fixed other bugs of official, Enhanced and Featured versions and made lots of other improvements.
You can choose a desired color scheme in settings.
This significantly increases the speed of processing. The count of threads to use can be adjusted while processing.
Warning! More threads require more memory. Avoid thread counts that would exhaust the available memory.
This feature lets you control filling margins, normalizing illumination before binarization, normalizing illumination in color areas, and the Savitzky-Golay and morphological smoothing options at the output stage in any mode (limited, of course, to the settings that apply in the current mode).
Now outside pixels can be filled with the background color of the page.
Added filling setting with the following options:
TIFF compression options let you disable compression or change the compression method used in TIFF files.
There are two options in settings dialog: B&W and color compression.
Sauvola and Wolf binarization algorithms have been added. They can be applied when normalizing illumination does not help.
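Sauvola's method picks a separate threshold for every pixel from the mean and standard deviation of its neighbourhood, which is why it can help where global illumination normalization does not. Below is a minimal pure-Python sketch of the textbook formula `T = m * (1 + k * (s / R - 1))`. The function name, the window size and the `k` value are assumptions chosen for illustration; they are not the program's implementation or defaults.

```python
import math
from typing import List


def sauvola_binarize(gray: List[List[int]], window: int = 15,
                     k: float = 0.34, r: float = 128.0) -> List[List[int]]:
    """Binarize an 8-bit grayscale image (list of rows) with Sauvola thresholds.

    Returns 0 for foreground (dark) pixels and 255 for background pixels.
    """
    height, width = len(gray), len(gray[0])
    half = window // 2
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            # Neighbourhood clipped to the image borders.
            values = [
                gray[j][i]
                for j in range(max(0, y - half), min(height, y + half + 1))
                for i in range(max(0, x - half), min(width, x + half + 1))
            ]
            local_mean = sum(values) / len(values)
            variance = sum((v - local_mean) ** 2 for v in values) / len(values)
            threshold = local_mean * (1 + k * (math.sqrt(variance) / r - 1))
            row.append(0 if gray[y][x] < threshold else 255)
        result.append(row)
    return result


# A light 5x5 page with a single dark pixel in the middle.
page = [[200] * 5 for _ in range(5)]
page[2][2] = 20
binary = sauvola_binarize(page, window=5)
print(binary[2][2], binary[0][0])
```

This direct version recomputes every window from scratch and is slow on real scans; practical implementations usually rely on integral images for the local sums.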
The feature splits the mixed output scans into pairs of a foreground (letters) layer and a background (images) layer.
You can choose between B&W or color (original) foreground.
It can be useful:
Note: That does not rename files to 0001, 0002... Renaming can be done with a third-party app, for example Bulk Rename Utility.
This feature is a part of the splitting output feature.
It preserves the original image background in a format ready for further processing when the BW foreground is used. It can be used to encode pages with a complex background into DjVu using the semi-auto "split layers" method, which gives much higher quality results than the DjVu auto segmenter. This feature can also be used to extract high-contrast elements of gradient images into the foreground layer by processing the layer with pictures ("background") a second time.
Properties of the original background:
Black (#000000) and white (#ffffff) colors of the original background image are reserved: they are converted to #010101 and #fefefe, respectively.
Color segmentation and posterization (color quantization) features have been implemented.
Color segmentation splits the image into color segments and colorizes the b&w mask. Posterization reduces the number of colors in the image by grouping similar colors. The main use of posterization is to apply it to a segmented image to get an indexed image that can then be encoded into DjVu as the foreground layer. This makes it possible to create high-quality DjVu files with color text and elements at the maximal compression level. Posterization can also be used in color mode and applied to ordinary color gradient images for other purposes, for example, to increase their compression efficiency.
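To illustrate the idea of grouping similar colors and then building an indexed image, here is a small sketch that uses plain uniform quantization of each channel. This is a deliberately simple stand-in technique, not the posterization algorithm implemented in the program; the function names and the `levels` parameter are hypothetical.

```python
from typing import Dict, List, Tuple

Color = Tuple[int, int, int]  # 8-bit (red, green, blue)


def posterize(pixels: List[Color], levels: int = 4) -> List[Color]:
    """Snap every channel to the centre of one of `levels` equal bins."""
    step = 256 / levels

    def quantize(value: int) -> int:
        index = min(int(value // step), levels - 1)
        return int(index * step + step / 2)

    return [(quantize(r), quantize(g), quantize(b)) for r, g, b in pixels]


def to_indexed(pixels: List[Color]) -> Tuple[List[Color], List[int]]:
    """Build a palette and replace each pixel with its palette index."""
    palette = sorted(set(pixels))
    lookup: Dict[Color, int] = {color: i for i, color in enumerate(palette)}
    return palette, [lookup[p] for p in pixels]


pixels = [(0, 0, 0), (10, 20, 30), (255, 255, 255), (200, 100, 64)]
palette, indices = to_indexed(posterize(pixels, levels=4))
print(palette)
print(indices)
```

After posterization the first two pixels fall into the same bin, so four input colors collapse into a three-entry palette.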
"Quadro" picture shape mode from Featured was merged with the Rectangular one from Enhanced. Also removed the restriction of ver. Featured on deleting all the auto zones; previously, deleting them all reset the auto zones back. Added a sensitivity option: at 25% the results are the same as in the old "Quadro" mode, at 100% the same as in the old "Rectangular" mode.
Z to switch to polygonal mode.
X to switch to lasso (free drawing) mode.
C to switch to rectangular mode.
Z or X while creating a new zone to switch between polygonal and rectangular modes respectively.
Shift+LMB on a zone - drag the zone.
Ctrl+Shift+LMB on a zone - drag a copy of the zone.
Ctrl+Alt+Click - copy the latest created zone to the current cursor position.
Del when the cursor is over a zone - delete the zone.
D when the cursor is over a zone vertex - delete the vertex.
Ctrl when dragging a zone vertex - make the angle of the vertex right.
The save is precise and considers the image transformations.
Also added switching between the output tabs with the Ctrl+1..5 key combinations.
The settings are available in the main window menu. Available units: pixels, millimetres, centimetres and inches.
The system affects every aspect of the program, so, for example, it's now possible to adjust margins in pixels, not only in millimetres or inches.
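Converting between these units only requires the image resolution. The sketch below shows the arithmetic for a given DPI; the function names and unit keys are assumptions made for this example and do not reflect the program's internal API.

```python
# How many of each physical unit make up one inch.
UNITS_PER_INCH = {"inches": 1.0, "centimetres": 2.54, "millimetres": 25.4}


def to_pixels(value: float, unit: str, dpi: float) -> float:
    """Convert a length in the given unit to pixels at the given resolution."""
    if unit == "pixels":
        return value
    return value / UNITS_PER_INCH[unit] * dpi


def from_pixels(pixels: float, unit: str, dpi: float) -> float:
    """Convert a length in pixels to the given unit at the given resolution."""
    if unit == "pixels":
        return pixels
    return pixels / dpi * UNITS_PER_INCH[unit]


# A 10 mm margin on a 600 DPI scan, expressed in pixels.
print(round(to_pixels(10, "millimetres", 600), 1))
```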
The panel shows the following information: zone creation mode icon, mouse position relative to the image, physical size of the image, position of the selected page in the current order, and the page name and type ([L] or [R] - left or right page, if the page has been split).
This feature is also affected by measurement units system.
Default parameters system supporting custom profiles has been implemented.
The system lets you manage the default filter settings for every stage. Those filter parameters will be set as defaults for any new project created.
For example, you can set your own default margins instead of the built-in 5, 10, 5, 10 mm, and likewise for the other parameters.
Peculiarities:
config/profiles folder or in a system-specific one for application data.
Now group boxes containing filter options can be collapsed/expanded.
The collapse status is preserved between restarts of the application.
Double-click on content to automatically adjust the content area.
If the content is outside the area, the latter is automatically expanded and adjusted to the content at the position of the double-click; otherwise the area edge nearest to that position is adjusted (while clicking, hold Shift to select the left or right edge only, Ctrl to select the top or bottom one, or Shift+Ctrl to adjust both the nearest vertical and horizontal edges).
Correcting the content area is now much faster if, for example, the page number has been missed by the auto algorithm. It is no longer necessary to laboriously move the corners and edges of the content box by hand.
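The edge selection rules for a double-click inside the content area can be sketched as follows. This is an interpretation of the description above, with a hypothetical function name and rectangle layout; the program's own logic may handle ties and coordinates differently.

```python
from typing import Set, Tuple


def edges_to_adjust(rect: Tuple[float, float, float, float],
                    point: Tuple[float, float],
                    shift: bool = False, ctrl: bool = False) -> Set[str]:
    """Pick which content-box edges a double-click at `point` should move.

    `rect` is (left, top, right, bottom); `point` is (x, y) inside it.
    """
    left, top, right, bottom = rect
    x, y = point
    nearest_side = "left" if abs(x - left) <= abs(x - right) else "right"
    nearest_cap = "top" if abs(y - top) <= abs(y - bottom) else "bottom"
    if shift and ctrl:
        return {nearest_side, nearest_cap}  # both nearest edges
    if shift:
        return {nearest_side}               # left or right edge only
    if ctrl:
        return {nearest_cap}                # top or bottom edge only
    # No modifier: whichever of the four edges is closest to the click.
    side_distance = min(abs(x - left), abs(x - right))
    cap_distance = min(abs(y - top), abs(y - bottom))
    return {nearest_side} if side_distance <= cap_distance else {nearest_cap}


box = (0, 0, 100, 200)
print(edges_to_adjust(box, (90, 150)))
print(sorted(edges_to_adjust(box, (90, 150), shift=True, ctrl=True)))
```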
This feature makes it possible to process images with light content on a dark background correctly by adapting the auto algorithms.
Peculiarities:
This feature gives you a more flexible and precise way of positioning content in the page layout.
Guides are horizontal or vertical lines you can display on a page at the margins stage (when aligning is enabled).
Note: Guides adapt to the page soft margins, i.e. when the latter change, the guides on the page are automatically adjusted to match the new content position without any manual re-adjusting.
Capabilities:
Right-click to create/remove guides from the context menu.
Right-click on a guide to delete that guide from the context menu.
Ctrl+Alt+LMB - drag the guide under the cursor.
Shift/Ctrl+LMB on the content rectangle - drag the page content. Hold Shift to restrict movement to the horizontal axis only or Ctrl for the vertical one. Hold Shift+Ctrl for usual dragging.
Double-click on content - automatically attach that content to the nearest guide. Hold Shift to select vertical guides only or Ctrl for horizontal ones. Hold Shift+Ctrl to attach it to both the nearest vertical and horizontal guides.
This software is licensed under GNU GPLv3; you can read more about it in our LICENSE file.