Handwritten digits detection using a YOLOv8 detection model and ONNX pre/post processing. An example of how model works in real world scenario can be viewed at https://thawro.github.io/web-object-detector/.
The dataset consists of images created with the use of a HWD+ dataset.
The HWD+ dataset consists of gray images of single handwritten digits in high resolution (500x500 pixels).
The yolo_HWD+
dataset is composed of images which are produced with the use of HWD+
dataset. Each yolo_HWD+
image has many single digits on one image and each digit is properly annotated (class x_center y_center width height). The processing of HWD+
to obtain yolo_HWD+
:
HWD+
images have a lot of white background around)imgsz
and apply transform to it (pre_transform
attribute) - e.g. RGB shift/shufflenrows * ncols
digit images and form a nrows x ncols grid.obj_transform
attribute) - e.g. invert color, RGB shift/shufflepost_transform
attribute) - e.g. rotationExample below:
Each pipeline step is done with ONNX models. The complete pipeline during inference is the following: