Data extraction with ML
Sparrow is an innovative open-source solution designed for efficient data extraction and processing from various documents and images. It seamlessly handles forms, invoices, receipts, and other unstructured data sources. Sparrow stands out with its modular architecture, offering independent services such as OCR, Donut fine-tuning/inference, and a data labeling UI, all optimized for robust performance.
Follow the install steps outlined here:
Donut Data install steps
Donut ML install steps
Donut UI install steps
Follow the steps outlined here:
Donut Data usage steps
Donut ML usage steps
Donut UI usage steps
Sparrow UI:
Licensed under the Apache License, Version 2.0. Copyright 2020-2024 Katana ML, Andrej Baranovskij. Copy of the license.