naderabdalghani / doclense

A simple web app that utilizes image processing techniques and a deep convolutional neural network to extract text from photos of computer-printed text into Microsoft Word .docx files
Apache License 2.0
0 stars 0 forks source link
convolutional-neural-networks flask image-processing ocr-recognition python pytorch resnet-50


Logo

An app for extracting text from camera-captured photos into .docx files

Table of Contents

About The Project

App Showcase

Built With

Getting Started

Prerequisites

Installation

  1. Create a virtual environment cd <project-directory>

    • On Unix-based OS's: $ python3 -m venv venv
    • On Windows: > py -3 -m venv venv
  2. Activate the environment

    • On Unix-based OS's: $ . venv/bin/activate
    • On Windows: > venv\Scripts\activate
  3. Install app dependencies pip install -r requirements.txt

  4. Create the following directories in the project main directory

    • <project-directory>\results
    • <project-directory>\uploads
  5. Download the trained model state dictionary and place it in <project-directory>\model\

  6. [Optional] Download the dataset used and extract it in <project-directory>\model\

Running

Usage

Simply click on the 'Upload' button and select a photo that contains printed text. Click on the 'Submit' button and wait briefly for your .docx file to start downloading.

Results

Test Case 0

test_0

Test Case 1

test_1

Roadmap

List of Proposed Improvements

Contributors

Acknowledgements