awslabs / handwritten-text-recognition-for-apache-mxnet

This repository lets you train neural network models for end-to-end full-page handwriting recognition on the IAM Dataset using the Apache MXNet deep learning framework.
Apache License 2.0

Retrain the model with additional dataset #46

Open vdinesh18 opened 4 years ago

vdinesh18 commented 4 years ago

Hello, how can I retrain the model with a new dataset? I looked at the XML files for the bounding box information, but it looks different.

What are the preparation steps for retraining the model?

Please provide information!

Thanks, Dinesh

jonomon commented 4 years ago

Hi Dinesh, you'll have to write your own MXNet Dataset class (https://mxnet.apache.org/api/python/docs/tutorials/packages/gluon/data/datasets.html) for your specific dataset.

Then run through the notebooks 1_b, 2, 3, and 4 with your dataset.
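
For reference, a custom dataset class could look roughly like the sketch below. This is only an illustration under assumptions, not the repository's own code: the class name `LineTextDataset`, the tab-separated labels file (`filename<TAB>transcription`), and the `load_image` parameter are all hypothetical. The only MXNet piece used is the `mxnet.gluon.data.Dataset` base class, which expects `__getitem__` and `__len__`. What each notebook expects per item (image, text, bounding boxes) should be checked against the IAM dataset class in the repository.

```python
import csv
import os

try:
    from mxnet.gluon.data import Dataset  # real MXNet base class
except ImportError:
    # Fallback so the sketch can still be run without MXNet installed.
    Dataset = object


class LineTextDataset(Dataset):
    """Hypothetical dataset: one image per text line plus a TSV of labels.

    Assumed labels file format (not from the repository):
        <image filename><TAB><transcription>
    """

    def __init__(self, image_dir, labels_tsv, load_image=None):
        self._image_dir = image_dir
        # By default return raw bytes; pass an image decoder to get arrays.
        self._load_image = load_image or self._read_bytes
        with open(labels_tsv, newline="", encoding="utf-8") as f:
            reader = csv.reader(f, delimiter="\t", quoting=csv.QUOTE_NONE)
            # Skip blank or malformed rows that lack a transcription.
            self._items = [(row[0], row[1]) for row in reader if len(row) >= 2]

    @staticmethod
    def _read_bytes(path):
        with open(path, "rb") as f:
            return f.read()

    def __len__(self):
        return len(self._items)

    def __getitem__(self, idx):
        filename, text = self._items[idx]
        image = self._load_image(os.path.join(self._image_dir, filename))
        return image, text
```

With MXNet installed, passing something like `load_image=mx.image.imread` should return decoded image arrays instead of bytes. The transcriptions are returned as plain strings here, so they would still need to be encoded into whatever label format the notebooks use before batching.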