Open funkyvoong opened 1 year ago
Herbarium of the Future Paper: https://www.cell.com/trends/ecology-evolution/fulltext/S0169-5347(22)00295-6
Detecting Handwritten and Printed Text from Doctor's Notes: https://www.proquest.com/docview/2505259735?pq-origsite=gscholar&fromopenview=true
Updates (27th June 2023):
Datasets in use:
Handwritten
Machine Printed
Unused for now:
Work done:
Model Performance: Our pipeline accuracy is currently capped at 80%. I have identified the problematic images that the model misclassified, and I am adding more suitable images to counter this problem.
I think this is a good direction, because the COCO-Text dataset was not included this time (last week, it was part of the training set). Preprocessing each dataset individually appears to be effective, since our model's performance has not dropped much without COCO-Text.
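For reference, here is a minimal sketch of how the misclassified images could be listed and grouped by error type. It is not the pipeline's actual code: the function names, file names, and the two labels ("handwritten", "printed") are assumptions made for illustration, and it only expects parallel lists of image paths, true labels, and predicted labels.

```python
from collections import Counter


def find_misclassified(paths, true_labels, predicted_labels):
    """Return (accuracy, errors), where errors is a list of
    (path, true_label, predicted_label) for every wrong prediction."""
    if not (len(paths) == len(true_labels) == len(predicted_labels)):
        raise ValueError("paths, true_labels and predicted_labels must have the same length")
    errors = [
        (path, true, pred)
        for path, true, pred in zip(paths, true_labels, predicted_labels)
        if true != pred
    ]
    # Accuracy is the fraction of correct predictions; 0.0 for an empty input.
    accuracy = (len(paths) - len(errors)) / len(paths) if paths else 0.0
    return accuracy, errors


def error_breakdown(errors):
    """Count errors per (true_label, predicted_label) pair."""
    return Counter((true, pred) for _, true, pred in errors)


# Hypothetical data, for illustration only.
paths = ["img_001.png", "img_002.png", "img_003.png", "img_004.png", "img_005.png"]
true_labels = ["handwritten", "printed", "printed", "handwritten", "printed"]
predicted_labels = ["handwritten", "printed", "handwritten", "handwritten", "printed"]

accuracy, errors = find_misclassified(paths, true_labels, predicted_labels)
print(f"accuracy: {accuracy:.2f}")
for path, true, pred in errors:
    print(f"{path}: labelled {true}, predicted {pred}")
print(error_breakdown(errors))
```

Grouping the errors by (true label, predicted label) pair may help decide which kind of images to add next, for example more printed samples if printed text is often predicted as handwritten.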
Current Tasks:
Updates (30th June 2023):
Updates (3rd July 2023):
Updates (24th July 2023):
Collecting some candidates for handwritten datasets:
Another option is to synthesize handwritten data. See ScrabbleGAN: https://github.com/amzn/convolutional-handwriting-gan