PlathC / VitalRecordAnalyser

This repos host the code of our paper French vital records data gathering and analysis through image processing and machine learning algorithms.
1 stars 0 forks source link

Enhance handwritten text segmentation #5

Closed PlathC closed 3 years ago

PlathC commented 4 years ago

The following documentation might be useful to implement a more precise text segmentation in order to enhance the final result :

This one described some processing that have been used to process some pages that could be similar to our case :

PlathC commented 4 years ago
PlathC commented 4 years ago

Tested with following result :

91046856-72a1d200-e619-11ea-9b05-bfd53ace806e.png

Python source code

PlathC commented 4 years ago

In order to provide uniform image to the recognition system, we may needs to deskew text line based on some known techniques :