Open emigomez opened 2 years ago
I have install easyocr with pip, I don't know if maybe this is the problem and i have to installed with this repo from source
To train new set of characters, you can set new_prediction
to True
.
What's the effect of set new_prediction to True? Can I still do a finetuning of the latin model or does it means that a do a complete retraining?
@emigomez I'm pretty sure this is due to the shape of the network, I used:
input_channel = 1 output_channel = 256 hidden_size = 256
And it gets past the error, but now I'm having trouble with the character set, I don't know what to use for lating_g2 but I get this error
UnicodeEncodeError: 'charmap' codec can't encode characters in position 654-701: character maps to <undefined>
@emigomez Are you able to solve the mismatch
error? I am getting same error
Is there a solution for the mismatch error please?
Hello! Did anyone find a solution for the mismatch error? I'm facing the same for english_g2.pth model and I'm following this article to finetune: https://pub.towardsai.net/how-to-fine-tune-easyocr-to-achieve-better-ocr-performance-1540f5076428
For the latin model, I've put the parameters as such:
'number': '', 'symbol': '', 'langchar': " !\"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^`abcdefghijklmnopqrstuvwxyz{|}~ªÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝÞßàáâãäåæçèéêëìíîïðñòóôõöøùúûüýþÿĀāĂ㥹ĆćČčĎďĐđĒēĖėĘęĚěĞğĨĩĪīĮįİıĶķĹĺĻļĽľŁłŃńŅņŇňŒœŔŕŘřŚśŞşŠšŤťŨũŪūŮůŲųŸŹźŻżŽžƏƠơƯưȘșȚțə̇ḌḍḶḷṀṁṂṃṄṅṆṇṬṭẠạẢảẤấẦầẨẩẪẫẬậẮắẰằẲẳẴẵẶặẸẹẺẻẼẽẾếỀềỂểỄễỆệỈỉỊịỌọỎỏỐốỒồỔổỖỗỘộỚớỜờỞởỠỡỢợỤụỦủỨứỪừỬửỮữỰựỲỳỴỵỶỷỸỹ€",
Hello,
I'm trying to fine-tune the latin_g2 model (based on your script on the /trainer folder), but I'm having errors in the execution. My first step was to finetune the english_g2 model, with that I hadn't any problems in the training but also none during the inference.
This is the error at the beginning of the fine tuning with the latin_g2 model (I also tried with latin model):
I think that the error is because I have set badly the symbol and lang_char parameter of the yml file for this latin_g2 model. I tried with several options like the original one use for english, antoher lists write by my and the list specified for this model in https://github.com/JaidedAI/EasyOCR/blob/master/easyocr/config.py#L149. But the result for all of this test is the same, the above error.
This is the .yml file that I have used for the fine tuning:
In that sense I have some questions:
Thank you for your help