Normalization failed for string 'ଜୀବନକୁ ନିବିଡ଼ ଭାବେ ଏକନ୍ୱିତ କରିଛନ୍ତି'
Invalid start of grapheme sequence:D=0xb71
Normalization failed for string 'ପରମ୍ପରାକୁ ଅବଲମ୍ୱନ କରିଛନ୍ତି, ସେତିକି ମଧ୍ୟ'
Invalid start of grapheme sequence:M=0xb48
Normalization failed for string 'ଦ୍ୱୈତ ରୂପରେ ଦେଖିଥିଲେ, ଏଠାରେ ପୁରୁଷ'
Invalid start of grapheme sequence:M=0xb47
Normalization failed for string 'ତାଙ୍କ ହୃଦୟ ବିଭୋର ହୋଇଛି ସମ୍ୱେଦନଶୀଳତାରେ;'
Invalid start of grapheme sequence:D=0xb71
I'm getting this error while training the tesseract ocr model for Oriya language , please help me to resolve this issue .
I'm attaching the ground truth files .
Training on tesseract 4.1.1 :
tesseract 4.1.1
leptonica-1.82.0
Normalization failed for string 'ଜୀବନକୁ ନିବିଡ଼ ଭାବେ ଏକନ୍ୱିତ କରିଛନ୍ତି' Invalid start of grapheme sequence:D=0xb71 Normalization failed for string 'ପରମ୍ପରାକୁ ଅବଲମ୍ୱନ କରିଛନ୍ତି, ସେତିକି ମଧ୍ୟ' Invalid start of grapheme sequence:M=0xb48 Normalization failed for string 'ଦ୍ୱୈତ ରୂପରେ ଦେଖିଥିଲେ, ଏଠାରେ ପୁରୁଷ' Invalid start of grapheme sequence:M=0xb47 Normalization failed for string 'ତାଙ୍କ ହୃଦୟ ବିଭୋର ହୋଇଛି ସମ୍ୱେଦନଶୀଳତାରେ;' Invalid start of grapheme sequence:D=0xb71
I'm getting this error while training the tesseract ocr model for Oriya language , please help me to resolve this issue . I'm attaching the ground truth files .
Training on tesseract 4.1.1 : tesseract 4.1.1 leptonica-1.82.0
ocr_training.zip