Japanese reference image used to create language training data is not recognized correctly

What steps will reproduce the problem?
1. Run tesseract on the attached image (uni.Arial_Unicode_MS.exp0.png) to produce OCR output.
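The report does not give the exact command line, so the following is only a sketch of how the step above might be run. It assumes the language code is `uni` (inferred from the attachment names), that the `tesseract` executable is on PATH, and that `uni.traineddata` has already been copied into the tessdata directory tesseract searches. The helper names `build_tesseract_command` and `run_ocr` are hypothetical.

```python
import subprocess


def build_tesseract_command(image, output_base, lang):
    """Return the argument list for: tesseract <image> <output_base> -l <lang>."""
    return ["tesseract", image, output_base, "-l", lang]


def run_ocr(image, output_base, lang):
    """Run tesseract; the recognized text is written to <output_base>.txt."""
    # Assumption: tesseract is on PATH and <lang>.traineddata is installed.
    subprocess.run(build_tesseract_command(image, output_base, lang), check=True)
```

Calling `run_ocr("uni.Arial_Unicode_MS.exp0.png", "uniout", "uni")` would then write `uniout.txt`, and passing `"jpn"` instead of `"uni"` would run the same image through the default Japanese data for comparison.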

What is the expected output? What do you see instead?
All characters in the image file should be recognized correctly, since this 
image was used to create the traineddata. Instead, the output file contains many mistakes.
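To put a number on "many mistakes", the OCR output can be compared against the characters listed in the box file. The sketch below is not part of the original report. It assumes the box file uses the usual Tesseract 3.x line layout (`symbol left bottom right top page`) and that both files are UTF-8 encoded. All function names are hypothetical.

```python
import difflib


def box_text(box_lines):
    """Concatenate the symbols from box-file lines (symbol + 5 numeric fields)."""
    symbols = []
    for line in box_lines:
        parts = line.rstrip("\r\n").rsplit(" ", 5)
        if len(parts) == 6:  # skip blank or malformed lines
            symbols.append(parts[0])
    return "".join(symbols)


def strip_whitespace(text):
    """Remove all whitespace so line breaks in the OCR output are ignored."""
    return "".join(text.split())


def similarity(expected, actual):
    """Return a ratio in [0, 1]; 1.0 means the two strings are identical."""
    return difflib.SequenceMatcher(None, expected, actual, autojunk=False).ratio()


def mismatches(expected, actual):
    """List (operation, expected_part, actual_part) for every differing span."""
    matcher = difflib.SequenceMatcher(None, expected, actual, autojunk=False)
    return [
        (tag, expected[i1:i2], actual[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


def compare_files(box_path, ocr_path):
    """Compare a box file with an OCR output file; return (ratio, differences)."""
    with open(box_path, encoding="utf-8") as f:
        expected = box_text(f)
    with open(ocr_path, encoding="utf-8") as f:
        actual = strip_whitespace(f.read())
    return similarity(expected, actual), mismatches(expected, actual)
```

For the attachments in this report, `compare_files("uni.Arial_Unicode_MS.exp0.box", "uniout.txt")` would return the overall similarity and the list of misrecognized spans. Running the same comparison on output produced with the default `jpn` data would show which traineddata does better on this image.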

What version of the product are you using? On what operating system?
Tesseract 3.02 on Windows XP

Please provide any additional information below.

In some cases, the default Japanese (jpn) traineddata produces better output 
than the traineddata I created.

Original issue reported on code.google.com by sivakuma...@gmail.com on 30 Jan 2015 at 8:34

Attachments:

uni.Arial_Unicode_MS.exp0.box
uni.Arial_Unicode_MS.exp0.png
uni.traineddata
uniout.txt

kcobra / tesseract-ocr, issue #1414