What You Get Is What You See: A Visual Markup Decompiler
Yuntian Deng, Anssi Kanervisto, Alexander Rush
https://arxiv.org/pdf/1609.04938.pdf
Note that the model is slightly different.
I also recommend reading this paper:
Robust Scene Text Recognition with Automatic Rectification
Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai
https://arxiv.org/abs/1603.03915
I have a more recent version of this model here:
Note that the model is slightly different.
I also recommend reading this paper: