harvardnlp / im2markup

Neural model for converting Image-to-Markup (by Yuntian Deng yuntiandeng.com)
https://im2markup.yuntiandeng.com
MIT License
1.21k stars 214 forks source link

not working for below type of images (other than given by you). I think we need to put images in particular format #31

Closed vyaslkv closed 4 years ago

vyaslkv commented 4 years ago

I tested on below type of images and it is giving results like (one diterminant & one matrix) \hspace { 0 . 5 c m } \hspace { 5 m m } determ matrix

vyaslkv commented 4 years ago

int for this also not working

giving results as \begin{array} { r c l } \end{array}

vyaslkv commented 4 years ago

I have even done the preprocessing step but still not getting results: onmt_preprocess -data_type img \ -src_dir data/im2text/images/ \ -train_src data/im2text/src-train.txt \ -train_tgt data/im2text/tgt-train.txt -valid_src data/im2text/src-val.txt \ -valid_tgt data/im2text/tgt-val.txt -save_data data/im2text/demo \ -tgt_seq_length 150 \ -tgt_words_min_frequency 2 \ -shard_size 500 \ -image_channel_size 1

vyaslkv commented 4 years ago

is there any way I could convert my images to your type of images

vyaslkv commented 4 years ago

If I need to retrain the model for my type of images then can you tell me the format it will be required

format of data to retrain the model using python

vyaslkv commented 4 years ago

is there any dataset which I could use with the python implementation to train the model and get the results on normal other images as well

vyaslkv commented 4 years ago

http://bit.ly/2E7H8tX

I found the above data but it is in different format can you help me how can I use this type of data for training

da03 commented 4 years ago

Sorry for the delay. Are you training a new model or are you using the pretrained one? The pretrained one is only trained on a particular rendering setup so it's not surprising that it fails on out-of-training-distribution data at test time. To make it work, a new model needs to be trained with data encompassing enough test-style images (font, size, etc).

vyaslkv commented 4 years ago

I used the pretrained model. Thanks for replying!!