Open Keneyr opened 5 years ago
Thanks for attention. The 'data' directory stores the unary features extracted from strokes and ground truth(.label). The 'edge' directory stores the pairwise features and relationship between strokes(.edge). The raw dataset is called IAMonDo dataset which is organized in XML form and the preprocess process is a little bit complicated. I will try to fix this issue soon.
Hi @maplefallfrost , very interested on how you preprocess the inkml data from IAMonDo to the uploaded data. It would be nice if you can upload the code or send me the paper so that I learn more from it.
Hi @maplefallfrost , very interested on how you preprocess the inkml data from IAMonDo to the uploaded data. It would be nice if you can upload the code or send me the paper so that I learn more from it.
Thank you for your interest and sorry for the late reply. My feature extraction method follows the same procedure as in my previous paper https://ieeexplore.ieee.org/document/7900138 and also similar in these two papers: https://www.sciencedirect.com/science/article/pii/S0031320313001878 http://web.tuat.ac.jp/~nakagawa/pub/2015/pdf/Truyen_et_al_Pattern_Recognition.pdf Recently I am struggling for my graduation and maybe I have no much time to reorganize my C++ code. I will try to release this part as soon as possible and the time might be at the beginning of October, soon after ICDAR2019.
Thankyou very much for share! I had train your model by using features, which extracted by my code. The result is lower than using your features data, so could you share the code of feature extraction When you are convenient.
Thankyou very much for share! I had train your model by using features, which extracted by my code. The result is lower than using your features data, so could you share the code of feature extraction When you are convenient.
I feel very sorry that I still don't have time to release my code for feature extraction as I am still preparing for graduation now. So how is your result? Is there a serious drop with accuracy? I think the accuracy is accepted if it is > 98.4
Thankyou very much for your reply! On the IAM database, I got 95.498%. And for Diagram-FA, the accuracy is 98.426%, for Diagram-FC, the accyracy is 97.431%. I trained the three model respectively!I have tried different data preprocessing such as resampling and normalization, which have a little influence on the performance(about 0.8% one IAM). So Maybe we should check the feature extraction section?
Thankyou very much for your reply! On the IAM database, I got 95.498%. And for Diagram-FA, the accuracy is 98.426%, for Diagram-FC, the accuracy is 97.431%. I trained the three models respectively!I have tried different data preprocessing such as resampling and normalization, which have a little influence on the performance(about 0.8% one IAM). So maybe we should check the feature extraction section?
Is it the text/non-text classification accuracy mentioned here? Or the general multi-class accuracy?
Thankyou very much for your reply! On the IAM database, I got 95.498%. And for Diagram-FA, the accuracy is 98.426%, for Diagram-FC, the accuracy is 97.431%. I trained the three models respectively!I have tried different data preprocessing such as resampling and normalization, which have a little influence on the performance(about 0.8% one IAM). So maybe we should check the feature extraction section?
Is it the text/non-text classification accuracy mentioned here? Or the general multi-class accuracy?
yes, text/non-text classification !
Thankyou very much for your reply! On the IAM database, I got 95.498%. And for Diagram-FA, the accuracy is 98.426%, for Diagram-FC, the accuracy is 97.431%. I trained the three models respectively!I have tried different data preprocessing such as resampling and normalization, which have a little influence on the performance(about 0.8% one IAM). So maybe we should check the feature extraction section?
Is it the text/non-text classification accuracy mentioned here? Or the general multi-class accuracy?
yes, text/non-text classification !
Ok, that's a little bit weird as I think it is not hard for any structured prediction model to get over 97% accuracy on the IAMOnDo dataset. I need to finish my thesis this month so maybe I can upload my code in the next month. Sorry for the inconvience= =
Hi @maplefallfrost , can you share the code of feature extraction When you are convenient ?
Thankyou very much for share! I had train your model by using features, which extracted by my code. The result is lower than using your features data, so could you share the code of feature extraction When you are convenient.
Hi, @sheirving , can you share your code of feature extraction?
Any update on this? @sheirving @maplefallfrost Can you share the feature extraction module?
What's the meaning of the folders 'data' and 'edge'? How did you get the files such as 0.label2,0.stroke_feature???? Can you upload the raw dataset ?