Open zxgx opened 4 years ago
I have run into the same issue, and I have not resolved it either. Did you manage to fix it?
@liuqingpu no
@liuqingpu No, it's difficult for me.
I have trouble reproducing the results as well. Did anyone make it work?
Exception: ('Found {} checkpoint files but need at least {}', 0, 1)
The error suggests that the script attempts inference, but there are no checkpoint files under the experiment directory. Have you managed to get training working?
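A quick way to confirm this before running inference is to list the checkpoint files in the experiment directory. The snippet below is a minimal sketch, assuming checkpoints are saved with the `checkpoint*.pt` naming that fairseq uses by default; the function name `find_checkpoints` and the directory argument are hypothetical and not part of this repository.

```python
from pathlib import Path


def find_checkpoints(exp_dir):
    """Return the sorted checkpoint files found directly under exp_dir.

    Assumes files are named checkpoint*.pt; adjust the pattern if the
    training run used a different naming scheme.
    """
    root = Path(exp_dir)
    if not root.is_dir():
        # A missing directory means training never wrote anything here.
        return []
    return sorted(root.glob("checkpoint*.pt"))


if __name__ == "__main__":
    import sys

    # Hypothetical usage: pass the experiment directory as the first argument.
    exp_dir = sys.argv[1] if len(sys.argv) > 1 else "."
    found = find_checkpoints(exp_dir)
    if not found:
        print(f"No checkpoints under {exp_dir!r}; training probably did not run or failed early.")
    else:
        for path in found:
            print(path.name)
```

If this prints no files, the exception above is expected, and the training log is the place to look for the earlier failure.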
I am also facing the same issue. The architecture dwnstack_merge2seq_node_iwslt_onvalue_base_upmean_mean_mlesubenc_allcross_hier
mentioned in README.md
is not being registered as a fairseq architecture. Before the error mentioned by OP, the code throws another error:
fairseq-train: error: argument --arch/-a: invalid choice: 'dwnstack_merge2seq_node_iwslt_onvalue_base_upmean_mean_mlesubenc_allcross_hier' (choose from 'fconv_lm',
'fconv_lm_dauphin_wikitext103', 'fconv_lm_dauphin_gbw', 'fconv', 'fconv_iwslt_de_en', 'fconv_wmt_en_ro', 'fconv_wmt_en_de',
'fconv_wmt_en_fr', 'fconv_self_att', 'fconv_self_att_wp', 'lightconv_lm', 'lightconv_lm_gbw', 'lightconv', 'lightconv_iwslt_de_en',
'lightconv_wmt_en_de', 'lightconv_wmt_en_de_big', 'lightconv_wmt_en_fr_big', 'lightconv_wmt_zh_en_big', 'lstm',
'lstm_wiseman_iwslt_de_en', 'lstm_luong_wmt_en_de', 'transformer_lm', 'transformer_lm_big', 'transformer_lm_wiki103',
'transformer_lm_gbw', 'transformer', 'transformer_iwslt_de_en', 'transformer_wmt_en_de',
'transformer_vaswani_wmt_en_de_big', 'transformer_vaswani_wmt_en_fr_big', 'transformer_wmt_en_de_big',
'transformer_wmt_en_de_big_t2t', 'multilingual_transformer', 'multilingual_transformer_iwslt_de_en')
The script then goes on to run inference even though no training took place. Since there is no valid checkpoint, it throws OP's error. Looking at https://github.com/nxphi47/tree_transformer/blob/master/src/models/nstack_archs.py#L615 will probably help. @nxphi47 Could you please help with this?
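For anyone unfamiliar with why an architecture name can be missing from the `--arch` choices: decorator-based registries only learn about a name when the module that defines it is actually imported. The toy example below is a simplified imitation of that pattern, not fairseq's code; `ARCH_REGISTRY`, `register_architecture`, and `my_custom_arch` are hypothetical names used only for illustration. One possible cause here, which I have not verified, is that the repository's module is never imported (for example, if the custom code directory is not passed to fairseq via `--user-dir`).

```python
# Simplified imitation of a decorator-based architecture registry.
# This is NOT fairseq's implementation; all names are hypothetical.
ARCH_REGISTRY = {}


def register_architecture(name):
    """Return a decorator that records fn under the given architecture name."""
    def wrapper(fn):
        if name in ARCH_REGISTRY:
            raise ValueError(f"duplicate architecture: {name}")
        ARCH_REGISTRY[name] = fn
        return fn
    return wrapper


def is_registered(name):
    return name in ARCH_REGISTRY


# Before the defining code has run, the name is unknown to the registry.
before = is_registered("my_custom_arch")


# Registration is a side effect of executing (importing) this definition.
@register_architecture("my_custom_arch")
def my_custom_arch(args):
    # Fill in a default only if the caller did not set one.
    args.hidden = getattr(args, "hidden", 512)
    return args


after = is_registered("my_custom_arch")
print(before, after)  # False True
```

If the same thing is happening here, the fix would be to make sure the module containing the registration is imported before the argument parser builds its list of choices.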
I have the same problem. Did anyone solve it?
Hi, I'm trying to reproduce the results by following the instructions in README.md. After a long data preprocessing step, I encountered the following exception:
I suppose that some checkpoint files that should be generated during training are missing. Could you please tell me how I can work this out?