Open susht3 opened 5 years ago
I'm sorry but the bidaf++.json can't be used. If u wanna run a simple model , please use the bidaf_elmo.json but it's out of memory recently too bui i think it can be solved by reduce the batch_size
I'm working on coqa now and btw can we speak Chinese? :)
I'm working on coqa now and btw can we speak Chinese? :) 哈哈哈你现在还爆内存吗,可能是因为elmo
I'm working on coqa now and btw can we speak Chinese? :) 哈哈哈你现在还爆内存吗,可能是因为elmo
我把elmo去了 加了lazy读数据 然后还去了两层lstm还是不行 我现在打算全注释掉然后一层一层加上去测试orz
我把elmo去了 加了lazy读数据 然后还去了两层lstm还是不行 我现在打算全注释掉然后一层一层加上去测试orz
我知道为啥了!allennlp可能是每次都把所有的数据加载到内存里,就会爆内存,我自己试过改小了训练集,内存占用就小很多很多。这可能是allennlp的封装问题,不知道有没有办法可以解决。我邮箱在我主页上,你发邮件我们交流一下吧。
我运行bidaf_elmo预处理完后停在0/4380这里好长时间,然后显示Killed,这是因为什么? 监视显存,好像一千多,没有变化。 我是设置的batch=20,10G显存,其他的参数基本没变。
可能是因为elmo的模型数据挂在s3上,不好下载,建议下载到本地,把url也改成本地的elmo的文件再试试
elmo两个文件都是放在本地的。会不会batch大了?表示没有其他想法了😲
batch size建议开个8试一下
谢谢,我过几天试一下
Hi, I want to try CoQA with AllenNLP. I search CoQA + AllenNLP in github and come here. What is the current status of this repo now? Can you reproduce the bidaf++ results on CoQA? and have you solve the memory issue now? Thanks!
Hi, I want to try CoQA with AllenNLP. I search CoQA + AllenNLP in github and come here. What is the current status of this repo now? Can you reproduce the bidaf++ results on CoQA? and have you solve the memory issue now? Thanks!
Hi,currently the repo haven't been updated for a long time because this is just the baseline. I can reproduct the results of BiDAF++ on CoQA but the model is slightly different. I just concat previous questions and answers before current question instead using the answer position embedding. The out of memory issue is because of large batch size(32) while using ELMo. Besides, while working on CoQA, I didn't use this model but a modified version of dialog_qa instead.
as what you say, bidaf++.json and reader is for coqa, but i find that the dataset of bidaf++ json file is coqa-squad, not coqa.
do you work on coqa now?