Closed neubig closed 5 years ago
FYI: I've found out that the problem was because I'm using a 2080Ti, which fails when you use CUDA less than version 10. The environment suggested by TranX is using CUDA version 9. I started working on fixing this by fixing this issue https://github.com/pcyin/tranX/issues/10 and making a more modern environment, but the newest version of PyTorch doesn't work with tranX, and there are several places that need fixing. Will update when I finish.
Should be fixed by https://github.com/pcyin/tranX/pull/15
Hi, Prof. Neubig. I merged your PR onto my fork manually but there were still some issues for WikiSQL task. I made the following changes in the model/wikisql/parser.py file in order to make it work.
L 247 From
action_prob_var = torch.cat([torch.cat(action_probs_i).log().sum() for action_probs_i in action_probs])
->
action_prob_var = torch.stack([torch.stack(action_probs_i).log().sum() for action_probs_i in action_probs])
L 459 From
new_hyp_scores = torch.cat([x['new_hyp_score'] for x in new_hyp_meta])
->
new_hyp_scores = torch.stack([x['new_hyp_score'].cuda() for x in new_hyp_meta])
This is the same error that was reported by @gsh2014 in #4, but I figure it'd be better to have it as a separate issue. I'm running into the same problem:
I'm not sure if it's related but, I did find this: https://github.com/pytorch/pytorch/issues/953 That post seemed to indicate it might be an out-of-memory error, so I tried to reduce the batch size and size of the hidden dimensions, but this didn't change anything...
@pcyin: I was able to reproduce the error on
ogma
, so maybe you'd be able to as well?