Open drumilT opened 4 years ago
I tried this but creates a subsequent error as a list has no attribute .cpu() , furthermore if you iterate over the list and set each to element to its cpu() return value, it leads to another error related to max pooling
I have tested scripts/yelp/train_yelp.sh
with beam_size=2
without errors in the eval steps. Can you post your running log?
Traceback (most recent call last):
File "src/main.py", line 787, in
I get the following error at beam sizes higher than 1 while training the model, during eval steps
TypeError: can't convert CUDA tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first.
at line https://github.com/cindyxinyiwang/deep-latent-sequence-model/blob/9d55aa02207a028b24439ee73ad60e339f376fda/src/model.py#L678