Closed achyudh closed 5 years ago
Corresponding pull-request in Castor-Data: https://git.uwaterloo.ca/jimmylin/Castor-data/merge_requests/11
@achyudhk Regarding the data format, if you keep label text in dataset, you can use some function for conversion. I use this scripts format for myself. You could do similar thing:
def one_hot_representation(shape, dim, idx, value):
one_hot = torch.LongTensor(*shape).zero_().to(idx.device)
one_hot.scatter_(dim, idx, value)
return one_hot
@Impavidity Thanks, I'll change the existing datasets I pushed to Castor-data to this format.
Steps for LSTM_Regularzation :
@Impavidity I made the changes you requested. Please take a look at the diff.
[x] Pack padded sequences in LSTM_baseline
[x] Add TensorBoardX support for Reuters trainer
[x] Add Arxiv Academic Paper Dataset (AAPD)
[ ] SGM for multi-label classification (https://arxiv.org/abs/1806.04822)
[ ] Regularizing and optimizing LSTM_baseline (https://arxiv.org/abs/1708.02182)
Work in progress: Please don't merge until all of the tasks above are done.