Open lingjieyi opened 2 years ago
also, line 90, why to use
data[b, -pred_terms:, :] = x[rand_num, -pred_terms:, :]
instead of
data[b, pred_terms:, :] = x[rand_num, pred_terms:, :]
Yes, it is a shallow copy but we only need the 'data', after line 90, the 'x' will not be used again, so I think it is ok.
about your second question, here we use '-pred_terms' to get the training data.
line 80 of the model.py file:
data = x is the show copy, then in the line 90, data[b, -pred_terms:, :] = x[rand_num, -pred_terms:, :], both x and data changes simultaneously
Is it still "draws a result from a distribution that excludes itself"