Questions about DropNode

THUDM / GRAND

Source code and dataset of the NeurIPS 2020 paper "Graph Random Neural Network for Semi-Supervised Learning on Graphs"

MIT License



Questions:

The code does not scale the features after randomly dropping nodes during training, which is not consistent with the paper.

Should DropNode output the original, unscaled features during inference?

Should the scale factor be (1 - droprate) or 1 / (1 - droprate)?

Description of DropNode in paper:

Code of DropNode:
if training:
    masks = torch.bernoulli(1. - drop_rates).unsqueeze(1)
    features = masks.cuda() * features  # did not scale features after dropping nodes randomly
else:
    features = features * (1. - drop_rate)  # scaled features during inference

Hi, thanks for your interest! DropNode/dropout can be implemented in two ways: 1) scaling features by 1 / (1 - droprate) during training, or 2) scaling features by (1 - droprate) during inference. Our paper describes only the first method, but the code implements the second. Both methods are correct.
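To make the comparison concrete, here is a minimal sketch of both conventions in plain Python (standard library only, not the repository's PyTorch code). The helper names `drop_node` and `mean_training_output` and the `inverted` flag are hypothetical, introduced only for illustration. It estimates the average training-time output by sampling and compares it with the inference-time output under each convention.

```python
import random


def drop_node(features, drop_rate, training, inverted, rng=random):
    """Hypothetical helper: DropNode over a list of per-node feature rows.

    inverted=True  -> scale kept rows by 1 / (1 - drop_rate) during training
                      and return the features unchanged at inference.
    inverted=False -> no scaling during training, scale by (1 - drop_rate)
                      at inference (the convention in the code quoted above).
    """
    keep_prob = 1.0 - drop_rate
    if training:
        scale = 1.0 / keep_prob if inverted else 1.0
        out = []
        for row in features:
            # One Bernoulli(keep_prob) draw per node: the whole row is kept or zeroed.
            keep = 1.0 if rng.random() < keep_prob else 0.0
            out.append([keep * scale * x for x in row])
        return out
    scale = 1.0 if inverted else keep_prob
    return [[scale * x for x in row] for row in features]


def mean_training_output(features, drop_rate, inverted, trials, seed=0):
    """Monte Carlo estimate of the expected training-time output."""
    rng = random.Random(seed)
    total = [[0.0] * len(row) for row in features]
    for _ in range(trials):
        sample = drop_node(features, drop_rate, True, inverted, rng)
        for i, row in enumerate(sample):
            for j, x in enumerate(row):
                total[i][j] += x
    return [[x / trials for x in row] for row in total]


if __name__ == "__main__":
    features = [[1.0, 2.0], [3.0, 4.0]]
    for inverted in (True, False):
        estimate = mean_training_output(features, 0.5, inverted, trials=20000)
        inference = drop_node(features, 0.5, training=False, inverted=inverted)
        print("inverted" if inverted else "scale at inference", estimate, inference)
```

Under either convention the sampled training average should land close to the corresponding inference output, which is the sense in which both are consistent. The two conventions differ only by the constant factor (1 - droprate), applied in both phases.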

Questions about DropNode #7