No Positive Samples in MolPCBA Test Set Assay #45

snap-stanford / ogb

Benchmark datasets, data loaders, and evaluators for graph machine learning

MIT License

1.93k stars, 397 forks

Minimal reproducible example:

import torch
from ogb.graphproppred import PygGraphPropPredDataset

dataset = PygGraphPropPredDataset(name='ogbg-molpcba')
split_idx = dataset.get_idx_split()

# Sum the test-set labels per task, treating missing (NaN) labels as 0.
label_sums = sum(torch.nan_to_num(point.y) for point in dataset[split_idx['test']])
# Number of positive labels for the task at index 45.
print(label_sums[0, 45])

This prints 0, meaning the test split contains no positive examples for this endpoint (task index 45). How should the overall average precision be calculated in this case? Average precision is undefined for a task with zero positive samples.
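
One possible way to handle this, sketched below, is to average AP only over the tasks where it is defined, i.e. tasks whose labeled test samples contain at least one positive and one negative. This is an illustrative convention, not something confirmed in this issue as the benchmark's official behavior. The helper names `average_precision` and `mean_ap_over_valid_tasks` are hypothetical, the inputs are plain Python lists with NaN marking a missing label, and the simple AP routine does not treat tied scores specially.

```python
import math


def average_precision(labels, scores):
    """AP for one task: mean of precision@k over the ranks k of the positives.

    Simplified sketch: tied scores are ranked in arbitrary (stable sort) order.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    n_pos = sum(1 for _, y in ranked if y == 1)
    if n_pos == 0:
        raise ValueError("AP is undefined without positive labels")
    hits, total = 0, 0.0
    for k, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / k  # precision at rank k
    return total / n_pos


def mean_ap_over_valid_tasks(y_true, y_pred):
    """Average AP over tasks that have both a positive and a negative label.

    y_true, y_pred: lists of rows (one per molecule), one column per task.
    NaN entries in y_true are treated as unlabeled and ignored.
    Returns (mean AP, number of tasks that were averaged).
    """
    n_tasks = len(y_true[0])
    per_task = []
    for t in range(n_tasks):
        pairs = [
            (row_true[t], row_pred[t])
            for row_true, row_pred in zip(y_true, y_pred)
            if not math.isnan(row_true[t])
        ]
        labels = [y for y, _ in pairs]
        if 1 not in labels or 0 not in labels:
            continue  # AP undefined (or uninformative) for this task: skip it
        per_task.append(average_precision(labels, [s for _, s in pairs]))
    if not per_task:
        raise ValueError("no task has both positive and negative labels")
    return sum(per_task) / len(per_task), len(per_task)


if __name__ == "__main__":
    nan = float("nan")
    # Toy data: 3 molecules, 3 tasks; the middle task has no positives.
    y_true = [[1, 0, nan], [0, 0, 1], [1, 0, 0]]
    y_pred = [[0.4, 0.2, 0.5], [0.9, 0.7, 0.8], [0.5, 0.3, 0.1]]
    print(mean_ap_over_valid_tasks(y_true, y_pred))
```

With this convention a task like index 45 simply drops out of the mean, so the reported score is an average over fewer tasks than the dataset defines; reporting the number of tasks actually averaged alongside the score makes that explicit.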

No Positive Samples in MolPCBA Test Set Assay #45 #269