Why is link prediction setting in this literature referred to as 'unsupervised'?

You raise a good point. In the common link prediction problem, we are given a partially observed graph with known links and non-links as the starting point. These are values of the data set, similar to pixel values of images. We use the values of the network to make predictions on unobserved entries in the network or adjacency matrix without requiring additional ground truth labels. In this sense, we are doing unsupervised link prediction by using only the input values of the adjacency matrix.

But your point is valid because we still need the ground truth link labels to make progress. A true unsupervised link prediction problem is where the adjacency matrix is completely unknown and the task is to predict the adjacency matrix from scratch. So what we are doing here is not entirely unsupervised link prediction, and you are correct.

vuptran / graph-representation-learning

Why is link prediction setting in this literature referred to as 'unsupervised'? #4