stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
MIT License

about the ranking operation #281

Closed. Hannibal046 closed this 6 months ago

Hannibal046 commented 6 months ago

Hi @okhat, thanks for the great repo! I have a hard time understanding the reranking code, especially this section: https://github.com/stanford-futuredata/ColBERT/blob/706a7265b06c6b8de1e3236294394e5ada92134e/colbert/ranking/index_ranker.py#L56C7-L112

I have searched the relevant GitHub issues and understand from this issue that the code is written this way for efficiency.

But is there documentation somewhere that would help me understand it better? It seems the code first turns the 2D embeddings into 3D embeddings with different strides (108 and 180) for matrix multiplication. But I don't get why we need this stride parameter. Why couldn't we just do something like the following (a rough sketch follows the list)?

  1. load all embeddings and the corresponding doclens
  2. gather the embeddings of each passage based on the pids
  3. pad them to the same length for matrix multiplication
  4. apply the MaxSim operation and select the top-k
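For concreteness, here is a minimal PyTorch sketch of that naive approach. The names (`naive_rerank`, `all_embeddings`, `doclens`, `pids`) are hypothetical, not from the repo; this is just what steps 1-4 above would look like, not how ColBERT actually implements reranking:

```python
import torch

def naive_rerank(Q, all_embeddings, doclens, pids, k=10):
    # Q: (num_query_tokens, dim) query embeddings
    # all_embeddings: (total_doc_tokens, dim) flat 2D document token embeddings
    # doclens: per-passage token counts; pids: candidate passage ids
    offsets = torch.cumsum(torch.tensor([0] + list(doclens)), dim=0)

    # Steps 1-2: gather the embeddings of each candidate passage.
    docs = [all_embeddings[offsets[pid]:offsets[pid + 1]] for pid in pids]

    # Step 3: pad every passage to the same length -> (num_pids, max_len, dim).
    D = torch.nn.utils.rnn.pad_sequence(docs, batch_first=True)
    lengths = torch.tensor([d.size(0) for d in docs])
    mask = torch.arange(D.size(1))[None, :] < lengths[:, None]   # real vs. padded tokens

    # Step 4: MaxSim (max over doc tokens, sum over query tokens), then top-k.
    scores = (D @ Q.T).masked_fill(~mask.unsqueeze(-1), -1e4)
    scores = scores.max(dim=1).values.sum(dim=-1)                # (num_pids,)
    topk_scores, topk_idx = scores.topk(min(k, len(pids)))
    return [pids[i] for i in topk_idx.tolist()], topk_scores
```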
Hannibal046 commented 6 months ago

After digging into the code, I think I got the logic behind it. The stride is meant to partition the documents into buckets based on their length (if I understand correctly). In the actual implementation of ColBERTv1, the documents are split into two buckets: one for documents whose length is at most the 90th-percentile length of the collection (108 tokens here), and the other (with stride 180) for the rest.
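As a rough illustration of where a value like 108 could come from (the variable names and the toy lengths below are made up, not the repo's):

```python
import torch

# Illustrative per-passage token counts (a stand-in for the collection's real doclens).
doclens = torch.tensor([60.0, 70.0, 88.0, 95.0, 102.0, 108.0, 130.0, 180.0])

stride_short = int(torch.quantile(doclens, 0.9).ceil())  # ~90th-percentile length (108 in the real index)
stride_long = int(doclens.max())                         # maximum length (180 in the real index)

# Passages no longer than stride_short land in the "short" bucket and are padded
# to that width; the remaining ~10% land in the "long" bucket, padded to stride_long.
```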

Partitioning the documents this way saves FLOPs in the matching step: `scores = (D @ group_Q) * mask.unsqueeze(-1)`
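Put together, the per-bucket scoring is roughly the following (a sketch, assuming `D` is the padded `(num_docs, stride, dim)` tensor for one bucket, `group_Q` the `(dim, num_query_tokens)` query matrix, and `mask` marks real vs. padded positions; not the exact code in `index_ranker.py`):

```python
import torch

def bucket_maxsim(D, group_Q, mask):
    # D: (num_docs, stride, dim), group_Q: (dim, num_q), mask: (num_docs, stride)
    scores = (D @ group_Q) * mask.unsqueeze(-1)   # zero out the padded token positions
    # MaxSim: best document token per query token, summed over the query tokens.
    return scores.max(dim=1).values.sum(dim=-1)   # (num_docs,)
```

Since the short bucket is padded only to 108 instead of 180, roughly 90% of the documents go through a noticeably smaller matrix multiplication.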

BTW, for anyone who is also curious about the stride operation, I hope these help: [attached diagrams illustrating the stride/bucketing operation]