nomic-ai / contrastors

Train Models Contrastively in PyTorch
Apache License 2.0

Use of negatives during training #48

Closed · gangiswag closed this 4 days ago

gangiswag commented 4 days ago

I'm looking at the code, and it seems like the per-query negatives are never used when calculating the loss: https://github.com/nomic-ai/contrastors/blob/e326624a4fb531fad15d099d1d310547a62d275d/src/contrastors/trainers/text_text.py#L194C13-L194C29

So the only negatives used in the contrastive loss are the in-batch negatives?
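
For context, here's a minimal sketch of the in-batch-negatives-only setup I mean (the function name and temperature are illustrative, not the repo's actual code):

```python
import torch
import torch.nn.functional as F

def in_batch_contrastive_loss(query_emb: torch.Tensor,
                              doc_emb: torch.Tensor,
                              temperature: float = 0.05) -> torch.Tensor:
    # query_emb, doc_emb: (batch_size, dim), assumed L2-normalized.
    # Each query's positive is its paired document; every other document
    # in the batch serves as a negative.
    logits = query_emb @ doc_emb.T / temperature  # (B, B) similarity matrix
    labels = torch.arange(query_emb.size(0), device=query_emb.device)
    return F.cross_entropy(logits, labels)
```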

zanussbaum commented 4 days ago

for weakly supervised contrastive pretraining, yes, we only use in-batch negatives. for fine-tuning we do use hard negatives. the link above is old code; instead, the negatives get added alongside the documents: https://github.com/nomic-ai/contrastors/blob/main/src/contrastors/dataset/text_text_loader.py#L419

there’s a bunch of bad indirection here that should be cleaned up eventually.
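
roughly, once the dataloader concatenates the hard negatives after the paired documents, the loss is still a single cross-entropy over the similarity matrix. something like this (a sketch only; the layout and temperature here are assumptions, not exactly what text_text_loader.py produces):

```python
import torch
import torch.nn.functional as F

def loss_with_hard_negatives(query_emb: torch.Tensor,
                             doc_emb: torch.Tensor,
                             temperature: float = 0.05) -> torch.Tensor:
    # query_emb: (B, dim)
    # doc_emb: (B + B*k, dim) -- first B rows are the paired positives,
    # remaining B*k rows are the hard negatives appended by the dataloader.
    # The positive for query i is still row i, so plain cross-entropy works:
    # each query scores against its positive, the in-batch negatives, AND
    # all hard negatives in one softmax.
    logits = query_emb @ doc_emb.T / temperature  # (B, B + B*k)
    labels = torch.arange(query_emb.size(0), device=query_emb.device)
    return F.cross_entropy(logits, labels)
```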

gangiswag commented 4 days ago

Ah, I see, so you add them in the dataloader itself. Thanks for confirming!