Closed Erica-Yang closed 2 months ago
We replaced the MAX operation with a softmax operation in LOCA (lines 101-109) for more efficient training. You have two options: (1) train or use the original LOCA version and modify lines 338-348 in dave.py to match the original implementation in loca.py, or (2) retrain LOCA with the softmax operation, or use the weights base_3_shot.pth. Either way, the performance should be similar.
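To show why the two variants need matching weights, here is a toy sketch of a hard max reduction next to a softmax-weighted one. It is not the code from loca.py or dave.py: the function names, the `temperature` parameter, and the example scores are all hypothetical, and the real operation works on feature tensors rather than a short list.

```python
import math

def max_pool(scores):
    """Hard selection: only the largest score contributes to the output."""
    return max(scores)

def softmax_pool(scores, temperature=1.0):
    """Soft selection: softmax-weighted average, so every score contributes.

    `temperature` is a hypothetical knob for this sketch only.
    """
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp((s - m) / temperature) for s in scores]
    total = sum(exps)
    return sum(s * e / total for s, e in zip(scores, exps))

scores = [0.2, 1.5, 0.9]  # made-up values
hard = max_pool(scores)
soft = softmax_pool(scores)
sharp = softmax_pool(scores, temperature=0.01)  # approaches the hard max
```

With a max, only the winning entry receives gradient during training, while the softmax-weighted version spreads gradient over all entries. The two reductions give different values for the same input, so weights trained with one variant should be paired with code that uses the same variant.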
Thanks a lot for your reply!
Hi, how do I generate "base_3_shot.pth"? Or is it the same as "loca_few_shot.pth" in LOCA?