Closed: drunkinlove closed this 3 years ago
Hi,
Can you please ask this question on the forum rather than here? For example, this comment might help you already.
The authors of HuggingFace like to keep GitHub issues for bug reports and feature requests.
Thank you!
Thank you, closing this.
Hi! This is more of a question than a bug report. Can I use BertModel without any modifications to train a BERT-based siamese encoder?
(Not sure if this really is a BERT-specific question, but I will tag @LysandreJik just in case)
This is what my training step looks like:
Should this work? Most other examples of siamese models in PyTorch simply modify the forward pass to include the second input, but I don't see why gradients shouldn't accumulate properly in my case.
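For anyone landing here later, the idea in question (one shared encoder called twice, then a single loss over both outputs) can be sketched as below. This is not the training step from the original post. `TinyEncoder` is a hypothetical stand-in for a BERT-style encoder so the snippet runs without downloading weights, and the cosine embedding loss, the SGD optimizer, and the random inputs are assumptions chosen only for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)


class TinyEncoder(nn.Module):
    """Hypothetical stand-in for a BERT-style encoder: token ids -> one vector per sequence."""

    def __init__(self, vocab_size=100, hidden=16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.proj = nn.Linear(hidden, hidden)

    def forward(self, input_ids):
        # Mean-pool the token embeddings, then project to the output space.
        return self.proj(self.embed(input_ids).mean(dim=1))


encoder = TinyEncoder()
optimizer = torch.optim.SGD(encoder.parameters(), lr=0.1)


def training_step(ids_a, ids_b, labels):
    optimizer.zero_grad()
    # Two separate forward passes through the SAME module (shared weights).
    emb_a = encoder(ids_a)
    emb_b = encoder(ids_b)
    # labels are +1 for similar pairs and -1 for dissimilar pairs.
    loss = F.cosine_embedding_loss(emb_a, emb_b, labels)
    # One backward call sums the gradient contributions of both passes
    # into each shared parameter's .grad.
    loss.backward()
    optimizer.step()
    return loss.item()


# Made-up batch: 4 pairs of sequences, 8 token ids each.
ids_a = torch.randint(0, 100, (4, 8))
ids_b = torch.randint(0, 100, (4, 8))
labels = torch.tensor([1.0, -1.0, 1.0, -1.0])

loss_value = training_step(ids_a, ids_b, labels)
```

Autograd records both forward passes in the same graph, so calling the encoder twice should behave the same as a modified `forward` that takes both inputs. With a real `BertModel` you would presumably pass `input_ids` and `attention_mask` for each side and pool its output into one vector per sequence before computing the loss.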