Closed elricwan closed 3 years ago
The problem comes from self.transformer = GPT2Model(config) self.lm_head = GPT2LMHead(self.transformer.wte.weight, config)
In gpt2, the self.transformer.wte.weight would be used twice.
How to use bagua with gpt2?
@liuhatry please help take a look
Currently,Bagua does not support duplicated tensors. We will develop this feature as soon as possible.
@elricwan We have fixed the problem. Please try master branch and let us know if there are any other issues :)
python3 -m pip install git+https://github.com/BaguaSys/bagua.git
It will be available in next release (0.7).
Thanks
I follow the code instruction and run my gpt2 model with bagua. 1 node 2 gpus. But I got this error. My code works on pure pytorch distribution environment. Here is the source code:
Here is the full error messege:
Can anyone help? thank you!